Page 112 - ASBIRES-2017_Preceedings
P. 112

Alagalla & Jayakody



















                                          Figure 1: Results by registered data

                                                                Table  1  displays  the  results
                      4 RESULTS                          according  to the  number  of days  activated
               As  mentioned  previously  MOOC           by  the  students.  Status  of  the  course
       data analytics consists of two sections such      completion was selected as the class type.
       as graph Analysis and data Analysis. Hive
       and  hbase  were  used  to  get  more  results            Following  table  1  illustrates  the
       using  querying  the  data  from  dataset.        accuracy  of  the  classification  model  by
       Accuracy  of  the  output  results  were  taken   changing  the  training  data  set  portion  and
       by  analyzing  number  of  days  activated,       the classification algorithm.
       number of chapters read, number of video
       tutorial were used by students.                       Table 1: Results by number of days
                                                                          activated
       4.1 Graph Analysis

               According  to  Figure  2,  it  displays
       the  registered  user’s  behavior  to  their                         60%   training   70%   training   80%   training
       educational level such as bachelors, masters
       and PhD. Further, it can be used to compare       MultinomialNB
       different  educational  level  with  number  of                   0.96    0.96      0.96
       days activated. Further, this can be used to      BernouliNB      96.67  96.67      96.67
       identify the types of people mostly used the      SGDClassifier
       MOOC’s  As  a  result;  administrators  can                       95.67  95.68      81.45
       create another process to motivate them to        SVC_classifier   95.68  95.68     95.79
       complete the course.
                                                         LinearSVC_cla   95.66  95.68      95.82
                Figure 3 illustrates the variation of    ssifier
       registering  students  on  MOOC’s  by  time       MNB_classfier   95.67  95.68      92.16
       frame. This graph can be used to get a clear      Logistic        95.67  95.67      96.07
       idea  about  the  student’s  registration  time   Regression
       frames.                                                  Table  2  displays  the  results
       4.2 Data Analysis Results                         according to the number of video tutorials
                                                         watched  by  the  student  for  number  of
               Data  mining  techniques  were
       applied  to  generate  the  best  classification   machine learning algorithm
       model.




                                                      102
   107   108   109   110   111   112   113   114   115   116   117