Page 109 - ASBIRES-2017_Preceedings
P. 109

th
                      Proceedings of the 9  Symposium on Applied Science, Business & Industrial Research – 2017
                      ISSN 2279-1558, ISBN 978-955-7442-09-9

                            Hadoop Based Graph Analytics and Data Analytics Tools on Massive
                                                      Open Online Courses

                                                     Alagalla AWHP, Jayakody JRKC
                                 Department of Computing &Information System, Wayamba University of Sri Lanka
                                                        heshanalagalla@gmail.com

                                                             ABSTRACT

                             Massive Open Online Course (MOOC) is one of the famous online courses in the
                      current  educational  systems.  Most  of  the  universities  are  using  MOOC’s  all  over  the
                      world.  Therefore,  students  follow  MOOC  courses  to  study  subjects  to  enhance  their
                      knowledge, skills in their chosen field. Even though there is a huge student’s base for
                      MOOC courses there are prevailing problems as well. They are less percentage of course
                      completion  rate  and  difficulty  to  follow  the  course  contents  as  scheduled  by  the
                      universities.  Currently,  it  is  very  hard  to  find  analytics  tool  to  analyze  students’  data
                      which is generated due to the usage of numerous MOOC’s activities such as assignment
                      submission, group activities and forum post etc. to identify the students’ failure rates.
                      Therefore,  this  research  mainly  focused  on  developing  a  tool  to  analyze  MOOC  data
                      effectively.  MOOC  data  set  of  Harvard/MIT,  which  includes  600000  records  and  36
                      attributes, were  selected  for  this research.  Graph  analytics  was  implemented with  the
                      Hadoop  framework  and  hue  was  used  for  the  graph  visualization.  Multi-dimensional
                      cubes  were  created  to  analyze  the  data.  Numbers  of  graph  patterns  were  realized  to
                      identify the students based on region, subject and duration of the courses. Data mining
                      technique  was  used  to  develop  a  machine  learning  model  to  predict  the  pass  rate
                      accuracy.  Logistic  regression  model  was  given  the  highest  accuracy  with  96.08.  The
                      research  output  of  Graph  analytics  and  data  analytics  patterns  would  be  utilized  in
                      future for effective functioning of MOOC systems.

                      KEYWORDS:  Big  data  analysis,  Machine  learning,  MOOC’s  (Massive  open  online
                      courses)

                                                                       sense,  MOOCs  are  an  incredibly  valuable
                                1 INTRODUCTION
                                                                       in  addition  to  educational  provision.
                             MOOC is an online course aimed at         MOOCs can be useful in opening up access
                      unlimited  participation  and  Open  Access      to  high-quality  content,  particularly  in
                      through the web.  In addition to traditional     developing  countries,  but  successful
                      course  materials  such  as  lectures,  lectures   replication  and  substantial  investment  in
                      notes  and  problem  sets,  many  MOOCs          local  support  and  partnerships  will  be
                      provide interactive user forums to support       needed.    MOOCs      are    valuable   in
                      community  interactions  among  students,        developing  basic  conceptual  learning,  and
                      teachers,  and  teaching  assistants.  MOOCs     in  creating  large  online  communities  of
                      are  a  recent  and  widely  researched          interest or practice.
                      development in distance learning that was               Further,  MOOCs  are  an  extremely
                      first introduced in 2008 and emerged as a        valuable  form  of  lifelong  learning  and
                      popular  mode  of  learning.  MOOCs,             continuing  education.  Due  to  the plentiful
                      particularly xMOOCs, deliver high quality        advantages which are mentioned above, the
                      content  from  some  of  the  world’s  best      students’ base of MOOC is increasing day
                      universities  for  free  to  anyone  with  a     by day which causes several challenges to
                      computer and an internet connection. This        MOOC administrators of universities. Such
                      is  an  amazing  value  proposition.  In  this



                                                                    99
   104   105   106   107   108   109   110   111   112   113   114