Page 110 - ASBIRES-2017_Preceedings
P. 110
Alagalla & Jayakody
as less completion rate of courses and Joksimovic and Siemens (2014) examined
difficulty to follow courses schedules by numerous problems associated with
students. There are no such tools available MOOC. The data collection and data
to analyze the MOOC data at the beginning analysis methods of MOOC research are
stage of the courses to identify the root areas that are poorly investigated in prior
causes of the above mentioned problem. literature.
Therefore, this research focused on
developing an efficient solution by using a 3 METHODOLOGY
dataset which consists of 600000 records The research data set was mainly
from the Harvard/MIT MOOC’s system.
taken from Harvard and MIT MOOC’s
The main focus of this research is system. It consists of 600000 records in the
to analyze the root causes of students’ dataset. The research methodology was
failures by data analysis of MOOC’s broken into two sections such as graph
courses through the Graph analysis and
Data analysis. Graph analysis was analysis and data analysis.
implemented to analyze more than two
data attributes at once accurately by
Hadoop big data framework and its
packages. Moreover, Data analysis was
implemented to get the accuracy of the
MOOC’s results by Python and its machine
learning packages.
As a result output, this system gave
the most affected attribute and its
probability of influence to the student
results. In addition, this system can be used
to improve the MOOC’s system to be a Figure 1: Architectural diagram of system
productive tool for student centric courses
and improve student’s results. 3.1 Graph Analysis
2 LITERATURE REVIEW Graph analysis was designed
through the Hadoop framework. Dataset
2.1 Impacts of MOOCs on Education consists of numerous attributes to analyze
such as number of days activated users,
Impact of MOOCs on education
system is discussed more broadly. number of videos seen by users, number of
discussions gone through by forums,
Specifically it focuses on the disruptive number of chapters covered, number of
potential of MOOCs, and so a large portion events gone through by users etc. Hadoop
of the scholar’s review is relevant to this framework used to analyze data by graphs.
issue. For instance, she identifies Backend of the methodology was
characteristics of some MOOCs such as developed with hive, hBase and pig.
their size, automation in grading, and their (Phaneendra & Reddy, 2013, Mridul,
openness (particularly with regard to Khajuria, Dutta, Kumar, 2014, Gasevic,
cMOOCs) as factors with the potential to Kovanovic, Joksimovic & Siemens, 2014).
affect approaches to teaching and learning. Map reduce process was used by the
The size and openness of MOOCs are also Hadoop framework to generate the
highlighted by Kennedy (2014) in terms of graphs.Map reduce process
their potential to “[disrupt] conventional
thinking about the role, value, and cost of 1. Prepare the Map() input – the "Map
higher education” Gasevic, Kovanovic, Reduce system" designates Map
100