Large-scale e-learning recommender system based on Spark and Hadoop

Karim Dahdouh; Ahmed Dakkak; Lahcen Oughdir; Abdelali Ibriz

doi:10.1186/s40537-019-0169-4

Journal of Big Data (Jan 2019)

Large-scale e-learning recommender system based on Spark and Hadoop

Karim Dahdouh,
Ahmed Dakkak,
Lahcen Oughdir,
Abdelali Ibriz

Affiliations

Karim Dahdouh: Engineering Sciences Laboratory, FPT, Sidi Mohamed Ben Abdellah University
Ahmed Dakkak: Engineering Sciences Laboratory, FPT, Sidi Mohamed Ben Abdellah University
Lahcen Oughdir: Engineering Sciences Laboratory, FPT, Sidi Mohamed Ben Abdellah University
Abdelali Ibriz: High School of Technology, Sidi Mohamed Ben Abdellah University

DOI: https://doi.org/10.1186/s40537-019-0169-4
Journal volume & issue: Vol. 6, no. 1
pp. 1 – 23

Abstract

Read online

Abstract The present work is a part of the ESTenLigne project which is the result of several years of experience for developing e-learning in Sidi Mohamed Ben Abdellah University through the implementation of open, online and adaptive learning environment. However, this platform faces many challenges, such as the increasing amount of data, the diversity of pedagogical resources and a large number of learners that makes harder to find what the learners are really looking for. Furthermore, most of the students in this platform are new graduates who have just come to integrate higher education and who need a system to help them to take the relevant courses that take into account the requirements and needs of each learner. In this article, we develop a distributed courses recommender system for the e-learning platform. It aims to discover relationships between student’s activities using association rules method in order to help the student to choose the most appropriate learning materials. We also focus on the analysis of past historical data of the courses enrollments or log data. The article discusses particularly the frequent itemsets concept to determine the interesting rules in the transaction database. Then, we use the extracted rules to find the catalog of more suitable courses according to the learner’s behaviors and preferences. Next, we deploy our recommender system using big data technologies and techniques. Especially, we implement parallel FP-growth algorithm provided by Spark Framework and Hadoop ecosystem. The experimental results show the effectiveness and scalability of the proposed system. Finally, we evaluate the performance of Spark MLlib library compared to traditional machine learning tools including Weka and R.

Published in Journal of Big Data

ISSN: 2196-1115 (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://journalofbigdata.springeropen.com

About the journal

Abstract

Keywords