Skip to main content

Table 1 Comparative analysis

From: Educational data mining: prediction of students' academic performance using machine learning algorithms

References Variables Objectives Level Dataset Algorithms Accuracy
Min Max
Asif et al. (2017) The marks for all the courses that are taught in the four years of the degree programme Predicting students' performance Undergraduate students 210 DT, 1-NN, NB, NN, RF NN (62.50%) NB (83.65%)
Cruz-Jesus et al. (2020) Year of the study cycle, gender, age, number of enrolled years in high school, scholarship, internet access, class size, school size, economic level, population density, number of unit courses attended Predicting students' performance High schools students 110627 ANN, DT, ET, RF, SVM, kNN, LR LR (81.1%) SVM (51.2%)
Fernandes et al. (2019) Class with persons with special needs, Classroom usage environment, Gender, age (mean), Student benefit, city, neighbourhood, Student with special needs, Grade (mean), Absence (mean) Predict academic outcomes of student performance High schools students Dataset1:19000
Dataset2:19834
Gradient Boosting Machine 89.5% 91.9%
Hoffait and Schyns (2017) Gender, Nationality, Studies, Prior schooling, math, scholarship, success Predicting students at high risk of failure secondary school students 2244 RF, LR, ANN ANN (70.4%) RF (90%)
Rebai et al. (2020) Socioeconomic status, school type, school location, competition, teacher characteristic (experience, salary), class size, school size, gender, parental education, political context, parental pressure to identify the key factors that impact schools' academic performance and to explore their relationships Secondary schools 105 schools RT, RF   
Ahmad and Shahzadi (2018) Previous degree marks, Home environment, Study habits Learning skills, Hardworking and Academic interaction Identification of students in the risk group Undergraduate students 300 MPNN   95%
Musso et al., (2020) Learning strategies, coping strategies, cognitive factors, social support, background, self-concept, self-satisfaction, use of IT and reading Grade point average, academic retention, and degree completion Undergraduate students 655 ANN 60.5% 80.7%
Waheed et al., (2020) Students’ demographics, clickstream events Pass-fail, withdrawn-pass, distinction-fail, distinction-pass Undergraduate students 32593 ANN, SVM, LR 84% 93%
Xu et al. (2019) Internet usage behaviours comprise online time, internet connection frequency, internet traffic volume, and online time Predicting students' performance Undergraduate students 4000 DT, NN, SVM 71% 76%
Bernacki et al. (2020) Log records in the learning management system Predict achievement Undergradeate students 337 LR, NB, J-48 DT, J-Rip DT J-48 (53.71%) LR (67.36%)
Burgos et al. (2018) Historical student course grade data Drop out of a course Undergradeate students 100 SVM, FFNN, PESFAM, LOGIT_Act SVM (62.50) LOGIT_Act(97.13%)