TY - GEN
T1 - Identifying At-risk Students from Course-specific Predictive Analytics
AU - Kwan, Chung Lim Christopher
PY - 2019/11/19
Y1 - 2019/11/19
N2 - Identifying at-risk students in a large class of an engineering mathematics course during the delivery of teaching and learning activities is not an easy task to be accomplished by many instructors, particularly in the first few weeks of their studies. In the paper, course-specific predictive analytics, called the multiple linear regression model, the logistic regression model and the classification and regression tree (CART) model are trained, tested and compared with the use of LMS data in the first semester of the academic year 2017-18 such as the level of achievements in online class activities, the mini-project, the mid-term test, assignments, and the final examination for classifying at-risk students as early as possible during the course of study. A feature selection method is used to select statistically significant variables in the development of multiple linear regression and logistic regression models for enhancing the generalizability of both models. It is found that 3 key variables such as the level of achievements in the 6th online class activity, the mid-term test and assignment 2, which may have pedagogically meaningful information, are crucial for classifying at-risk students. Despite the highest accuracy of the CART model, the logistic regression model significantly outperforms the multiple linear regression and the CART models in terms of the recall and f-measure of the testing set. Instead of selecting 3 key variables, the present logistic regression model which only comprises 2 statistically significant variables such as the level of achievements in the 6th online class activity and the mid-term test can be employed to identify at-risk students for early intervention of their studies once the results of the mid-term test and the 6th online class activity are made available at the end of week 7.
AB - Identifying at-risk students in a large class of an engineering mathematics course during the delivery of teaching and learning activities is not an easy task to be accomplished by many instructors, particularly in the first few weeks of their studies. In the paper, course-specific predictive analytics, called the multiple linear regression model, the logistic regression model and the classification and regression tree (CART) model are trained, tested and compared with the use of LMS data in the first semester of the academic year 2017-18 such as the level of achievements in online class activities, the mini-project, the mid-term test, assignments, and the final examination for classifying at-risk students as early as possible during the course of study. A feature selection method is used to select statistically significant variables in the development of multiple linear regression and logistic regression models for enhancing the generalizability of both models. It is found that 3 key variables such as the level of achievements in the 6th online class activity, the mid-term test and assignment 2, which may have pedagogically meaningful information, are crucial for classifying at-risk students. Despite the highest accuracy of the CART model, the logistic regression model significantly outperforms the multiple linear regression and the CART models in terms of the recall and f-measure of the testing set. Instead of selecting 3 key variables, the present logistic regression model which only comprises 2 statistically significant variables such as the level of achievements in the 6th online class activity and the mid-term test can be employed to identify at-risk students for early intervention of their studies once the results of the mid-term test and the 6th online class activity are made available at the end of week 7.
KW - At-risk Students
KW - Classification and Regression Tree
KW - F-measure
KW - Logistic Regression Model
KW - Multiple Linear Regression Model
KW - Recall
UR - http://www.scopus.com/inward/record.url?scp=85077680267&partnerID=8YFLogxK
M3 - Conference article published in proceeding or book
T3 - ICCE 2019 - 27th International Conference on Computers in Education, Proceedings
SP - 356
EP - 360
BT - ICCE 2019 - 27th International Conference on Computers in Education, Proceedings
A2 - Chang, Maiga
A2 - So, Hyo-Jeong
A2 - Wong, Lung-Hsiang
A2 - Shih, Ju-Ling
A2 - Yu, Fu-Yun
A2 - Banawan, Michelle P.
A2 - Chang, Ben
A2 - Chen, Weiqin
A2 - Coronel, Andrei D.
A2 - Gottipati, Swapna
A2 - Hoppe, H. Ulrich
A2 - Jong, Morris S.Y.
A2 - Liao, Calvin
A2 - Mason, Jon
A2 - Ouyang, Fan
A2 - Panjaburee, Patcharin
A2 - Rodrigo, Ma. Mercedes T.
A2 - Song, Yanjie
A2 - Srisawasdi, Niwat
A2 - Tlili, Ahmed
A2 - Yin, Chengjiu
PB - Asia-Pacific Society for Computers in Education
T2 - 27th International Conference on Computers in Education, ICCE 2019
Y2 - 2 December 2019 through 6 December 2019
ER -