Feature Selection for Clustering Online Learners

被引:33
作者
Huang, Lei [1 ]
Wang, Xinghui [1 ]
Wu, Zhouhua [1 ]
Wang, Feiyu [2 ]
机构
[1] Guangxi Radio & TV Univ, Smart Educ Lab, Nanning, Peoples R China
[2] Miami Univ, Oxford, OH 45056 USA
来源
2019 EIGHTH INTERNATIONAL CONFERENCE ON EDUCATIONAL INNOVATION THROUGH TECHNOLOGY (EITT) | 2019年
关键词
clustering; learning analytics; educational data mining; feature selection; dimensionality reduction;
D O I
10.1109/EITT.2019.00009
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
As one of the most important approaches for learning analytics and educational data mining, various clustering algorithms have been explored and compared in the analysis of online learners by their behavior. However, choosing which features for clustering has a strong impact on the quality of clustering, and has not received enough attention yet. By using an entropy-based feature selection method, this research broadens the range of candidate initial features and provides efficient algorithms for extracting the most important features. Experiment with real-life data reveals that this method not only overcomes the data sparsity and complexity problems for clustering in high-dimensional feature space but also surpasses dimensionality reduction methods like PCA in interpretability.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 14 条
[1]  
Arora S., 2017, AM J DISTANCE ED
[2]  
Balakrishnan G., 2013, ELECT ENG COMPUTER S, V53, P57
[3]  
Bogarin A., 2014, P 4 INT C LEARN AN K, P11
[4]  
Bovo Angela, 2013, 2013 Second International Conference on E-Learning and E-Technologies in Education (ICEEE 2013), P121, DOI 10.1109/ICeLeTE.2013.6644359
[5]  
Dash M, 2000, LECT NOTES ARTIF INT, V1805, P110
[6]  
Ferguson Rebecca., 2015, P 5 INT C LEARNING A, P51, DOI [DOI 10.1145/2723576, DOI 10.1145/2723576.2723606, 10.1145/2723576.2723606]
[7]  
Guyon I., 2020, J MACH LEARN RES, V3, P1157, DOI [DOI 10.1162/153244303322753616, 10.1162/153244303322753616]
[8]  
Han J, 2012, MOR KAUF D, P1
[9]  
Kizilcec Rene F, 2013, P 3 INT C LEARN AN K, P170, DOI 10.1145/2460296.2460330
[10]   Clustering and Sequential Pattern Mining of Online Collaborative Learning Data [J].
Perera, Dilhan ;
Kay, Judy ;
Koprinska, Irena ;
Yacef, Kalina ;
Zaiane, Osmar R. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (06) :759-772