Towards Automatic Cross-Language Classification of Cognitive Presence in Online Discussions

被引:30
作者
Barbosa, Gian [1 ]
Camelo, Raissa [1 ]
Cavalcanti, Anderson Pinheiro [2 ]
Miranda, Pericles [1 ]
Mello, Rafael Ferreira [1 ]
Kovanovic, Vitomir [3 ,4 ]
Gasevic, Dragan [5 ]
机构
[1] Univ Fed Rural Pernambuco, Dept Comp, Recife, PE, Brazil
[2] Univ Fed Pernambuco, Ctr Informat, Recife, PE, Brazil
[3] Univ South Australia, Sch Educ, Adelaide, SA, Australia
[4] Univ South Australia, Teaching Innovat Unit, Adelaide, SA, Australia
[5] Monash Univ, Fac Informat Technol, Melbourne, Vic, Australia
来源
LAK20: THE TENTH INTERNATIONAL CONFERENCE ON LEARNING ANALYTICS & KNOWLEDGE | 2020年
关键词
Community of Inquiry Model; Cross-Language Classification; Content Analytics; Online Discussion; Optimization;
D O I
10.1145/3375462.3375496
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents a study that examined automated cross-language classification of online discussion messages for the levels of cognitive presence, a key construct from the widely used Community of Inquiry (CoI) model of online learning. Specifically, we examined the classification of 1,500 Portuguese language discussion messages using a classifier trained on a corpus of the 1,747 English language discussion messages. In the study, a random forest classifier was developed using a small set of 108 validated indicators of psychological processes, linguistic coherence, and online discussion structure. The classifier obtained 67% accuracy and Cohen's K of 0.32, showing a moderate level of inter-rater agreement above chance and the general viability of the proposed approach. Most importantly, the findings suggest that certain aspects of cognitive presence construct are highly generalizable and transfer across different languages. Finally, the paper also presents a novel method for addressing class imbalance problem using a generic algorithm heuristic technique, which provided substantial improvements over the use of imbalanced dataset. Results and practical implications are further discussed.
引用
收藏
页码:605 / 614
页数:10
相关论文
共 54 条
[1]  
Anderson T., 2001, J ASYNCHRONOUS LEARN, V5, DOI [DOI 10.24059/OLJ.V5I2.1875, 10.24059/olj.v5i2.1875]
[2]   Three Generations of Distance Education Pedagogy [J].
Anderson, Terry ;
Dron, Jon .
INTERNATIONAL REVIEW OF RESEARCH IN OPEN AND DISTRIBUTED LEARNING, 2011, 12 (03) :80-97
[3]  
[Anonymous], 2019, Industrial-Strength Natural Language Processing
[4]   Developing a community of inquiry instrument: Testing a measure of the Community of Inquiry framework using a multi-institutional sample [J].
Arbaugh, J. B. ;
Cleveland-Innes, Martha ;
Diaz, Sebastian R. ;
Garrison, D. Randy ;
Ice, Philip ;
Richardson, Jennifer C. ;
Swan, Karen P. .
INTERNET AND HIGHER EDUCATION, 2008, 11 (3-4) :133-136
[5]  
Archer DR, 2001, J DISTANCE ED, V5, P1, DOI DOI 10.1080/08923640109527071
[6]  
Back T., 2018, EVOLUTIONARY COMPUTA
[7]  
Balage FilhoP P., 2013, P 9 BRAZILIAN S INFO, P215
[8]  
Barbosa Gian F. S., 2019, P 8 BRAZ C INT SYST
[9]  
Batista G. E. A. P. A., 2003, WOB, P35
[10]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32