AI-assisted analysis of content, structure, and sentiment in MOOC discussion forums

被引:3
作者
Yee, Michael [1 ]
Roy, Anindya [2 ]
Perdue, Meghan [2 ]
Cuevas, Consuelo [1 ]
Quigley, Keegan [1 ]
Bell, Ana [3 ]
Rungta, Ahaan [2 ]
Miyagawa, Shigeru [4 ]
机构
[1] MIT Lincoln Lab, Artificial Intelligence Technol Grp, Lexington, MA 02421 USA
[2] MIT, Open Learning, Cambridge, MA USA
[3] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA USA
[4] MIT, Dept Linguist & Philosophy, Cambridge, MA USA
关键词
MOOCs; discussion forums; forum posts; natural language processing; text classification; machine learning; transformers; artificial intelligence;
D O I
10.3389/feduc.2023.1250846
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Discussion forums are a key component of online learning platforms, allowing learners to ask for help, provide help to others, and connect with others in the learning community. Analyzing patterns of forum usage and their association with course outcomes can provide valuable insight into how learners actually use discussion forums, and suggest strategies for shaping forum dynamics to improve learner experiences and outcomes. However, the fine-grained coding of forum posts required for this kind of analysis is a manually intensive process that can be challenging for large datasets, e.g., those that result from popular MOOCs. To address this issue, we propose an AI-assisted labeling process that uses advanced natural language processing techniques to train machine learning models capable of labeling a large dataset while minimizing human annotation effort. We fine-tune pretrained transformer-based deep learning models on category, structure, and emotion classification tasks. The transformer-based models outperform a more traditional baseline that uses support vector machines and a bag-of-words input representation. The transformer-based models also perform better when we augment the input features for an individual post with additional context from the post's thread (e.g., the thread title). We validate model quality through a combination of internal performance metrics, human auditing, and common-sense checks. For our Python MOOC dataset, we find that annotating approximately 1% of the forum posts achieves performance levels that are reliable for downstream analysis. Using labels from the validated AI models, we investigate the association of learner and course attributes with thread resolution and various forms of forum participation. We find significant differences in how learners of different age groups, gender, and course outcome status ask for help, provide help, and make posts with emotional (positive or negative) sentiment.
引用
收藏
页数:17
相关论文
共 43 条
  • [41] AI-assisted 3D analysis of grasping and reaching behavior of squirrel monkeys during recovery from cervical spinal cord injury
    Duque, Daniela Hernandez
    Yang, Pai-Feng
    Gore, John C.
    Chen, Li Min
    BEHAVIOURAL BRAIN RESEARCH, 2025, 476
  • [42] Design and Analysis of a Highly Sensitive Terahertz Biosensor Using Graphene Metasurfaces and Surface Plasmon Resonance for Protein Detection with AI-Assisted Locally Weighted Linear Regression for Behavior Prediction
    Dhandapani, Gokila
    Wekalao, Jacob
    Patel, Shobhit K.
    Al-zahrani, Fahad Ahmed
    PLASMONICS, 2024,
  • [43] One-step colorimetric isothermal detection of COVID-19 with AI-assisted automated result analysis: A platform model for future emerging point-of-care RNA/DNA disease diagnosis
    Jaroenram, Wansadaj
    Chatnuntawech, Itthi
    Kampeera, Jantana
    Pengpanich, Sukanya
    Leaungwutiwong, Pornsawan
    Tondee, Benyatip
    Sirithammajak, Sarawut
    Suvannakad, Rapheephat
    Khumwan, Pakapreud
    Dangtip, Sirintip
    Arunrut, Narong
    Bantuchai, Sirasate
    Nguitragool, Wang
    Wongwaroran, Suchawit
    Khanchaitit, Paisan
    Sattabongkot, Jetsumon
    Teerapittayanon, Surat
    Kiatpathomchai, Wansika
    TALANTA, 2022, 249