AI-assisted analysis of content, structure, and sentiment in MOOC discussion forums

被引：3

作者：

Yee, Michael ^{[1
]}

Roy, Anindya ^{[2
]}

Perdue, Meghan ^{[2
]}

Cuevas, Consuelo ^{[1
]}

Quigley, Keegan ^{[1
]}

Bell, Ana ^{[3
]}

Rungta, Ahaan ^{[2
]}

Miyagawa, Shigeru ^{[4
]}

机构：

[1] MIT Lincoln Lab, Artificial Intelligence Technol Grp, Lexington, MA 02421 USA

[2] MIT, Open Learning, Cambridge, MA USA

[3] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA USA

[4] MIT, Dept Linguist & Philosophy, Cambridge, MA USA

来源：

FRONTIERS IN EDUCATION | 2023年 / 8卷

关键词：

MOOCs; discussion forums; forum posts; natural language processing; text classification; machine learning; transformers; artificial intelligence;

D O I：

10.3389/feduc.2023.1250846

中图分类号：

G40 [教育学];

学科分类号：

040101 ; 120403 ;

摘要：

Discussion forums are a key component of online learning platforms, allowing learners to ask for help, provide help to others, and connect with others in the learning community. Analyzing patterns of forum usage and their association with course outcomes can provide valuable insight into how learners actually use discussion forums, and suggest strategies for shaping forum dynamics to improve learner experiences and outcomes. However, the fine-grained coding of forum posts required for this kind of analysis is a manually intensive process that can be challenging for large datasets, e.g., those that result from popular MOOCs. To address this issue, we propose an AI-assisted labeling process that uses advanced natural language processing techniques to train machine learning models capable of labeling a large dataset while minimizing human annotation effort. We fine-tune pretrained transformer-based deep learning models on category, structure, and emotion classification tasks. The transformer-based models outperform a more traditional baseline that uses support vector machines and a bag-of-words input representation. The transformer-based models also perform better when we augment the input features for an individual post with additional context from the post's thread (e.g., the thread title). We validate model quality through a combination of internal performance metrics, human auditing, and common-sense checks. For our Python MOOC dataset, we find that annotating approximately 1% of the forum posts achieves performance levels that are reliable for downstream analysis. Using labels from the validated AI models, we investigate the association of learner and course attributes with thread resolution and various forms of forum participation. We find significant differences in how learners of different age groups, gender, and course outcome status ask for help, provide help, and make posts with emotional (positive or negative) sentiment.

引用

页数：17

共 43 条

[41] AI-assisted 3D analysis of grasping and reaching behavior of squirrel monkeys during recovery from cervical spinal cord injury
Duque, Daniela Hernandez
Yang, Pai-Feng
Gore, John C.
Chen, Li Min
BEHAVIOURAL BRAIN RESEARCH, 2025, 476
[42] Design and Analysis of a Highly Sensitive Terahertz Biosensor Using Graphene Metasurfaces and Surface Plasmon Resonance for Protein Detection with AI-Assisted Locally Weighted Linear Regression for Behavior Prediction
Dhandapani, Gokila
Wekalao, Jacob
Patel, Shobhit K.
Al-zahrani, Fahad Ahmed
PLASMONICS, 2024,
[43] One-step colorimetric isothermal detection of COVID-19 with AI-assisted automated result analysis: A platform model for future emerging point-of-care RNA/DNA disease diagnosis
Jaroenram, Wansadaj
Chatnuntawech, Itthi
Kampeera, Jantana
Pengpanich, Sukanya
Leaungwutiwong, Pornsawan
Tondee, Benyatip
Sirithammajak, Sarawut
Suvannakad, Rapheephat
Khumwan, Pakapreud
Dangtip, Sirintip
Arunrut, Narong
Bantuchai, Sirasate
Nguitragool, Wang
Wongwaroran, Suchawit
Khanchaitit, Paisan
Sattabongkot, Jetsumon
Teerapittayanon, Surat
Kiatpathomchai, Wansika
TALANTA, 2022, 249

← 1 2 3 4 5 →