A Data Augmentation Approach to Sentiment Analysis of MOOC Reviews

被引:0
作者
Li, Guangmin [1 ]
Zhou, Long [1 ]
Tong, Qiang [1 ]
Ding, Yi [1 ]
Qi, Xiaolin [2 ]
Liu, Hang [3 ]
机构
[1] Hubei Normal Univ, Sch Comp & Informat Engn, Huangshi, Peoples R China
[2] Wuhan Technol & Business Univ, Acad Affairs Off, Wuhan, Peoples R China
[3] Cent China Normal Univ, Coll Phys Sci & Technol, Wuhan, Peoples R China
关键词
Data augmentation; sentiment analysis; MOOC; natural language processing; deep learning;
D O I
10.14569/IJACSA.2024.01508122
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
To address the lack of Chinese online course review corpora for aspect-based sentiment analysis, we propose Semantic Token Augmentation and Replacement (STAR), a semantic-relative distance-based data augmentation method. STAR leverages natural language processing techniques such as word embedding and semantic similarity to extract high-frequency words near aspect terms, learns their word vectors to obtain synonyms and replaces these words to enhance sentence diversity while maintaining semantic consistency. Experiments on a Chinese MOOC dataset show STAR improves Macro-F1 scores by 3.39%-8.18% for LCFS-BERT and 1.66%-8.37% for LCF-BERT compared to baselines. These results demonstrate STAR's effectiveness in improving the generalization ability of deep learning models for Chinese MOOC sentiment analysis.
引用
收藏
页码:1258 / 1264
页数:7
相关论文
共 34 条
  • [1] Abonizio Hugo Queiroz, 2022, IEEE Transactions on Artificial Intelligence, V3, P657, DOI 10.1109/TAI.2021.3114390
  • [2] Brinkmann A, 2024, Arxiv, DOI arXiv:2310.12537
  • [3] Chen SG, 2021, Arxiv, DOI arXiv:2109.01758
  • [4] Understanding Learners' Perception of MOOCs Based on Review Data Analysis Using Deep Learning and Sentiment Analysis
    Chen, Xieling
    Wang, Fu Lee
    Cheng, Gary
    Chow, Man-Kong
    Xie, Haoran
    [J]. FUTURE INTERNET, 2022, 14 (08)
  • [5] Coulombe C, 2018, Arxiv, DOI arXiv:1812.04718
  • [6] Pre-Training With Whole Word Masking for Chinese BERT
    Cui, Yiming
    Che, Wanxiang
    Liu, Ting
    Qin, Bing
    Yang, Ziqing
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 3504 - 3514
  • [7] Dai X, 2020, Arxiv, DOI arXiv:2010.11683
  • [8] Research on the factors influencing the learner satisfaction of MOOCs
    Du, Bingxin
    [J]. EDUCATION AND INFORMATION TECHNOLOGIES, 2023, 28 (02) : 1935 - 1955
  • [9] Feng SY, 2021, Arxiv, DOI arXiv:2105.03075
  • [10] Tailored text augmentation for sentiment analysis
    Feng, Zijian
    Zhou, Hanzhang
    Zhu, Zixiao
    Mao, Kezhi
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 205