Enhancing Health Mention Classification Through Reexamining Misclassified Samples and Robust Fine-Tuning Pre-Trained Language Models

被引:0
作者
Meng, Deyu [1 ]
Phuntsho, Tshewang [2 ]
Gonsalves, Tad [1 ]
机构
[1] Sophia Univ, Fac Sci & Technol, Dept Informat & Commun Sci, Chiyoda Ku, Tokyo 1028554, Japan
[2] Royal Univ Bhutan, Gedu Coll Business Studies, Chukha 21007, Bhutan
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Training; Perturbation methods; Computational modeling; Diseases; Social networking (online); Mathematical models; Robustness; Context modeling; Writing; Predictive models; Public health surveillance; health mention classification; pre-trained language models;
D O I
10.1109/ACCESS.2024.3510388
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In Public health surveillance (PHS), accurately identifying health mentions on social media is crucial for detecting health trends and outbreaks early. Health mention classification (HMC) can identify health-related content in social media, thereby predicting the health status of users. Nevertheless, traditional approaches face challenges due to noise in keyword-based data collection, affecting the precision of HMC. To address this challenge, our research introduces an innovative method that enhances the accuracy and robustness of pre-trained language models for HMC by employing a misclassified samples replay buffer and applying controlled perturbations to data representations. This approach allows for continuous learning from errors. It improves the model's ability to distinguish subtle semantic differences, significantly outperforming existing state-of-the-art and baseline models across three HMC datasets. Our findings demonstrate the method's effectiveness in improving health mention detection and contribute to the field of explainable AI, offering insights into the decision-making process of models. This work promises to bolster the use of social media as a reliable tool for PHS, facilitating more proactive and informed public health responses.
引用
收藏
页码:190445 / 190453
页数:9
相关论文
共 30 条
  • [1] Kindig D., Stoddart G., What is population health?, Amer. J. Public Health, 93, 3, pp. 380-383, (2003)
  • [2] Amara A., Hadj Taieb M.A., Ben Aouicha M., Multilingual topic modeling for tracking COVID-19 trends based on Facebook data analysis, Int. J. Speech Technol., 51, 5, pp. 3052-3073, (2021)
  • [3] De Quincey E., Kostkova P., Early warning and outbreak detection using social networking Websites: The potential of Twitter, Proc. Int. Conf. Electron. Healthcare, pp. 21-24, (2010)
  • [4] Kim E.-K., Seok J.H., Oh J.S., Lee H.W., Kim K.H., Use of hangeul Twitter to track and predict human influenza infection, PLoS ONE, 8, 7, (2013)
  • [5] Gold M.R., Stevenson D., Fryback D.G., HALYs and QALYs and DALYs, oh my: Similarities and differences in summary measures of population health, Annu. Rev. Public Health, 23, 1, pp. 115-134, (2002)
  • [6] Biddle R., Joshi A., Liu S., Paris C., Xu G., Leveraging sentiment distributions to distinguish figurative from literal health reports on Twitter, Proc. Web Conf., pp. 1217-1227, (2020)
  • [7] Khan P.I., Siddiqui S.A., Razzak I., Dengel A., Ahmed S., Improving health mention classification of social media content using contrastive adversarial training, IEEE Access, 10, pp. 87900-87910, (2022)
  • [8] Devlin J., Chang M.-W., Lee K., Toutanova K., BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, (2018)
  • [9] Zhuang L., Wayne L., Ya S., Jun Z., A robustly optimized BERT pretraining approach with post-training, Proc. Chin. Nat. Conf. Comput. Linguist., pp. 1218-1227, (2021)
  • [10] Naseem U., Razzak I., Khan S.K., Prasad M., A comprehensive survey on word representation models: From classical to state-of-The-Art word representation language models, ACM Trans. Asian Low-Resource Lang. Inf. Process., 20, 5, pp. 1-35, (2021)