Enhancing Emotion Prediction in Multimedia Content Through Multi-Task Learning

被引:0
作者
Fan, Wan [1 ]
机构
[1] West Anhui Univ, Dept Culture & Commun, Luan 237000, Peoples R China
关键词
Multi task learning; multimodal emotion analysis; timing; transformer; attention;
D O I
10.14569/IJACSA.2025.01602118
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This study presents a robust multimodal emotion analysis model aimed at improving emotion prediction in film and television communication. Addressing challenges in modal fusion and data association, the model integrates a Transformer-based framework with multi-task learning to capture emotional associations and temporal features across various modalities. It overcomes the limitations of single-modal labels by incorporating multi-task learning, and is tested on the Cmumosi dataset using both classification and regression tasks. The model achieves strong performance, with an average absolute error of 0.70, a Pearson correlation coefficient of 0.82, and an accuracy of 47.1% in a seven-class task. In a two-class task, it achieves an accuracy and F1 score of 88.4%. Predictions for specific video segments are highly consistent with actual labels, with predicted scores of 2.15 and 1.4. This research offers a new approach to multimodal emotion analysis, providing valuable insights for film and television content creation and setting the foundation for further advancements in this area.
引用
收藏
页码:1198 / 1209
页数:12
相关论文
共 20 条
[1]  
Abdullah T, 2022, ACM Computing Surveys, V55, P1
[2]  
Bames J, 2021, Natural Language Engineering, V27, P249
[3]   A Multitask Multiview Neural Network for End-to-End Aspect-Based Sentiment Analysis [J].
Bie, Yong ;
Yang, Yan .
BIG DATA MINING AND ANALYTICS, 2021, 4 (03) :195-207
[4]   A multi-task learning framework for end-to-end aspect sentiment triplet extraction [J].
Chen, Fang ;
Yang, Zhongliang ;
Huang, Yongfeng .
NEUROCOMPUTING, 2022, 479 :12-21
[5]   Mitigate Gender Bias Using Negative Multi-task Learning [J].
Gao, Liyuan ;
Zhan, Huixin ;
Sheng, Victor S. .
NEURAL PROCESSING LETTERS, 2023, 55 (08) :11131-11146
[6]  
Hande A, 2022, Journal of Data, Information and Management, V4, P137
[7]   A Multitask Multimodal Framework for Sentiment and Emotion-Aided Cyberbullying Detection [J].
Maity, Krishanu ;
Kumar, Abhishek ;
Saha, Sriparna .
IEEE INTERNET COMPUTING, 2022, 26 (04) :68-78
[8]   Exploring Multi-lingual, Multi-task, and Adversarial Learning for Low-resource Sentiment Analysis [J].
Mamta ;
Ekbal, Asif ;
Bhattacharyya, Pushpak .
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (05)
[9]  
Mao R, 2021, AAAI CONF ARTIF INTE, V35, P13534
[10]  
Purohit J., 2023, Archives Adv. Eng. Sci, V1, P11, DOI [10.47852/bonviewAAES32021220, DOI 10.47852/BONVIEWAAES32021220]