Multimodal Sentiment Analysis on Video Streams using Lightweight Deep Neural Networks

被引:4
|
作者
Yakaew, Atitaya [1 ]
Dailey, Matthew N. [1 ]
Racharak, Teeradaj [2 ]
机构
[1] Asian Inst Technol, Dept Informat & Commun Technol, Klongluang, Pathitimhani, Thailand
[2] Japan Adv Inst Sci & Technol, Sch Informat Sci, Nomi, Ishikawa, Japan
关键词
Deep Learning for Multimodal Real-Time Analysis; Emotion Recognition; Video Processing and Analysis; Lightweight Deep Convolutional Neural Networks; Sentiment Classification; EMOTION RECOGNITION;
D O I
10.5220/0010304404420451
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real-time sentiment analysis on video streams involves classifying a subject's emotional expressions over time based on visual and/or audio information in the data stream. Sentiment can be analyzed using various modalities such as speech, mouth motion, and facial expression. This paper proposes a deep learning approach based on multiple modalities in which extracted features of an audiovisual data stream are fused in real time for sentiment classification. The proposed system comprises four small deep neural network models that analyze visual features and audio features concurrently. We fuse the visual and audio sentiment features into a single stream and accumulate evidence over time using an exponentially-weighted moving average to make a final prediction. Our work provides a promising solution to the problem of building real-time sentiment analysis systems that have constrained software or hardware capabilities. Experiments on the Ryerson audiovideo database of emotional speech (RAVDESS) show that deep audiovisual feature fusion yields substantial improvements over analysis of either single modality. We obtain an accuracy of 90.74%, which is better than baselines of 11.11% - 31.48% on a challenging test dataset.
引用
收藏
页码:442 / 451
页数:10
相关论文
共 50 条
  • [31] Sentiment Analysis in Social Networks Using Convolutional Neural Networks
    Elfaik, Hanane
    Nfaoui, El Habib
    ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT (AI2SD'2020), VOL 2, 2022, 1418 : 263 - 276
  • [32] Multimodal Deep Learning Approach for Real-Time Sentiment Analysis in Video Streaming
    Tejashwini, S. G.
    Aradhana, D.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (08) : 730 - 736
  • [33] Ensemble transfer learning-based multimodal sentiment analysis using weighted convolutional neural networks
    Ghorbanali, Alireza
    Sohrabi, Mohammad Karim
    Yaghmaee, Farzin
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (03)
  • [34] Multimodal sentiment analysis based on cross-instance graph neural networks
    Hongbin Wang
    Chun Ren
    Zhengtao Yu
    Applied Intelligence, 2024, 54 : 3403 - 3416
  • [35] Sentiment Analysis Using SVM and Deep Neural Network
    Dubey, Punit
    Mishra, Anshul
    Saha, Bijoy Krishna
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 952 - 957
  • [36] Multimodal sentiment analysis based on cross-instance graph neural networks
    Wang, Hongbin
    Ren, Chun
    Yu, Zhengtao
    APPLIED INTELLIGENCE, 2024, 54 (04) : 3403 - 3416
  • [37] Twitter Sentiment Analysis using Deep Neural Network
    Wazery, Yaser Maher
    Mohammed, Hager Saleh
    Houssein, Essam Halim
    2018 14TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2018, : 177 - 182
  • [38] A cognitive brain model for multimodal sentiment analysis based on attention neural networks
    Li, Yuanqing
    Zhang, Ke
    Wang, Jingyu
    Gao, Xinbo
    NEUROCOMPUTING, 2021, 430 : 159 - 173
  • [39] DelBERTo: A Deep Lightweight Transformer for Sentiment Analysis
    Molinaro, Luca
    Tatano, Rosalia
    Busto, Enrico
    Fiandrotti, Attilio
    Basile, Valerio
    Patti, Viviana
    AIXIA 2022 - ADVANCES IN ARTIFICIAL INTELLIGENCE, 2023, 13796 : 443 - 456
  • [40] VIDEO ERROR CONCEALMENT USING DEEP NEURAL NETWORKS
    Sankisa, Arun
    Punjabi, Arjun
    Katsaggelos, Aggelos K.
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 380 - 384