Multimodal sentiment and emotion recognition in hyperbolic space

被引:15
作者
Arano, Keith April [1 ]
Orsenigo, Carlotta [1 ]
Soto, Mauricio [1 ]
Vercellis, Carlo [1 ]
机构
[1] Politecn Milan, Dept Management Econ & Ind Engn, I-20156 Milan, Italy
关键词
Mmultimodal machine learning; Hyperbolic learning; Emotion recognition; Sentiment analysis; Deep learning; FUSION;
D O I
10.1016/j.eswa.2021.115507
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prior approaches for multimodal sentiment and emotion recognition (SER) exploit input data representations and neural networks based on the classical Euclidean geometry. Recently, however, the hyperbolic metric proved to be a powerful tool for data mapping, being able to capture the hierarchical structure of the relations among elements in the data. In this paper we propose the use of hyperbolic learning for SER, and show that the inclusion in the neural network of hyperbolic structures mapping the input into the hyperbolic space can improve the quality of the predictions. The benefits brought by the hyperbolic features are evaluated by developing extensions of existing methods following two approaches. From one side, we modified state-of-the-art models by including hyperbolic output layers. From the other, we generated hybrid neural network architectures by combining hyperbolic and Euclidean layers according to different schemes. The proposed hyperbolic models were tested on several classification tasks applied to benchmark multimodal SER datasets. Experiments gave strong evidence that in both simple and complex networks the introduction of a hyperbolic structure results in an improvement of the model accuracy. Specifically, the combined use of hyperbolic and Euclidean layers showed superior performance in almost all the classification tasks.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] [Anonymous], 2017, NIPS
  • [2] "Emotions are the Great Captains of Our Lives": Measuring Moods Through the Power of Physiological and Environmental Sensing
    Arano, Keith April
    Gloor, Peter
    Orsenigo, Carlotta
    Vercellis, Carlo
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (03) : 1378 - 1389
  • [3] Multimodal Machine Learning: A Survey and Taxonomy
    Baltrusaitis, Tadas
    Ahuja, Chaitanya
    Morency, Louis-Philippe
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (02) : 423 - 443
  • [4] Becigneul G., 2019, INT C LEARN REPR
  • [5] Bojanowski P., 2017, T ASSOC COMPUT LING, V5, P135, DOI [10.1162/tacl_a_00051, DOI 10.1162/TACLA00051]
  • [6] Stochastic Gradient Descent on Riemannian Manifolds
    Bonnabel, Silvere
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2013, 58 (09) : 2217 - 2229
  • [7] Geometric Deep Learning Going beyond Euclidean data
    Bronstein, Michael M.
    Bruna, Joan
    LeCun, Yann
    Szlam, Arthur
    Vandergheynst, Pierre
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (04) : 18 - 42
  • [8] Analyzing Connections Between User Attributes, Images, and Text
    Burdick, Laura
    Mihalcea, Rada
    Boyd, Ryan L.
    Pennebaker, James W.
    [J]. COGNITIVE COMPUTATION, 2021, 13 (02) : 241 - 260
  • [9] Degottex G, 2014, INT CONF ACOUST SPEE, DOI 10.1109/ICASSP.2014.6853739
  • [10] Devlin J, 2018, ARXIV