Automatic Detection of Irony Based on Acoustic Features and Facial Expressions

被引:0
作者
Kochetkova, Uliana [1 ]
Slcreline, Pavel [1 ]
Evdokimova, Vera [1 ]
Borisov, Nikolai [1 ]
Scherbakov, Pavel [1 ]
Fedkin, Petr [1 ]
German, Rada [1 ]
机构
[1] St Petersburg State Univ, 7-9 Univ Skaya Embankment, St Petersburg, Russia
来源
SPEECH AND COMPUTER, SPECOM 2024, PT II | 2025年 / 15300卷
关键词
Irony; Multimedia Speech Corpus; Artificial Neural Networks; Acoustic Feature Extraction; Facial Expression Analysis; FACE;
D O I
10.1007/978-3-031-78014-1_6
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The current study deals with the automatic analysis of verbal irony using artificial neural networks. Detection of verbal irony is an important task nowadays, because the effectiveness of the communication depends on the correct interpretation of sentences with an ambiguous meaning. In the case, when the context is lacking, the correct sense can be understood not from the lexical content, but through phonetic features, as well as through co-speech mimics and gestures. Thus we accomplished a new research on the material of the multimedia corpus of Russian ironic speech, which contains the detailed phonetic annotation and irony evaluation by native listeners in perceptual auditory experiments. Two types of automated analysis were accomplished: based on acoustic feature and facial expression extraction. The use of the fully connected neural network and of the Wav2Vec 2.0 model for the automatic irony detection in audio signal demonstrated high level of irony recognition. We also tested on a part of the corpus the recognition of ironic facial expressions in video signal using convolutional neural network and the PyFeat library, which allowed us to conclude that this model can give good results when we increase the amount of the material.
引用
收藏
页码:70 / 82
页数:13
相关论文
共 50 条
[41]   Facial Expressions Recognition Based on Delaunay Triangulation of Landmark and Machine Learning [J].
Ayeche, Farid ;
Alti, Adel .
TRAITEMENT DU SIGNAL, 2021, 38 (06) :1575-1586
[42]   Automatic Detection of Acoustic Signals of Beluga Whales and Bottlenose Dolphins [J].
A. A. Tyshko ;
M. A. Krinitskiy ;
A. V. Shatravin ;
R. A. Belikov .
Moscow University Physics Bulletin, 2023, 78 :S217-S225
[43]   Hybrid Metaheuristics with Deep Learning Enabled Automated Deception Detection and Classification of Facial Expressions [J].
Alaskar, Haya .
CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (03) :5433-5449
[44]   Automatic Detection of Acoustic Signals of Beluga Whales and Bottlenose Dolphins [J].
Tyshko, A. A. ;
Krinitskiy, M. A. ;
Shatravin, A. V. ;
Belikov, R. A. .
MOSCOW UNIVERSITY PHYSICS BULLETIN, 2023, 78 (SUPPL 1) :S217-S225
[45]   Improving Parkinson Detection using Dynamic Features from Evoked Expressions in Video [J].
Gomez, Luis F. ;
Morales, Aythami ;
Orozco-Arroyave, Juan R. ;
Daza, Roberto ;
Fierrez, Julian .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, :1562-1570
[46]   Artificial Neural Network Based Ensemble Approach for Multicultural Facial Expressions Analysis [J].
Ali, Ghulam ;
Ali, Amjad ;
Ali, Farman ;
Draz, Umar ;
Majeed, Fiaz ;
Yasin, Sana ;
Ali, Tariq ;
Haider, Noman .
IEEE ACCESS, 2020, 8 :134950-134963
[47]   Automatic Detection of Acromegaly From Facial Photographs Using Machine Learning Methods [J].
Kong, Xiangyi ;
Gong, Shun ;
Su, Lijuan ;
Howard, Newton ;
Kong, Yanguo .
EBIOMEDICINE, 2018, 27 :94-102
[48]   Fully Automatic 3D Facial Expression Recognition using Local Depth Features [J].
Xue, Mingliang ;
Mian, Ajmal ;
Liu, Wanquan ;
Li, Ling .
2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, :1096-1103
[49]   Ensemble-based domain adaptation on social media posts for irony detection [J].
Saroj, Anita ;
Pal, Sukomal .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) :23249-23268
[50]   Ensemble-based domain adaptation on social media posts for irony detection [J].
Anita Saroj ;
Sukomal Pal .
Multimedia Tools and Applications, 2024, 83 :23249-23268