A Combined Rule-Based & Machine Learning Audio-Visual Emotion Recognition Approach

被引：52

作者：

Seng, Kah Phooi ^{[1
]}

Ang, Li-Minn ^{[1
]}

Ooi, Chien Shing ^{[2
]}

机构：

[1] Charles Sturt Univ, Sch Comp & Math, Bathurst, NSW 2678, Australia

[2] Sunway Univ, Dept Comp Sci & Networked Syst, Subang Jaya 47500, Malaysia

来源：

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING | 2018年 / 9卷 / 01期

关键词：

Emotion recognition; audio-visual processing; rule-based; machine learning; multimodal system; LINEAR DISCRIMINANT-ANALYSIS; EFFICIENT APPROACH; FACE; FRAMEWORK; FUSION; AUDIO; LDA;

D O I：

10.1109/TAFFC.2016.2588488

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes an audio-visual emotion recognition system that uses a mixture of rule-based and machine learning techniques to improve the recognition efficacy in the audio and video paths. The visual path is designed using the Bi-directional Principal Component Analysis (BDPCA) and Least-Square Linear Discriminant Analysis (LSLDA) for dimensionality reduction and discrimination. The extracted visual features are passed into a newly designed Optimized Kernel-Laplacian Radial Basis Function (OKL-RBF) neural classifier. The audio path is designed using a combination of input prosodic features (pitch, log-energy, zero crossing rates and Teager energy operator) and spectral features (Mel-scale frequency cepstral coefficients). The extracted audio features are passed into an audio feature level fusion module that uses a set of rules to determine the most likely emotion contained in the audio signal. An audio visual fusion module fuses outputs from both paths. The performances of the proposed audio path, visual path, and the final system are evaluated on standard databases. Experiment results and comparisons reveal the good performance of the proposed system.

引用

页码：3 / 13

页数：11

共 50 条

[31] Joint modelling of audio-visual cues using attention mechanisms for emotion recognition
Esam Ghaleb
Jan Niehues
Stylianos Asteriadis
Multimedia Tools and Applications, 2023, 82 : 11239 - 11264
[32] Joint modelling of audio-visual cues using attention mechanisms for emotion recognition
Ghaleb, Esam
Niehues, Jan
Asteriadis, Stylianos
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (08) : 11239 - 11264
[33] EXTRACTING AUDIO-VISUAL FEATURES FOR EMOTION RECOGNITION THROUGH ACTIVE FEATURE SELECTION
Haider, Fasih
Pollak, Senja
Albert, Pierre
Luz, Saturnino
2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
[34] Audio-Visual Emotion Recognition System Using Multi-Modal Features
Handa, Anand
Agarwal, Rashi
Kohli, Narendra
INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2021, 15 (04)
[35] SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild
Kossaifi, Jean
Walecki, Robert
Panagakis, Yannis
Shen, Jie
Schmitt, Maximilian
Ringeval, Fabien
Han, Jing
Pandit, Vedhas
Toisoul, Antoine
Schuller, Bjorn
Star, Kam
Hajiyev, Elnar
Pantic, Maja
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (03) : 1022 - 1040
[36] Feature and Decision Level Audio-visual Data Fusion in Emotion Recognition Problem
Sidorov, Maxim
Sopov, Evgenii
Ivanov, Ilia
Minker, Wolfgang
ICIMCO 2015 PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL. 2, 2015, : 246 - 251
[37] Data Augmentation for Audio-Visual Emotion Recognition with an Efficient Multimodal Conditional GAN
Ma, Fei
Li, Yang
Ni, Shiguang
Huang, Shao-Lun
Zhang, Lin
APPLIED SCIENCES-BASEL, 2022, 12 (01):
[38] Audio-visual affective expression recognition
Huang, Thomas S.
Zeng, Zhihong
MIPPR 2007: PATTERN RECOGNITION AND COMPUTER VISION, 2007, 6788
[39] Facial Emotion Recognition for Photo and Video Surveillance Based on Machine Learning and Visual Analytics
Kalyta, Oleg
Barmak, Olexander
Radiuk, Pavlo
Krak, Iurii
APPLIED SCIENCES-BASEL, 2023, 13 (17):
[40] Multi-Corpus Learning for Audio-Visual Emotions and Sentiment Recognition
Ryumina, Elena
Markitantov, Maxim
Karpov, Alexey
MATHEMATICS, 2023, 11 (16)

← 1 2 3 4 5 →