Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition

被引:0
作者
Lubis, Nurul [1 ]
Gomez, Randy [2 ]
Sakti, Sakriani [1 ]
Nakamura, Keisuke [2 ]
Yoshino, Koichiro [1 ]
Nakamura, Satoshi [1 ]
Nakadai, Kazuhiro [2 ]
机构
[1] Nara Inst Sci & Technol, Ikoma, Nara, Japan
[2] Honda Res Inst Japan Co Ltd, Wako, Saitama, Japan
来源
LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2016年
关键词
Japanese; emotion; corpus; audio-visual; multimodal;
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
Emotional aspects play a vital role in making human communication a rich and dynamic experience. As we introduce more automated system in our daily lives, it becomes increasingly important to incorporate emotion to provide as natural an interaction as possible. To achieve said incorporation, rich sets of labeled emotional data is prerequisite. However, in Japanese, existing emotion database is still limited to unimodal and bimodal corpora. Since emotion is not only expressed through speech, but also visually at the same time, it is essential to include multiple modalities in an observation. In this paper, we present the first audio-visual emotion corpora in Japanese, collected from 14 native speakers. The corpus contains 100 minutes of annotated and transcribed material. We performed preliminary emotion recognition experiments on the corpus and achieved an accuracy of 61.42% for five classes of emotion.
引用
收藏
页码:2180 / 2184
页数:5
相关论文
共 20 条
[1]  
[Anonymous], P IWSDS
[2]  
[Anonymous], 1996, MEDIA EQUATION PEOPL
[3]  
[Anonymous], 2014, EIR C COMP VIS SPRIN
[4]  
Arimoto Y, 2008, EMOTION RECOGNITION
[5]  
Bänziger T, 2007, LECT NOTES COMPUT SC, V4738, P476
[6]  
Bulut M., 2002, INTERSPEECH
[7]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]  
Douglas-Cowie Ellen, 2007, HUMAINE DATABASE ADD
[9]   Survey on speech emotion recognition: Features, classification schemes, and databases [J].
El Ayadi, Moataz ;
Kamel, Mohamed S. ;
Karray, Fakhri .
PATTERN RECOGNITION, 2011, 44 (03) :572-587
[10]  
Eyben F., 2010, P 18 ACM INT C MULT, P1459