Progressive Neural Networks for Transfer Learning in Emotion Recognition

Cited by: 55
Authors
Gideon, John [1]
Khorram, Soheil [1]
Aldeneh, Zakaria [1]
Dimitriadis, Dimitrios [2]
Provost, Emily Mower [1]
Affiliations
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] IBM TJ Watson Res Ctr, Ossining, NY USA
Source
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017
Keywords
neural networks; transfer learning; progressive neural networks; computational paralinguistics; emotion recognition
DOI
10.21437/Interspeech.2017-1637
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Many paralinguistic tasks are closely related, and thus representations learned in one domain can be leveraged for another. In this paper, we investigate how knowledge can be transferred between three paralinguistic tasks: speaker, emotion, and gender recognition. Further, we extend this problem to cross-dataset tasks, asking how knowledge captured in one emotion dataset can be transferred to another. We focus on progressive neural networks and compare these networks to the conventional deep learning method of pre-training and fine-tuning. Progressive neural networks provide a way to transfer knowledge and avoid the forgetting effect present when pre-training neural networks on different tasks. Our experiments demonstrate that: (1) emotion recognition can benefit from using representations originally learned for different paralinguistic tasks and (2) transfer learning can effectively leverage additional datasets to improve the performance of emotion recognition systems.
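The general idea behind a progressive network can be illustrated with a minimal NumPy sketch: a column trained on a source task is kept frozen, and a new column for the target task receives the frozen column's hidden activations through lateral connections. Because the source weights are never updated, the source task is not forgotten. This record does not give the paper's architecture, so the layer sizes, the class names `Column` and `ProgressiveColumn`, and the lateral weights `U2` and `U3` are all hypothetical, and only the forward pass is shown (no training).

```python
import numpy as np


def relu(x):
    return np.maximum(x, 0.0)


class Column:
    """One feed-forward column: two hidden layers and a linear output layer."""

    def __init__(self, rng, n_in, n_hidden, n_out):
        self.W1 = rng.normal(0.0, 0.1, (n_in, n_hidden))
        self.W2 = rng.normal(0.0, 0.1, (n_hidden, n_hidden))
        self.W3 = rng.normal(0.0, 0.1, (n_hidden, n_out))

    def forward(self, x):
        h1 = relu(x @ self.W1)
        h2 = relu(h1 @ self.W2)
        return h1, h2, h2 @ self.W3


class ProgressiveColumn(Column):
    """Target-task column with lateral inputs from a frozen source column."""

    def __init__(self, rng, n_in, n_hidden, n_out, source):
        super().__init__(rng, n_in, n_hidden, n_out)
        self.source = source  # assumed already trained; never updated here
        n_src = source.W2.shape[0]  # hidden width of the source column
        # Lateral weights (hypothetical names): source layer k feeds target layer k+1.
        self.U2 = rng.normal(0.0, 0.1, (n_src, n_hidden))
        self.U3 = rng.normal(0.0, 0.1, (n_src, n_out))

    def forward(self, x):
        s1, s2, _ = self.source.forward(x)  # activations of the frozen column
        h1 = relu(x @ self.W1)
        h2 = relu(h1 @ self.W2 + s1 @ self.U2)
        out = h2 @ self.W3 + s2 @ self.U3
        return h1, h2, out
```

As a usage illustration under the same assumptions, a source column could stand for a speaker or gender recognizer and the target column for an emotion recognizer: `src = Column(rng, 8, 16, 3)` followed by `tgt = ProgressiveColumn(rng, 8, 16, 4, src)`. In pre-training and fine-tuning, by contrast, the source weights themselves would be reused and overwritten during target-task training.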
Pages: 1098-1102
Page count: 5