MULTIMODAL DEEP NEURAL NETS FOR DETECTING HUMOR IN TV SITCOMS

Cited by: 3
Authors
Bertero, Dario [1]
Fung, Pascale [1]
Affiliations
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Human Language Technol Ctr, Clear Water Bay, Hong Kong, Peoples R China
Source
2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016) | 2016
Keywords
deep learning; humor response; dialog; empathetic computing; TV sitcoms
DOI
10.1109/SLT.2016.7846293
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We propose a novel approach that combines acoustic and language features to predict humor in dialogues with a deep neural network. We analyze data from three popular TV sitcoms whose canned laughter indicates when the audience would react. We model the setup-punchline sequential relation of conversational humor with a Long Short-Term Memory network, with utterance encodings obtained from two Convolutional Neural Networks, one modeling word-level language features and the other modeling frame-level acoustic and prosodic features. Our neural network framework improves the F-score by over 5% relative to a Conditional Random Field baseline trained on a similar combination of acoustic and language features, and achieves a much higher recall. It is also more effective than a language-features-only setting, with an F-score 10% higher. It also generalizes well, reaching precision values of over 70% in most cases when trained and tested on different sitcoms.
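The abstract reports its results as precision, recall, and F-score. As a reminder of how these three quantities relate, here is a minimal sketch, assuming the balanced F1 variant (the record does not state which F-score the authors use). The function name `precision_recall_f1` and the counts in the usage lines are hypothetical and chosen only for illustration; they are not results from the paper.

```python
def precision_recall_f1(tp: int, fp: int, fn: int) -> tuple[float, float, float]:
    """Return (precision, recall, F1) from confusion counts.

    tp: utterances correctly predicted as humorous (followed by laughter)
    fp: utterances predicted as humorous that were not
    fn: humorous utterances the model missed
    Assumption: balanced F1, the harmonic mean of precision and recall.
    """
    # Guard against empty denominators so the function never divides by zero.
    precision = tp / (tp + fp) if (tp + fp) > 0 else 0.0
    recall = tp / (tp + fn) if (tp + fn) > 0 else 0.0
    if precision + recall == 0.0:
        return precision, recall, 0.0
    f1 = 2.0 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical counts, for illustration only (not taken from the paper).
p, r, f1 = precision_recall_f1(tp=70, fp=30, fn=10)
print(f"precision={p:.3f} recall={r:.3f} F1={f1:.3f}")
```

Because F1 is a harmonic mean, a large gain in recall can raise the F-score even when precision changes little, which is consistent with the way the abstract pairs its F-score improvement with a much higher recall.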
Pages: 383-390
Number of pages: 8