Looking for Confirmations: An Effective and Human-Like Visual Dialogue Strategy

被引:0
作者
Testoni, Alberto [1 ]
Bernardi, Raffaella [2 ,3 ]
机构
[1] Univ Trento, DISI, Trento, Italy
[2] CIMeC, Rovereto, TN, Italy
[3] Univ Trento, DISI, Rovereto, TN, Italy
来源
2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021) | 2021年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generating goal-oriented questions in Visual Dialogue tasks is a challenging and long-standing problem. State-Of-The-Art systems are shown to generate questions that, although grammatically correct, often lack an effective strategy and sound unnatural to humans. Inspired by the cognitive literature on information search and cross-situational word learning, we design Confirm-it, a model based on a beam search re-ranking algorithm that guides an effective goal-oriented strategy by asking questions that confirm the model's conjecture about the referent. We take the Guess What?! game as a case-study. We show that dialogues generated by Confirm-it are more natural and effective than beam search decoding without re-ranking.
引用
收藏
页码:9330 / 9338
页数:9
相关论文
共 26 条
[1]  
[Anonymous], 2018, P 11 INT C NAT LANG, DOI DOI 10.1109/VLSID.2018.111
[2]  
Asia-Pacific Association for Machine Translation (AAMT), 2017, AS PAC ASS MACH TRAN
[3]  
Baron J., 2000, THINKING DECIDING, V3rd ed.
[4]  
Borgeaud S, 2020, NEURAL GENERATION AND TRANSLATION, P97
[5]   Use of non-recycled plastics and paper as alternative fuel in cement production [J].
Bourtsalas, A. C. ;
Zhang, Jiao ;
Castaldi, M. J. ;
Themelis, N. J. .
JOURNAL OF CLEANER PRODUCTION, 2018, 181 :8-16
[6]   GuessWhat?! Visual object discovery through multi-modal dialogue [J].
de Vries, Harm ;
Strub, Florian ;
Chandar, Sarath ;
Pietquin, Olivier ;
Larochelle, Hugo ;
Courville, Aaron .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4466-4475
[7]  
Dusek O, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, P45
[8]  
Gatt A., 2013, Are we Bayesian referring expression generators
[9]  
Hargreaves J, 2021, 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), P2563
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778