Active Learning for Domain Classification in a Commercial Spoken Personal Assistant

被引:4
作者
Chen, Xi C. [1 ]
Sagar, Adithya [1 ]
Kao, Justine T. [1 ]
Li, Tony Y. [1 ]
Klein, Christopher [1 ]
Pulman, Stephen [1 ]
Garg, Ashish [1 ]
Williams, Jason D. [1 ]
机构
[1] Apple Inc, One Apple Pk Way, Cupertino, CA 95014 USA
来源
INTERSPEECH 2019 | 2019年
关键词
intelligent personal assistant; domain selection; active learning;
D O I
10.21437/Interspeech.2019-1315
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
We describe a method for selecting relevant new training data for the LSTM-based domain selection component of our personal assistant system. Adding more annotated training data for any ML system typically improves accuracy, but only if it provides examples not already adequately covered in the existing data. However, obtaining, selecting, and labeling relevant data is expensive. This work presents a simple technique that automatically identifies new helpful examples suitable for human annotation. Our experimental results show that the proposed method, compared with random-selection and entropy-based methods, leads to higher accuracy improvements given a fixed annotation budget. Although developed and tested in the setting of a commercial intelligent assistant, the technique is of wider applicability.
引用
收藏
页码:1478 / 1482
页数:5
相关论文
共 10 条
[1]  
[Anonymous], 2018, P 2018 C EMP METH NA
[2]  
Asghar Nabiha, 2017, Proceedings of the 6th Joint Conference on Lexical and Computational Semantics (* SEM 2017), P78
[3]  
CHEN W, 2017, P 2017 C EMP METH NA, P599, DOI DOI 10.18653/V1/D17-1063
[4]  
Duong L, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, P43
[5]  
Hiraoka T., 2016, DIAL SOC ROB EN AN E, P67, DOI [10.1007/978, DOI 10.1007/978-981-10-2585-35]
[6]  
Peshterliev S., 2019, P 2019 C N AM CHAPT, V2, P90
[7]  
Settles B., 2010, Tech. Rep., V15, P201
[8]  
Settles B., 2012, Active Learning
[9]  
Shen Yanyao, 2017, P 2 WORKSHOP REPRESE, P252
[10]  
Su P., 2016, P 54 ANN M ASS COMP, V1