NMF-BASED KEYWORD LEARNING FROM SCARCE DATA

被引:0
作者
Ons, Bart [1 ]
Gemmeke, Jort F. [1 ]
Van Hamme, Hugo [1 ]
机构
[1] KULeuven, Dept ESAT PSI, Louvain, Belgium
来源
2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU) | 2013年
关键词
weakly supervised non-negative matrix factorization; vocabulary acquisition; vocal user interface; data scarcity;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This research is situated in a project aimed at the development of a vocal user interface (VUI) that learns to understand its users specifically persons with a speech impairment. The vocal interface adapts to the speech of the user by learning the vocabulary from interaction examples. Word learning is implemented through weakly supervised non-negative matrix factorization (NMF). The goal of this study is to investigate how we can improve word learning when the number of interaction examples is low. We demonstrate two approaches to train NMF models on scarce data: 1) training word models using smoothed training data, and 2) training word models that strictly correspond to the grounding information derived from a few interaction examples. We found that both approaches can substantially improve word learning from scarce training data.
引用
收藏
页码:392 / 397
页数:6
相关论文
共 13 条
[1]  
[Anonymous], P ICASSP
[2]   ACORNS - towards computational modeling of communication and recognition skills [J].
Boves, Lou ;
ten Bosch, Louis ;
Moore, Roger .
PROCEEDINGS OF THE SIXTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, 2007, :349-+
[3]   Incremental word learning: Efficient HMM initialization and large margin discriminative adaptation [J].
Clemente, Irene Ayllon ;
Heckmann, Martin ;
Wrede, Britta .
SPEECH COMMUNICATION, 2012, 54 (09) :1029-1048
[4]   Confidence intervals in within-subject designs: A simpler solution to Loftus and Masson's method [J].
Cousineau, Denis .
TUTORIALS IN QUANTITATIVE METHODS FOR PSYCHOLOGY, 2005, 1 (01) :42-45
[5]  
DEMUYNCK K, 2001, THESIS K U LEUVEN
[6]  
Driesen J., 2012, THESIS K U LEUVEN
[7]  
Driesen J, 2012, INT CONF ACOUST SPEE, P5145, DOI 10.1109/ICASSP.2012.6287950
[8]  
Gemmeke J. F., 2012, P INTERSPEECH, P1
[9]  
Lee DD, 2001, ADV NEUR IN, V13, P556
[10]  
Sun M., 2011, ICML 11 S MACH LEARN