Multilingual Query by Example Spoken Term Detection for Under-Resourced Languages

Times cited: 0
Authors
Buzo, Andi [1]
Cucu, Horia [1]
Safta, Mihai [1]
Burileanu, Corneliu [1]
Affiliations
[1] Univ Politehn Bucuresti, Speech & Dialogue SpeeD Res Lab, Bucharest, Romania
Source
2013 7TH CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED) | 2013
Keywords
spoken term detection; multilingual acoustic model; under-resourced languages
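DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract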
We propose a query-by-example approach to multilingual Spoken Term Detection for under-resourced languages, based on Automatic Speech Recognition (ASR). The approach addresses the two main difficulties met under these conditions: it provides a new method for building multilingual acoustic models from little annotated data, and it searches approximate ASR transcriptions with high scalability. The acoustic models are obtained by adapting well-trained phonemes to those of the target languages. The mapping follows the International Phonetic Alphabet phoneme classification and a confusion matrix. Weighting by query length and alignment spread is incorporated into the Dynamic Time Warping technique to improve the search. Experimental validation was conducted on a standard data set consisting of 3 hours of mixed African languages. The recorded speech is of telephone quality and mixes read and spontaneous speech.
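To make the search step more concrete, the following is a minimal sketch of subsequence Dynamic Time Warping over phoneme strings, where both the query and the searched audio are assumed to have already been decoded into phoneme sequences. It is not the authors' implementation: the 0/1 substitution cost, the function names `sub_cost` and `search`, the weight `alpha`, and the exact way query length and alignment spread enter the score are all assumptions made for illustration. The abstract states only that these two factors are weighted in, not how.

```python
from typing import List, Tuple


def sub_cost(a: str, b: str) -> float:
    """Hypothetical substitution cost: 0 for identical phonemes, 1 otherwise.

    A confusion-matrix-based cost could be plugged in here instead.
    """
    return 0.0 if a == b else 1.0


def search(query: List[str], transcript: List[str],
           alpha: float = 0.5) -> Tuple[float, int, int]:
    """Find the transcript span that best matches the query.

    Returns (score, start, end) with `end` exclusive; lower scores are better.
    Assumed scoring: DTW cost divided by query length, plus `alpha` times the
    relative difference between matched span length and query length.
    """
    n, m = len(query), len(transcript)
    if n == 0 or m == 0:
        raise ValueError("query and transcript must be non-empty")

    # cost[i][j]: best cost of aligning query[:i+1] to a span ending at j.
    # start[i][j]: index where that span begins in the transcript.
    cost = [[0.0] * m for _ in range(n)]
    start = [[0] * m for _ in range(n)]

    # First query phoneme may align to any transcript position (free start).
    for j in range(m):
        cost[0][j] = sub_cost(query[0], transcript[j])
        start[0][j] = j

    for i in range(1, n):
        # Column 0 can only be reached by a vertical step.
        cost[i][0] = cost[i - 1][0] + sub_cost(query[i], transcript[0])
        start[i][0] = start[i - 1][0]
        for j in range(1, m):
            candidates = [
                (cost[i - 1][j - 1], start[i - 1][j - 1]),  # diagonal
                (cost[i - 1][j], start[i - 1][j]),          # vertical
                (cost[i][j - 1], start[i][j - 1]),          # horizontal
            ]
            best, s = min(candidates, key=lambda c: c[0])
            cost[i][j] = best + sub_cost(query[i], transcript[j])
            start[i][j] = s

    # Pick the end position with the best length- and spread-weighted score.
    best_result = None
    for j in range(m):
        span = j + 1 - start[n - 1][j]
        score = cost[n - 1][j] / n + alpha * abs(span - n) / n
        if best_result is None or score < best_result[0]:
            best_result = (score, start[n - 1][j], j + 1)
    return best_result


if __name__ == "__main__":
    print(search(["k", "a", "t"], ["s", "i", "k", "a", "t", "o"]))
```

Dividing by the query length keeps scores comparable between short and long queries, and the spread term penalizes alignments that stretch or compress the query over a span of very different length. Both choices are illustrative and would need tuning on real data.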
Pages: 6