Multilingual Query by Example Spoken Term Detection for Under-Resourced Languages

被引：0

作者：

Buzo, Andi ^{[1
]}

Cucu, Horia ^{[1
]}

Safta, Mihai ^{[1
]}

Burileanu, Corneliu ^{[1
]}

机构：

[1] Univ Politehn Bucuresti, Speech & Dialogue SpeeD Res Lab, Bucharest, Romania

来源：

2013 7TH CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN - COMPUTER DIALOGUE (SPED) | 2013年

关键词：

spoken term detection; multilingual acoustic model; under-resourced languages;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

We propose a query-by-example approach to multilingual Spoken Term Detection for under-resourced languages based on Automatic Speech Recognition. The approach overcomes the main difficulties met under these conditions, i.e., providing a new method for building multilingual acoustic models with few annotated data and searching in approximate Automatic Speech Recognition transcriptions providing high scalability. The acoustic models are obtained by adapting well-trained phonemes to the ones from the envisaged languages. The mapping is made according to International Phonetic Alphabet phoneme classification and a confusion matrix. The weighting of query length and alignment spread are incorporated in the Dynamic Time Warping technique to improve the searching method. Experimental validation was conducted on a standard data set consisting of 3 hours of mixed African languages. The recorded speech has telephonic quality and it is a mix of read and spontaneous speech.

引用

页数：6

共 22 条

[1] Anguera X., 2012, P MEDIAEVAL 2012 WOR
[2] Byrne W, 2000, INT CONF ACOUST SPEE, P1029
[3] Cucu H., 2011, P 2011 IEEE AUT SPEE, P260
[4] Query-By-Example Spoken Term Detection Using Phonetic Posteriorgram Templates
Hazen, Timothy J.
Shen, Wade
White, Christopher
[J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 421 - +
[5] Imseng D, 2012, INT CONF ACOUST SPEE, P4869, DOI 10.1109/ICASSP.2012.6289010
[6] Jansen A., 2012, P MEDIAEVAL 2012 WOR
[7] Joder C., 2012, P MEDIAEVAL 2012 WOR
[8] Le VB, 2005, INT CONF ACOUST SPEE, P821
[9] Spoken Term Detection from Bilingual Spontaneous Speech Using Code-switched Lattice-based Structures for Words and Subword Units
Lee, Hung-Yi
Tang, Yueh-Lien
Tang, Hao
Lee, Lin-Shan
[J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 410 - +
[10] A STUDY ON MULTILINGUAL ACOUSTIC MODELING FOR LARGE VOCABULARY ASR
Lin, Hui
Deng, Li
Yu, Dong
Gong, Yi-fan
Acero, Alex
Lee, Chin-Hui
[J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4333 - +

← 1 2 3 →