Comparison of ALBAYZIN query-by-example spoken term detection 2012 and 2014 evaluations

Cited by: 8

Authors:

Tejedor, Javier [1]

Toledano, Doroteo T. [2]

Lopez-Otero, Paula [3]

Docio-Fernandez, Laura [3]

Garcia-Mateo, Carmen [3]

Affiliations:

[1] Univ Alcala, GEINTRA, Campus Univ Ctra Madrid Barcelona,Km 33,600, Madrid, Spain

[2] Univ Autonoma Madrid, Biometr Recognit Grp ATVS, Ave Francisco Tomas & Valiente, Madrid, Spain

[3] EE Telecomunicac, AtlantTIC Res Ctr, Multimedia Technol Grp GTM, Campus Univ Vigo S-N, Vigo VIGO, Spain

Source:

EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING | 2016

Keywords:

Query-by-example spoken term detection; International evaluation; Search on spontaneous speech;

DOI:

10.1186/s13636-016-0080-2

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

Query-by-example spoken term detection (QbE STD) aims at retrieving data from a speech repository given an acoustic query containing the term of interest as input. Nowadays, it is receiving much interest due to the large volume of multimedia information. This paper presents the systems submitted to the ALBAYZIN QbE STD 2014 evaluation held as a part of the ALBAYZIN 2014 Evaluation campaign within the context of the IberSPEECH 2014 conference. This is the second QbE STD evaluation in Spanish, which allows us to evaluate the progress in this technology for this language. The evaluation consists in retrieving the speech files that contain the input queries, indicating the start and end times where the input queries were found, along with a score value that reflects the confidence given to the detection of the query. Evaluation is conducted on a Spanish spontaneous speech database containing a set of talks from workshops, which amount to about 7 h of speech. We present the database, the evaluation metric, the systems submitted to the evaluation, the results, and compare this second evaluation with the first ALBAYZIN QbE STD evaluation held in 2012. Four different research groups took part in the evaluations held in 2012 and 2014. In 2014, new multi-word and foreign queries were added to the single-word and in-language queries used in 2012. Systems submitted to the second evaluation are hybrid systems which integrate letter transcription- and template matching-based systems. Despite the significant improvement obtained by the systems submitted to this second evaluation compared to those of the first evaluation, results still show the difficulty of this task and indicate that there is still room for improvement.

Pages: 1 / 19

Page count: 19
