Comparison of ALBAYZIN query-by-example spoken term detection 2012 and 2014 evaluations

被引:8
|
作者
Tejedor, Javier [1 ]
Toledano, Doroteo T. [2 ]
Lopez-Otero, Paula [3 ]
Docio-Fernandez, Laura [3 ]
Garcia-Mateo, Carmen [3 ]
机构
[1] Univ Alcala, GEINTRA, Campus Univ Ctra Madrid Barcelona,Km 33,600, Madrid, Spain
[2] Univ Autonoma Madrid, Biometr Recognit Grp ATVS, Ave Francisco Tomas & Valiente, Madrid, Spain
[3] EE Telecomunicac, AtlantTIC Res Ctr, Multimedia Technol Grp GTM, Campus Univ Vigo S-N, Vigo VIGO, Spain
来源
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING | 2016年
关键词
Query-by-example spoken term detection; International evaluation; Search on spontaneous speech;
D O I
10.1186/s13636-016-0080-2
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Query-by-example spoken term detection (QbE STD) aims at retrieving data from a speech repository given an acoustic query containing the term of interest as input. Nowadays, it is receiving much interest due to the large volume of multimedia information. This paper presents the systems submitted to the ALBAYZIN QbE STD 2014 evaluation held as a part of the ALBAYZIN 2014 Evaluation campaign within the context of the IberSPEECH 2014 conference. This is the second QbE STD evaluation in Spanish, which allows us to evaluate the progress in this technology for this language. The evaluation consists in retrieving the speech files that contain the input queries, indicating the start and end times where the input queries were found, along with a score value that reflects the confidence given to the detection of the query. Evaluation is conducted on a Spanish spontaneous speech database containing a set of talks from workshops, which amount to about 7 h of speech. We present the database, the evaluation metric, the systems submitted to the evaluation, the results, and compare this second evaluation with the first ALBAYZIN QbE STD evaluation held in 2012. Four different research groups took part in the evaluations held in 2012 and 2014. In 2014, new multi-word and foreign queries were added to the single-word and in-language queries used in 2012. Systems submitted to the second evaluation are hybrid systems which integrate letter transcription- and template matching-based systems. Despite the significant improvement obtained by the systems submitted to this second evaluation compared to those of the first evaluation, results still show the difficulty of this task and indicate that there is still room for improvement.
引用
收藏
页码:1 / 19
页数:19
相关论文
共 50 条
  • [1] Comparison of ALBAYZIN query-by-example spoken term detection 2012 and 2014 evaluations
    Javier Tejedor
    Doroteo T. Toledano
    Paula Lopez-Otero
    Laura Docio-Fernandez
    Carmen Garcia-Mateo
    EURASIP Journal on Audio, Speech, and Music Processing, 2016
  • [2] ALBAYZIN Query-by-example Spoken Term Detection 2016 evaluation
    Javier Tejedor
    Doroteo T. Toledano
    Paula Lopez-Otero
    Laura Docio-Fernandez
    Jorge Proença
    Fernando Perdigão
    Fernando García-Granada
    Emilio Sanchis
    Anna Pompili
    Alberto Abad
    EURASIP Journal on Audio, Speech, and Music Processing, 2018
  • [3] ALBAYZIN Query-by-example Spoken Term Detection 2016 evaluation
    Tejedor, Javier
    Toledano, Doroteo T.
    Lopez-Otero, Paula
    Docio-Fernandez, Laura
    Proenca, Jorge
    Perdigao, Fernando
    Garcia-Granada, Fernando
    Sanchis, Emilio
    Pompili, Anna
    Abad, Alberto
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2018,
  • [4] Query-by-Example Spoken Term Detection ALBAYZIN 2012 evaluation: overview, systems, results, and discussion
    Tejedor, Javier
    Toledano, Doroteo T.
    Anguera, Xavier
    Varona, Amparo
    Hurtado, Lluis F.
    Miguel, Antonio
    Colas, Jose
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2013,
  • [5] Query-by-Example Spoken Term Detection ALBAYZIN 2012 evaluation: overview, systems, results, and discussion
    Javier Tejedor
    Doroteo T Toledano
    Xavier Anguera
    Amparo Varona
    Lluís F Hurtado
    Antonio Miguel
    José Colás
    EURASIP Journal on Audio, Speech, and Music Processing, 2013
  • [6] A Comparison of Query-by-Example Methods for Spoken Term Detection
    Shen, Wade
    White, Christopher M.
    Hazen, Timothy J.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2107 - 2110
  • [7] Query-by-example spoken term detection based on phonetic posteriorgram Query-by-example spoken term detection based on phonetic posteriorgram
    Song, Beili
    Zhang, Wei-Qiang
    Cai, Meng
    Liu, Jia
    Johnson, Michael T.
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT AND COMPUTING TECHNOLOGY, 2015, 30 : 1255 - 1260
  • [8] Query-by-Example Spoken Term Detection For OOV Terms
    Parada, Carolina
    Sethy, Abhinav
    Ramabhadran, Bhuvana
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 404 - +
  • [9] Search on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluation
    Javier Tejedor
    Doroteo T. Toledano
    Paula Lopez-Otero
    Laura Docio-Fernandez
    Mikel Peñagarikano
    Luis Javier Rodriguez-Fuentes
    Antonio Moreno-Sandoval
    EURASIP Journal on Audio, Speech, and Music Processing, 2019
  • [10] Search on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluation
    Tejedor, Javier
    Toledano, Doroteo T.
    Lopez-Otero, Paula
    Docio-Fernandez, Laura
    Penagarikano, Mikel
    Javier Rodriguez-Fuentes, Luis
    Moreno-Sandoval, Antonio
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2019, 2019 (1)