Low-Resource Speech Recognition and Keyword-Spotting

被引:5
作者
Gales, Mark J. F. [1 ]
Knill, Kate M. [1 ]
Ragni, Anton [1 ]
机构
[1] Univ Cambridge, Dept Engn, Trumpington St, Cambridge, England
来源
SPEECH AND COMPUTER, SPECOM 2017 | 2017年 / 10458卷
关键词
Prosody perception; Narrow versus broad focus; Japanese learners of English; L2; acquisition; DEEP NEURAL-NETWORK; DATA AUGMENTATION;
D O I
10.1007/978-3-319-66429-3_1
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The IARPA Babel program ran from March 2012 to November 2016. The aim of the program was to develop agile and robust speech technology that can be rapidly applied to any human language in order to provide effective search capability on large quantities of real world data. This paper will describe some of the developments in speech recognition and keyword-spotting during the lifetime of the project. Two technical areas will be briefly discussed with a focus on techniques developed at Cambridge University: the application of deep learning for low-resource speech recognition; and efficient approaches for keyword spotting. Finally a brief analysis of the Babel speech language characteristics and language performance will be presented.
引用
收藏
页码:3 / 19
页数:17
相关论文
共 50 条
  • [31] ISI ASR System for the Low Resource Speech Recognition Challenge for Indian Languages
    Billa, Jayadev
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3207 - 3211
  • [32] ELASTIC SPECTRAL DISTORTION FOR LOW RESOURCE SPEECH RECOGNITION WITH DEEP NEURAL NETWORKS
    Kanda, Naoyuki
    Takeda, Ryu
    Obuchi, Yasunari
    2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 309 - 314
  • [33] Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping
    Thienpondt, Jenthe
    Demuynck, Kris
    INTERSPEECH 2022, 2022, : 2213 - 2217
  • [34] Prosodic Feature-Based Discriminatively Trained Low Resource Speech Recognition System
    Hasija, Taniya
    Kadyan, Virender
    Guleria, Kalpna
    Alharbi, Abdullah
    Alyami, Hashem
    Goyal, Nitin
    SUSTAINABILITY, 2022, 14 (02)
  • [35] End-to-end neural automatic speech recognition system for low resource languages
    Dhahbi, Sami
    Saleem, Nasir
    Bourouis, Sami
    Berrima, Mouhebeddine
    Verdu, Elena
    EGYPTIAN INFORMATICS JOURNAL, 2025, 29
  • [36] A Scheme for News Article Classification in a Low-Resource Language
    Yohannes, Hailemariam Mehari
    Amagasa, Toshiyuki
    INFORMATION INTEGRATION AND WEB INTELLIGENCE, IIWAS 2022, 2022, 13635 : 519 - 530
  • [37] Low-resource Neural Machine Translation: Methods and Trends
    Shi, Shumin
    Wu, Xing
    Su, Rihai
    Huang, Heyan
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (05)
  • [38] Data Augmentation for Low-Resource Quechua ASR Improvement
    Zevallos, Rodolfo
    Bel, Nuria
    Cambara, Guillermo
    Farrus, Mireia
    Luque, Jordi
    INTERSPEECH 2022, 2022, : 3518 - 3522
  • [39] SYNTHETIC DATA AUGMENTATION FOR IMPROVING LOW-RESOURCE ASR
    Thai, Bao
    Jimerson, Robert
    Arcoraci, Dominic
    Prud'hommeaux, Emily
    Ptucha, Raymond
    2019 IEEE WESTERN NEW YORK IMAGE AND SIGNAL PROCESSING WORKSHOP (WNYISPW), 2019,
  • [40] Neural Machine Translation for Low-resource Languages: A Survey
    Ranathunga, Surangika
    Lee, En-Shiun Annie
    Skenduli, Marjana Prifti
    Shekhar, Ravi
    Alam, Mehreen
    Kaur, Rishemjit
    ACM COMPUTING SURVEYS, 2023, 55 (11)