EFFECT OF PRONUNCIATIONS ON OOV QUERIES IN SPOKEN TERM DETECTION

Cited by: 28
Authors
Can, Dogan [1]
Cooper, Erica [2]
Sethy, Abhinav [3]
White, Chris [4]
Ramabhadran, Bhuvana [3]
Saraclar, Murat [1]
Affiliations
[1] Bogazici Univ, TR-80815 Bebek, Turkey
[2] MIT, Cambridge, MA 02139 USA
[3] IBM Corp, Armonk, NY 10504 USA
[4] Johns Hopkins Univ, Baltimore, MD 21218 USA
Source
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-8, PROCEEDINGS | 2009
Keywords
Speech Recognition; Speech Indexing and Retrieval; Spoken Term Detection; Weighted Finite State Transducers;
DOI
10.1109/ICASSP.2009.4960494
Chinese Library Classification
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
The spoken term detection (STD) task aims to return relevant segments from a spoken archive that contain the query terms, whether or not they are in the system vocabulary. This paper focuses on pronunciation modeling for out-of-vocabulary (OOV) terms, which frequently occur in STD queries. The STD system described in this paper uses Weighted Finite State Transducers (WFST) to index word-level and sub-word-level lattices or confusion networks produced by an LVCSR system. We investigate the inclusion of n-best pronunciation variants for OOV terms (obtained from letter-to-sound rules) in the search and present the results obtained by indexing confusion networks as well as lattices. Two observations stand out: phone indexes generated from sub-words represent OOVs well, and too many variants for the OOV terms degrade performance if the pronunciations are not weighted.
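The last observation in the abstract can be illustrated with a toy sketch. This is not the paper's WFST-based system: a plain Python dictionary stands in for the phone index, and the names `TOY_INDEX`, `VARIANTS`, and `search`, the example phone strings, the probabilities, and the threshold are all hypothetical values chosen for illustration. The sketch assumes each pronunciation variant carries a letter-to-sound probability and that a hit's score is its index posterior scaled by the variant's weight relative to the best variant.

```python
from typing import Dict, List, Tuple

Phones = Tuple[str, ...]
Hit = Tuple[str, float]  # (utterance id, start time in seconds)

# Hypothetical toy phone index: phone sequence -> [(utterance, start, posterior)].
# A real system would use a WFST index; a dict is used here only for illustration.
TOY_INDEX: Dict[Phones, List[Tuple[str, float, float]]] = {
    ("k", "ae", "n"): [("utt1", 1.2, 0.9)],
    ("k", "aa", "n"): [("utt2", 3.4, 0.6)],
}

# Hypothetical n-best variants for one OOV query, with made-up
# letter-to-sound probabilities.
VARIANTS: List[Tuple[Phones, float]] = [
    (("k", "ae", "n"), 0.8),
    (("k", "aa", "n"), 0.1),
]


def search(index: Dict[Phones, List[Tuple[str, float, float]]],
           variants: List[Tuple[Phones, float]],
           weighted: bool = True,
           threshold: float = 0.5) -> List[Hit]:
    """Return hits whose score reaches the threshold, sorted by (utterance, start)."""
    best_prob = max(prob for _, prob in variants)
    scores: Dict[Hit, float] = {}
    for phones, prob in variants:
        # Weighted: scale by the variant's probability relative to the best one.
        # Unweighted: every variant counts as much as the best one.
        weight = prob / best_prob if weighted else 1.0
        for utt, start, posterior in index.get(phones, []):
            key = (utt, start)
            scores[key] = max(scores.get(key, 0.0), posterior * weight)
    return sorted(hit for hit, score in scores.items() if score >= threshold)


if __name__ == "__main__":
    print(search(TOY_INDEX, VARIANTS, weighted=True))
    print(search(TOY_INDEX, VARIANTS, weighted=False))
```

In this toy setting the unlikely variant's hit survives the threshold only when variants are unweighted, which is one way extra variants can admit additional, possibly spurious, detections.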
Pages: 3957 / +
Number of pages: 2