Experiments in syllable-based retrieval of broadcast news speech in Mandarin Chinese

被引:16
|
作者
Wang, HM [1 ]
机构
[1] Acad Sinica, Inst Informat Sci, Taipei 115, Taiwan
关键词
spoken document retrieval; broadcast news; Mandarin Chinese; syllable lattice; speech recognition; hidden Markov Model;
D O I
10.1016/S0167-6393(00)00023-6
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Spoken document retrieval (SDR) has been extensively studied in recent years because of its potential use in navigating large multi-media collections in the near future. Considering the characteristics and monosyllabic structure of the Chinese language, the syllable-based indexing for retrieval of spoken documents in Mandarin Chinese has been investigated, and extensive experiments on retrieval of broadcast news speech collected in Taiwan were performed. This paper reports some interesting results and findings obtained in this research. (C) 2000 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:49 / 60
页数:12
相关论文
共 50 条
  • [41] A retrieval system of broadcast news speech documents through keyboard and voice
    Nishizaki, H
    Nakagawa, S
    TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 286 - 289
  • [42] Development of the 2008 SRI Mandarin Speech-to-text System for Broadcast News and Conversation
    Lei, Xin
    Wu, Wei
    Wang, Wen
    Mandal, Arindam
    Stolcke, Andreas
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2087 - +
  • [43] DATA-DRIVEN LEXICON EXPANSION FOR MANDARIN BROADCAST NEWS AND CONVERSATION SPEECH RECOGNITION
    Lei, Xin
    Wang, Wen
    Stolcke, Andreas
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4329 - 4332
  • [44] An Efficient Syllable-Based Speech Segmentation Model Using Fuzzy and Threshold-Based Boundary Detection
    Kumari, Ruchika
    Dev, Amita
    Kumar, Ashwani
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2022, 21 (02)
  • [45] A Syllable-Based Turkish Speech Recognition System by Using Time Delay Neural Networks (TDNNs)
    Can, Burcu
    Artuner, Harun
    2013 INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2013, : 219 - 224
  • [46] A New Syllable-lattice Based Approach for Mandarin Spoken Document Retrieval
    Zhang, Lei
    Gao, Yunxia
    Xiang, Xuezhi
    Lu, Dong
    2009 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2009), 2009, : 1175 - 1178
  • [47] Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture
    Cernak, Milos
    Na, Xingyu
    Garner, Philip N.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3416 - 3419
  • [48] Towards Burmese (Myanmar) Morphological Analysis: Syllable-based Tokenization and Part-of-speech Tagging
    Ding, Chenchen
    Aye, Hnin Thu Zar
    Pa, Win Pa
    Nwet, Khin Thandar
    Soe, Khin Mar
    Utiyama, Masao
    Sumita, Eiichiro
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (01)
  • [49] A Novel Text-to-Speech Synthesis System Using Syllable-Based HMM for Tamil Language
    Manoharan, J. Samuel
    PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON SUSTAINABLE EXPERT SYSTEMS (ICSES 2021), 2022, 351 : 305 - 314
  • [50] HOW TO DESCRIBE SPEECH EMOTION MORE COMPLETELY - AN INVESTIGATION ON CHINESE BROADCAST NEWS SPEECH
    Gao Yingying
    Zhu Weibin
    2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 450 - 453