Automatic Rule Extraction for Modeling Pronunciation Variation

被引:0
|
作者
Ahmed, Zeeshan [1 ]
Carson-Berndsen, Julie [1 ]
机构
[1] Univ Coll Dublin, Sch Comp Sci & Informat, CNGL, Dublin, Ireland
来源
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PT II | 2011年 / 6609卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the technique for automatic extraction of pronunciation rules from continuous speech corpus. The purpose of the work is to model pronunciation variation in phoneme based continuous speech recognition at. language model level. In modeling pronunciation variations, morphological variations and out-of-vocabulary words problem are also implicitly modeled in the system. It is not possible to model these kind of variations using dictionary based approach in phoneme based automatic speech recognition. The variations are automatically learned front annotated continuous speech corpus. The corpus is first aligned, on the basis of phoneme and letter, using a dynamic string alignment algorithm. The DSA is applied to isolated words to deal with intra-word variations as well as to complete sentences in the corpus to deal with inter-word variations. The pronunciation rules phonemes -> letters are extracted from these aligned speech units to build pronunciation model. The rules are finally fed to a phoneme-to-word decoder for recognition of the words having different pronunciations or that are OOV.
引用
收藏
页码:467 / 476
页数:10
相关论文
共 50 条
  • [21] Automatic Generation of a Pronunciation Dictionary with Rich Variation Coverage Using SMT Methods
    Karanasou, Panagiota
    Lamel, Lori
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PT II, 2011, 6609 : 506 - 517
  • [22] Automatic recurrent ANN rule extraction with Genetic programming
    Dorado, J
    Rabuñal, JR
    Rivero, D
    Santos, A
    Pazos, A
    PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 1552 - 1557
  • [23] Pronunciation feature extraction
    Hacker, C
    Cincarek, T
    Gruhn, R
    Steidl, S
    Nöth, E
    Niemann, H
    PATTERN RECOGNITION, PROCEEDINGS, 2005, 3663 : 141 - 148
  • [24] Cross-word Arabic pronunciation variation modeling for speech recognition
    AbuZeina, Dia
    Al-Khatib, Wasfi
    Elshafei, Moustafa
    Al-Muhtaseb, Husni
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2011, 14 (03) : 227 - 236
  • [25] INTEGRATED PRONUNCIATION LEARNING FOR AUTOMATIC SPEECH RECOGNITION USING PROBABILISTIC LEXICAL MODELING
    Rasipuram, Ramya
    Razavi, Marzieh
    Magimai-Doss, Mathew
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5176 - 5180
  • [26] Automatic assessment of pronunciation quality
    Dong, B
    Zhao, QW
    Zhang, JP
    Yan, YH
    2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 137 - 140
  • [27] Automatic Pronunciation Evaluation of Singing
    Gupta, Chitralekha
    Li, Haizhou
    Wang, Ye
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1507 - 1511
  • [28] Automatic Pronunciation Assessment - A Review
    El Kheir, Yassine
    Ali, Ahmed
    Chowdhury, Shammur Absar
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 8304 - 8324
  • [29] Automatic scoring of pronunciation quality
    Neumeyer, L
    Franco, H
    Digalakis, V
    Weintraub, M
    SPEECH COMMUNICATION, 2000, 30 (2-3) : 83 - 93
  • [30] Automatic Pronunciation Evaluation and Classification
    Deshmukh, Om D.
    Joshi, Sachindra
    Verma, Ashish
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1721 - 1724