Automatic Rule Extraction for Modeling Pronunciation Variation

被引：0

作者：

Ahmed, Zeeshan ^{[1
]}

Carson-Berndsen, Julie ^{[1
]}

机构：

[1] Univ Coll Dublin, Sch Comp Sci & Informat, CNGL, Dublin, Ireland

来源：

COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PT II | 2011年 / 6609卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes the technique for automatic extraction of pronunciation rules from continuous speech corpus. The purpose of the work is to model pronunciation variation in phoneme based continuous speech recognition at. language model level. In modeling pronunciation variations, morphological variations and out-of-vocabulary words problem are also implicitly modeled in the system. It is not possible to model these kind of variations using dictionary based approach in phoneme based automatic speech recognition. The variations are automatically learned front annotated continuous speech corpus. The corpus is first aligned, on the basis of phoneme and letter, using a dynamic string alignment algorithm. The DSA is applied to isolated words to deal with intra-word variations as well as to complete sentences in the corpus to deal with inter-word variations. The pronunciation rules phonemes -> letters are extracted from these aligned speech units to build pronunciation model. The rules are finally fed to a phoneme-to-word decoder for recognition of the words having different pronunciations or that are OOV.

引用

页码：467 / 476

页数：10

共 50 条

[21] Automatic Generation of a Pronunciation Dictionary with Rich Variation Coverage Using SMT Methods
Karanasou, Panagiota
Lamel, Lori
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PT II, 2011, 6609 : 506 - 517
[22] Automatic recurrent ANN rule extraction with Genetic programming
Dorado, J
Rabuñal, JR
Rivero, D
Santos, A
Pazos, A
PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 1552 - 1557
[23] Pronunciation feature extraction
Hacker, C
Cincarek, T
Gruhn, R
Steidl, S
Nöth, E
Niemann, H
PATTERN RECOGNITION, PROCEEDINGS, 2005, 3663 : 141 - 148
[24] Cross-word Arabic pronunciation variation modeling for speech recognition
AbuZeina, Dia
Al-Khatib, Wasfi
Elshafei, Moustafa
Al-Muhtaseb, Husni
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2011, 14 (03) : 227 - 236
[25] INTEGRATED PRONUNCIATION LEARNING FOR AUTOMATIC SPEECH RECOGNITION USING PROBABILISTIC LEXICAL MODELING
Rasipuram, Ramya
Razavi, Marzieh
Magimai-Doss, Mathew
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5176 - 5180
[26] Automatic assessment of pronunciation quality
Dong, B
Zhao, QW
Zhang, JP
Yan, YH
2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 137 - 140
[27] Automatic Pronunciation Evaluation of Singing
Gupta, Chitralekha
Li, Haizhou
Wang, Ye
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1507 - 1511
[28] Automatic Pronunciation Assessment - A Review
El Kheir, Yassine
Ali, Ahmed
Chowdhury, Shammur Absar
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 8304 - 8324
[29] Automatic scoring of pronunciation quality
Neumeyer, L
Franco, H
Digalakis, V
Weintraub, M
SPEECH COMMUNICATION, 2000, 30 (2-3) : 83 - 93
[30] Automatic Pronunciation Evaluation and Classification
Deshmukh, Om D.
Joshi, Sachindra
Verma, Ashish
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1721 - 1724

← 1 2 3 4 5 →