SPONTANEOUS SPEECH RECOGNITION FOR ROMANIAN IN SPOKEN DIALOGUE SYSTEMS

Cited by: 0

Authors:

Burileanu, Corneliu ^{[1]}

Popescu, Vladimir ^{[1,2]}

Buzo, Andi ^{[1]}

Petrea, Cristina Sorina ^{[1]}

Ghelmez-Hanes, Diana ^{[1]}

Affiliations:

[1] Univ Politehn Bucuresti, Fac Elect Telecommun & Informat Technol, Bucharest, Romania

[2] Univ Avignon, Lab Informat Avignon, Avignon, France

Source:

PROCEEDINGS OF THE ROMANIAN ACADEMY SERIES A-MATHEMATICS PHYSICS TECHNICAL SCIENCES INFORMATION SCIENCE | 2010 / Vol. 11 / No. 1

Keywords:

Continuous speech recognition; Speech database; Hidden Markov Modeling;

DOI:

Not available

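Chinese Library Classification (CLC) codes:

O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];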

Discipline classification codes:

07 ; 0710 ; 09 ;

Abstract:

In this paper we present an attempt to develop a speech recognition module for the Romanian language, intended for use in a dialogue system. The main characteristics of such a dialogue system are discussed first. We then explain the design and acquisition of a spontaneous speech database for training the decoder: the design guidelines followed in developing the database, several practical issues encountered, and some triphone balancing statistics are pointed out. Next, the speech recognition architecture (based on components of the "Hidden Markov Model Toolkit", HTK) is described in detail, with emphasis on its two aspects, training and decoding. The following section discusses several preliminary recognition results, emphasizing current limitations and the need to significantly increase the size of the database. A set of conclusions and perspectives is offered at the end of the paper.


Pages: 83-91

Number of pages: 9

