SPONTANEOUS SPEECH RECOGNITION FOR ROMANIAN IN SPOKEN DIALOGUE SYSTEMS

被引:0
|
作者
Burileanu, Corneliu [1 ]
Popescu, Vladimir [1 ,2 ]
Buzo, Andi [1 ]
Petrea, Cristina Sorina [1 ]
Ghelmez-Hanes, Diana [1 ]
机构
[1] Univ Politehn Bucuresti, Fac Elect Telecommun & Informat Technol, Bucharest, Romania
[2] Univ Avignon, Lab Informat Avignon, Avignon, France
关键词
Continuous speech recognition; Speech database; Hidden Markov Modeling;
D O I
暂无
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper we present an attempt to develop a speech recognition module for the Romanian language in order to be used in a dialogue system. The main characteristics of such a dialogue system are first discussed. Further, we explain the design and acquisition of a spontaneous speech database for training the decoder: the design guidelines in developing the database, as well as several practical issues encountered, along with some triphones balancing statistics arc pointed out. Then, the speech recognition architecture (based on components in the "Hidden Markov Modeling Toolkit" - HTK) is described in detail, emphasizing the two aspects, training and decoding. In the next section, a discussion of several preliminary recognition results is provided, emphasizing current limitations and the need to significantly increase the size of the database. A set of conclusions and perspectives are offered at the end of the paper.
引用
收藏
页码:83 / 91
页数:9
相关论文
共 50 条
  • [1] Effects of speech recognition accuracy on the performance of DARPA communicator spoken dialogue systems
    Sanders G.A.
    Le A.N.
    International Journal of Speech Technology, 2004, 7 (4) : 293 - 309
  • [2] Two-level speech recognition to enhance the performance of spoken dialogue systems
    Lopez-Cozar, Ramon
    Callejas, Zoraida
    KNOWLEDGE-BASED SYSTEMS, 2006, 19 (03) : 153 - 163
  • [3] The utility of semantic-pragmatic information and dialogue-state for speech recognition in spoken dialogue systems
    Stemmer, G
    Nöth, E
    Niemann, H
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 439 - 444
  • [4] Speech act modeling and verification of spontaneous speech with disfluency in a spoken dialogue system
    Wu, CH
    Yan, GL
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (03): : 330 - 344
  • [5] Emotion recognition and adaptation in spoken dialogue systems
    Pittermann, Johannes
    Pittermann, Angela
    Minker, Wolfgang
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2010, 13 (01) : 49 - 60
  • [6] Predicting and adapting to poor speech recognition in a spoken dialogue system
    Litman, DJ
    Pan, S
    SEVENTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-2001) / TWELFTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-2000), 2000, : 722 - 728
  • [7] Improving the response timing estimation for spoken dialogue systems by reducing the effect of speech recognition delay
    Sakuma, Jin
    Fujie, Shinya
    Zhao, Huaibo
    Kobayashi, Tetsunori
    INTERSPEECH 2023, 2023, : 2668 - 2672
  • [8] YEAH RIGHT: SARCASM RECOGNITION FOR SPOKEN DIALOGUE SYSTEMS
    Tepperman, Joseph
    Traum, David
    Narayanan, Shrikanth
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1838 - +
  • [9] Recognition of Paralinguistic Information in Spoken Dialogue Systems for Elderly People
    Perez-Espinosa, Humberto
    Martinez-Miranda, Juan
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, MICAI 2015, PT I, 2015, 9413 : 107 - 117
  • [10] Enhancement of Spoken Dialogue Systems by Means of User Emotion Recognition
    Lopez-Cozar, Ramon
    Silovsky, Jan
    Griol, David
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (45): : 191 - 198