Speech act modeling and verification of spontaneous speech with disfluency in a spoken dialogue system

Cited by: 15
Authors
Wu, CH [1]
Yan, GL [1]
Affiliations
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
Keywords
Bayesian belief model; disfluency modeling; speech act modeling; spoken dialogue
DOI
10.1109/TSA.2005.845820
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This work presents an approach to modeling speech acts and verifying spontaneous speech with disfluency in a spoken dialogue system. In this approach, the semantic information, syntactic structure and fragment class of an input utterance are statistically encapsulated in a proposed speech act hidden Markov model (SAHMM) to characterize the speech act. An interpolation mechanism is used to re-estimate the state transition probabilities in the SAHMM, to deal with the problem of disfluency in a sparse training corpus. Finally, a Bayesian belief model (BBM), based on latent semantic analysis (LSA), is adopted to verify the potential speech acts and output the final speech act. Experiments were conducted to evaluate the proposed approach using a spoken dialogue system for providing air travel information. A testing database from 25 speakers, with 480 dialogues that include 3038 sentences, was established and used for evaluation. Experimental results show that the proposed approach identifies 95.3% of speech acts at a rejection rate of 5%, and the semantic accuracy is 4.2% better than that obtained using a keyword-based system. The proposed strategy also effectively alleviates the disfluency problem in spontaneous speech.
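The abstract does not say which interpolation scheme the SAHMM uses. As an illustration only, the sketch below shows one common way to smooth sparse HMM transition estimates: linear interpolation between the maximum-likelihood bigram estimate and a unigram back-off distribution. The function name, the weight `lam`, and the state labels are assumptions made for this example and are not taken from the paper.

```python
from collections import Counter, defaultdict


def interpolated_transitions(sequences, states, lam=0.7):
    """Smooth transition probabilities by linear interpolation.

    P(next | prev) = lam * ML bigram estimate + (1 - lam) * unigram estimate.
    This is a generic smoothing scheme, not necessarily the paper's method.
    Every symbol in `sequences` is assumed to appear in `states`.
    """
    bigram = defaultdict(Counter)   # counts of prev -> next transitions
    unigram = Counter()             # counts of each state overall
    for seq in sequences:
        unigram.update(seq)
        for prev, nxt in zip(seq, seq[1:]):
            bigram[prev][nxt] += 1

    total = sum(unigram.values())
    probs = {}
    for prev in states:
        row_total = sum(bigram[prev].values())
        # With no outgoing counts, rely entirely on the back-off distribution.
        weight = lam if row_total else 0.0
        probs[prev] = {}
        for nxt in states:
            ml = bigram[prev][nxt] / row_total if row_total else 0.0
            backoff = unigram[nxt] / total if total else 1.0 / len(states)
            probs[prev][nxt] = weight * ml + (1 - weight) * backoff
    return probs


# Hypothetical state labels, chosen only to make the example readable.
states = ["query", "filler", "confirm"]
sequences = [["query", "filler", "query"], ["query", "confirm"]]
probs = interpolated_transitions(sequences, states)
```

With this kind of smoothing, a transition that never occurs in a small training set, such as query to query here, still receives a nonzero probability, which is the general effect an interpolation step is meant to have on sparse data.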
Pages: 330-344
Page count: 15
Related papers
50 in total
  • [21] Improving Dialogue Act Classification for Spontaneous Arabic Speech and Instant Messages at Utterance Level
    Elmadany, AbdelRahim A.
    Abdou, Sherif M.
    Gheith, Mervat
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 128 - 134
  • [22] Contextual maximum entropy model for edit disfluency detection of spontaneous speech
    Yeh, Jui-Feng
    Wu, Chung-Hsien
    Wu, Wei-Yen
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 578 - +
  • [23] Prosodic focus control in reply speech generation for a spoken dialogue system of information retrieval
    Kiriyama, S
    Hirose, K
    Minematsu, N
    PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 139 - 142
  • [24] Detection of Repetitions in Spontaneous Speech in Dialogue Sessions
    Cevik, Mert
    Weng, Fuliang
    Lee, Chin-Hui
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 471 - +
  • [25] ISCA Tutorial and Research Workshop on Disfluency in Spontaneous Speech, DiSS 2001
    ISCA Tutorial and Research Workshop on Disfluency in Spontaneous Speech, DiSS 2001, 2001,
  • [26] End-to-End Spontaneous Speech Recognition Using Disfluency Labeling
    Horii, Koharu
    Fukuda, Meiko
    Ohta, Kengo
    Nishimura, Ryota
    Ogawa, Atsunori
    Kitaoka, Norihide
    INTERSPEECH 2022, 2022, : 4108 - 4112
  • [27] 6th Workshop on Disfluency in Spontaneous Speech, DiSS 2013
    6th Workshop on Disfluency in Spontaneous Speech, DiSS 2013, 2013,
  • [28] Language Modeling for Speech Recognition of Spoken Cantonese
    Yeung, Yu Ting
    Cao, Houwei
    Zheng, N. H.
    Lee, Tan
    Ching, P. C.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1570 - 1573
  • [29] Predicting dialogue acts for a speech-to-speech translation system
    Reithinger, N
    Engel, R
    Kipp, M
    Klesen, M
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 654 - 657
  • [30] AUTOMATIC FLUENCY EVALUATION OF SPONTANEOUS SPEECH USING DISFLUENCY-BASED FEATURES
    Deng, Huaijin
    Lin, Youchao
    Utsuro, Takehito
    Kobayashi, Akio
    Nishizaki, Hiromitsu
    Hoshino, Junichi
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 9239 - 9243