Speech act modeling and verification of spontaneous speech with disfluency in a spoken dialogue system

被引:15
|
作者
Wu, CH [1 ]
Yan, GL [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
来源
关键词
Bayesian belief model; disfluency modeling; speech act modeling; spoken dialogue;
D O I
10.1109/TSA.2005.845820
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This work presents an approach to modeling speech acts and verifying spontaneous speech with disfluency in a spoken dialogue system. According to this approach, semantic information, syntactic structure and fragment class of an input utterance are statistically encapsulated in a proposed speech act hidden Markov model (SAHMM) to characterize the speech act. An interpolation mechanism is exploited to re-estimate the state transition probability in SAHMM, to deal with the problem of disfluency in a sparse training corpus. Finally, a Bayesian belief model (BBM), based on latent semantic analysis (LSA), is adopted to verify the potential speech acts and output the final speech act. Experiments were conducted to evaluate the proposed approach using a spoken dialogue system for providing air travel information. A testing database from 25 speakers, with 480 dialogues that include 3038 sentences, was established and used for evaluation. Experimental results show that the proposed approach identifies 95.3% of speech act at a rejection rate of 5%, and the semantic accuracy is 4.2% better than that obtained using a keyword-based system. The proposed strategy also effectively alleviates the disfluency problem in spontaneous speech.
引用
收藏
页码:330 / 344
页数:15
相关论文
共 50 条
  • [1] SPEECH ACT COMPREHENSION IN SPOKEN DIALOGUE: AN ERP STUDY
    Gisladottir, Rosa
    Chwilla, Dorothee
    Levinson, Stephen
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2013, : 78 - 79
  • [2] SPONTANEOUS SPEECH RECOGNITION FOR ROMANIAN IN SPOKEN DIALOGUE SYSTEMS
    Burileanu, Corneliu
    Popescu, Vladimir
    Buzo, Andi
    Petrea, Cristina Sorina
    Ghelmez-Hanes, Diana
    PROCEEDINGS OF THE ROMANIAN ACADEMY SERIES A-MATHEMATICS PHYSICS TECHNICAL SCIENCES INFORMATION SCIENCE, 2010, 11 (01): : 83 - 91
  • [3] Predicting and adapting to poor speech recognition in a spoken dialogue system
    Litman, DJ
    Pan, S
    SEVENTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-2001) / TWELFTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-2000), 2000, : 722 - 728
  • [4] Spoken Dialogue System for Call Centers with Expressive Speech Synthesis
    Nicmanis, Davis
    Salimbajevs, Askars
    INTERSPEECH 2022, 2022, : 5215 - 5218
  • [5] Dialogue act modeling for automatic tagging and recognition of conversational speech
    Stolcke, A
    Ries, K
    Coccaro, N
    Shriberg, E
    Bates, R
    Jurafsky, D
    Taylor, P
    Martin, R
    Van Ess-Dykema, C
    Meteer, M
    COMPUTATIONAL LINGUISTICS, 2000, 26 (03) : 339 - 373
  • [6] Modeling speech disfluency to predict conceptual misalignment in speech survey interfaces
    Ehlen, Patrick
    Schober, Michael F.
    Conrad, Frederick G.
    DISCOURSE PROCESSES, 2007, 44 (03) : 245 - 265
  • [7] Dialogue act classification in a spoken dialogue system
    Castro, MJ
    Vilar, D
    Aibar, P
    Sanchis, E
    CURRENT TOPICS IN ARTIFICIAL INTELLIGENCE, 2004, 3040 : 260 - 270
  • [8] When can listeners detect disfluency in spontaneous speech?
    Lickley, RJ
    Bard, EG
    LANGUAGE AND SPEECH, 1998, 41 : 203 - 226
  • [9] Analysis of head motions and speech in spoken dialogue
    Ishi, Carlos T.
    Ishiguro, Hiroshi
    Hagita, Norihiro
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1485 - 1488
  • [10] On Appropriateness and Estimation of the Emotion of Synthesized Response Speech in a Spoken Dialogue System
    Kase, Taketo
    Nose, Takashi
    Ito, Akinori
    HCI INTERNATIONAL 2015 - POSTERS' EXTENDED ABSTRACTS, PT I, 2015, 528 : 747 - 752