Improving generalisation to new speakers in spoken dialogue state tracking

被引:2
|
作者
Casanueva, Inigo [1 ]
Hain, Thomas [1 ]
Green, Phil [1 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield, S Yorkshire, England
基金
英国工程与自然科学研究理事会;
关键词
dialogue state tracking; dysarthric speakers;
D O I
10.21437/Interspeech.2016-404
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Users with disabilities can greatly benefit from personalised voice-enabled environmental-control interfaces, but for users with speech impairments (e.g. dysarthria) poor ASR performance poses a challenge to successful dialogue. Statistical dialogue management has shown resilience against high ASR error rates, hence making it useful to improve the performance of these interfaces. However, little research was devoted to dialogue management personalisation to specific users so far. Recently, data driven discriminative models have been shown to yield the best performance in dialogue state tracking (the inference of the user goal from the dialogue history). However, due to the unique characteristics of each speaker, training a system for a new user when user specific data is not available can be challenging due to the mismatch between training and working conditions. This work investigates two methods to improve the performance with new speakers of a LSTM-based personalised state tracker: The use of speaker specific acoustic and ASR related features; and dropout regularisation. It is shown that in an environmental control system for dysarthric speakers, the combination of both techniques yields improvements of 3.5% absolute in state tracking accuracy. Further analysis explores the effect of using different amounts of speaker specific data to train the tracking system.
引用
收藏
页码:2726 / 2730
页数:5
相关论文
共 50 条
  • [1] Target-based state and tracking algorithm for spoken dialogue system
    Li, Miao
    He, Zhiyang
    Wu, Ji
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2711 - 2715
  • [2] Adaptive Multi-Domain Dialogue State Tracking on Spoken Conversations
    Lim, Jungwoo
    Whang, Taesun
    Lee, Dongyub
    Lim, Heuiseok
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 727 - 732
  • [3] Improving Dialogue State Tracking by Discerning the Relevant Context
    Sharma, Sanuj
    Choubey, Prafulla Kumar
    Huang, Ruihong
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 576 - 581
  • [4] Improving Limited Labeled Dialogue State Tracking with Self-Supervision
    Wu, Chien-Sheng
    Hoi, Steven
    Xiong, Caiming
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4462 - 4472
  • [5] Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems
    Thomson, Blaise
    Young, Steve
    COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04): : 562 - 588
  • [6] Multimodal Dialogue State Tracking
    Le, Hung
    Chen, Nancy F.
    Hoi, Steven C. H.
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 3394 - 3415
  • [7] Speech repairs, intonational phrases, and discourse markers: Modeling speakers' utterances in spoken dialogue
    Heeman, PA
    Allen, JF
    COMPUTATIONAL LINGUISTICS, 1999, 25 (04) : 527 - 571
  • [8] Speech repairs, intonational phrases, and discourse markers: Modeling speakers' utterances in spoken dialogue
    Computer Science and Engineering, P.O. Box 91000, Portland, OR 97291, United States
    不详
    Comput. Linguist., 4 (527-571):
  • [9] Improving Long Distance Slot Carryover in Spoken Dialogue Systems
    Chen, Tongfei
    Naik, Chetan
    He, Hua
    Rastogi, Pushpendre
    Mathias, Lambert
    NLP FOR CONVERSATIONAL AI, 2019, : 96 - 105
  • [10] Using Information State to Improve Dialogue Move Identification in a Spoken Dialogue System
    Ai, Hua
    Roque, Antonio
    Leuski, Anton
    Traum, David
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2596 - +