ROLE PLAY DIALOGUE TOPIC MODEL FOR LANGUAGE MODEL ADAPTATION IN MULTI-PARTY CONVERSATION SPEECH RECOGNITION

被引:0
|
作者
Masumura, Ryo [1 ]
Oba, Takanobu [1 ]
Masataki, Hirokazu [1 ]
Yoshioka, Osamu [1 ]
Takahashi, Satoshi [1 ]
机构
[1] NTT Corp, NTT Media Intelligence Labs, Tokyo, Japan
关键词
Unsupervised language model adaptation; multi-party conversation speech recognition; topic model;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper introduces an unsupervised language model adaptation technique for multi-party conversation speech recognition. The use of topic models provides one of the most accurate frameworks for unsupervised language model adaptation since they can inject long-range topic information into language models. However, conventional topic models are not suitable for multi-party conversation because they assume that each speech set has each different topic. In a multi-party conversation, each speaker will share the same conversation topic and each speaker utterance will depend on both topic and speaker role. Accordingly, this paper proposes new concept of the "role play dialogue topic model" to utilize multiparty conversation attributes. The proposed topic model can share the topic distribution among each speaker and can also consider both topic and speaker role. The proposed topic model based adaptation realizes a new framework that sets multiple recognition hypotheses for each speaker and simultaneously adapts a language model for each speaker role. We use a call center dialogue data set in speech recognition experiments to show the effectiveness of the proposed method.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Dialogue speech recognition by combining hierarchical topic classification and language model switching
    Lane, IR
    Kawahara, T
    Matsui, T
    Nakamura, S
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): : 446 - 454
  • [2] Dialogue Modelling in Multi-party Social Media Conversation
    Dutta, Subhabrata
    Das, Dipankar
    TEXT, SPEECH, AND DIALOGUE, TSD 2017, 2017, 10415 : 219 - 227
  • [3] MODEL-BASED COMPRESSIVE SENSING FOR MULTI-PARTY DISTANT SPEECH RECOGNITION
    Asaei, Afsaneh
    Bourlard, Herve
    Cevher, Volkan
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4600 - 4603
  • [4] Multi-Party Conversation Modeling for Emotion Recognition
    Quan, Xiaojun
    Wu, Siyue
    Chen, Junqing
    Shen, Weizhou
    Yu, Jianxing
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (03) : 751 - 768
  • [5] TOPIC N-GRAM COUNT LANGUAGE MODEL ADAPTATION FOR SPEECH RECOGNITION
    Haidar, Md. Akmal
    O'Shaughnessy, Douglas
    2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 165 - 169
  • [6] TOPIC CACHE LANGUAGE MODEL FOR SPEECH RECOGNITION
    Chueh, Chuang-Hua
    Chien, Jen-Tzung
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5194 - 5197
  • [7] Topic cache language model for speech recognition
    Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, 70101, Taiwan
    ICASSP IEEE Int Conf Acoust Speech Signal Process Proc, 2010, (5194-5197):
  • [8] Topic tracking language model for speech recognition
    Watanabe, Shinji
    Iwata, Tomoharu
    Hori, Takaaki
    Sako, Atsushi
    Ariki, Yasuo
    COMPUTER SPEECH AND LANGUAGE, 2011, 25 (02): : 440 - 461
  • [9] MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation Understanding
    Gu, Jia-Chen
    Tao, Chongyang
    Ling, Zhen-Hua
    Xu, Can
    Geng, Xiubo
    Jianel, Daxin
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 3682 - 3692
  • [10] Topic identification techniques applied to dynamic language model adaptation for automatic speech recognition
    Echeverry-Correa, J. D.
    Ferreiros-Lopez, J.
    Coucheiro-Limeres, A.
    Cordoba, R.
    Montero, J. M.
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (01) : 101 - 112