ROLE PLAY DIALOGUE TOPIC MODEL FOR LANGUAGE MODEL ADAPTATION IN MULTI-PARTY CONVERSATION SPEECH RECOGNITION

被引:0
|
作者
Masumura, Ryo [1 ]
Oba, Takanobu [1 ]
Masataki, Hirokazu [1 ]
Yoshioka, Osamu [1 ]
Takahashi, Satoshi [1 ]
机构
[1] NTT Corp, NTT Media Intelligence Labs, Tokyo, Japan
关键词
Unsupervised language model adaptation; multi-party conversation speech recognition; topic model;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper introduces an unsupervised language model adaptation technique for multi-party conversation speech recognition. The use of topic models provides one of the most accurate frameworks for unsupervised language model adaptation since they can inject long-range topic information into language models. However, conventional topic models are not suitable for multi-party conversation because they assume that each speech set has each different topic. In a multi-party conversation, each speaker will share the same conversation topic and each speaker utterance will depend on both topic and speaker role. Accordingly, this paper proposes new concept of the "role play dialogue topic model" to utilize multiparty conversation attributes. The proposed topic model can share the topic distribution among each speaker and can also consider both topic and speaker role. The proposed topic model based adaptation realizes a new framework that sets multiple recognition hypotheses for each speaker and simultaneously adapts a language model for each speaker role. We use a call center dialogue data set in speech recognition experiments to show the effectiveness of the proposed method.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] A General Equilibrium Model of Multi-Party Competition
    Marek M. Kaminski
    Social Choice and Welfare, 2006, 26 : 333 - 361
  • [32] A general equilibrium model of multi-party competition
    Kaminski, Marek M.
    SOCIAL CHOICE AND WELFARE, 2006, 26 (02) : 333 - 361
  • [33] Data Anonymity in Multi-Party Service Model
    Kiyomoto, Shinsaku
    Fukushima, Kazuhide
    Miyake, Yutaka
    SECURITY TECHNOLOGY, 2011, 259 : 21 - 30
  • [34] Multi-party Human-computer Interaction Dialogue Psychology Model Based on Stackelberg Game
    HUANG, Hongcheng
    SU, Meidan
    KOU, Lan
    TAO, Yang
    HU, Min
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (05) : 1758 - 1765
  • [35] Transformer-based Multi-Party Conversation Generation using Dialogue Discourse Acts Planning
    Chernyavskiy, Alexander
    Ilvovsky, Dmitry
    24TH MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE, SIGDIAL 2023, 2023, : 519 - 529
  • [36] Topic-Dependent Language Model Switching for Embedded Automatic Speech Recognition
    Santos-Perez, Marcos
    Gonzalez-Parada, Eva
    Manuel Cano-Garcia, Jose
    AMBIENT INTELLIGENCE - SOFTWARE AND APPLICATIONS, 2012, 153 : 235 - 242
  • [37] MULTI-TURN RNN-T FOR STREAMING RECOGNITION OF MULTI-PARTY SPEECH
    Sklyar, Ilya
    Piunova, Anna
    Zheng, Xianrui
    Liu, Yulan
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8402 - 8406
  • [38] Attention-based Contextual Language Model Adaptation for Speech Recognition
    Martinez, Richard Diehl
    Novotney, Scott
    Bulyko, Ivan
    Rastrow, Ariya
    Stolcke, Andreas
    Gandhe, Ankur
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 1994 - 2003
  • [39] Recurrent Neural Network Language Model Adaptation for Conversational Speech Recognition
    Li, Ke
    Xu, Hainan
    Wang, Yiming
    Povey, Daniel
    Khudanpur, Sanjeev
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3373 - 3377
  • [40] Language Model Adaptation for Emotional Speech Recognition using Tweet data
    Saeki, Kazuya
    Kato, Masaharu
    Kosaka, Tetsuo
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 371 - 375