Unsupervised language model adaptation for meeting recognition

被引:0
|
作者
Tur, Gokhan [1 ]
Stolcke, Andreas [1 ]
机构
[1] SRI Int, Speech Technol & Res Lab, 333 Ravenswood Ave, Menlo Pk, CA 94025 USA
关键词
speech processing; language modeling; meeting recognition; unsupervised adaptation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present an application of unsupervised language model (LM) adaptation to meeting recognition, in a scenario where sequences of multiparty meetings on related topics are to be recognized, but no prior in-domain data for LM training is available. The recognizer LMs are adapted according to the recognition output on temporally preceding meetings, either in speaker-dependent or speaker-independent mode. Model adaptation is carried out by interpolating the n-gram probabilities of a large generic LM with those of a small LM estimated from the adaptation data, and minimizing perplexity on the automatic transcripts of a separate meeting set, also previously recognized. The adapted LMs yield about 5-9% relative reduction in word error compared to the baseline. This improvement is about half of what can be achieved with supervised adaptation, i.e., using human-generated speech transcripts.
引用
收藏
页码:173 / +
页数:2
相关论文
共 50 条
  • [1] Unsupervised Language Model Adaptation by Data Selection for Speech Recognition
    Khassanov, Yerbolat
    Chong, Tze Yuang
    Bigot, Benjamin
    Chng, Eng Siong
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2017, PT I, 2017, 10191 : 508 - 517
  • [2] Unsupervised language model adaptation for handwritten Chinese text recognition
    Wang, Qiu-Feng
    Yin, Fei
    Liu, Cheng-Lin
    PATTERN RECOGNITION, 2014, 47 (03) : 1202 - 1216
  • [3] Unsupervised language model adaptation
    Bacchiani, M
    Roark, B
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 224 - 227
  • [4] EXPLOITING USER FEEDBACK FOR LANGUAGE MODEL ADAPTATION IN MEETING RECOGNITION
    Vergyri, Dimitra
    Stolcke, Andreas
    Tur, Gokhan
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4737 - 4740
  • [5] Unsupervised cross-adaptation approach for speech recognition by combined language model and acoustic model adaptation
    School of Science and Engineering, Yamagata University, Yonezawa, Japan
    APSIPA ASC - Asia-Pac. Signal Inf. Process. Assoc. Annu. Summit Conf., (943-946):
  • [6] Unsupervised class-based language model adaptation for spontaneous speech recognition
    Yokoyama, T
    Shinozaki, T
    Iwano, K
    Furui, S
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 236 - 239
  • [7] Unsupervised crosslingual adaptation of tokenisers for spoken language recognition
    Ng, Raymond W. M.
    Nicolao, Mauro
    Hain, Thomas
    COMPUTER SPEECH AND LANGUAGE, 2017, 46 : 327 - 342
  • [8] Unsupervised language model adaptation for broadcast news
    Chen, LZ
    Gauvain, JL
    Lamel, L
    Adda, G
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 220 - 223
  • [9] Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition
    Ito, Akinori
    Kajiura, Yasutomo
    Suzuki, Motoyuki
    Makino, Shozo
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2009,
  • [10] Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition
    Akinori Ito
    Yasutomo Kajiura
    Motoyuki Suzuki
    Shozo Makino
    EURASIP Journal on Audio, Speech, and Music Processing, 2009