I-VECTOR BASED LANGUAGE MODELING FOR SPOKEN DOCUMENT RETRIEVAL

被引:0
作者
Chen, Kuan-Yu [1 ]
Lee, Hung-Shin [1 ]
Wang, Hsin-Min [1 ]
Chen, Berlin
Chen, Hsin-Hsi
机构
[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
来源
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年
关键词
Spoken document retrieval; i-vector; language modeling; inductive; transductive; SPEAKER; MATRIX;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Since more and more multimedia data associated with spoken documents have been made available to the public, spoken document retrieval (SDR) has become an important research subject in the past two decades. The i-vector based framework has been proposed and introduced to language identification (LID) and speaker recognition (SR) tasks recently. The major contribution of the i-vector framework is to reduce a series of acoustic feature vectors of a speech utterance to a low-dimensional vector representation, and then numbers of well-developed post-processing techniques (such as probabilistic linear discriminative analysis, PLDA) can be readily and effectively used. However, to our best knowledge, there is no research up to date on applying the i-vector framework for SDR or information retrieval (IR). In this paper, we make a step forward to formulate an i-vector based language modeling (IVLM) framework for SDR. Furthermore, we evaluate the proposed IVLM framework with both inductive and transductive learning strategies. We also exploit multi-levels of index features, including word-and subword-level units, in concert with the proposed framework. The results of SDR experiments conducted on the TDT-2 (Topic Detection and Tracking) collection demonstrate the performance merits of our proposed framework when compared to several existing approaches.
引用
收藏
页数:5
相关论文
共 36 条
  • [1] [Anonymous], INTERSPEECH 2011
  • [2] [Anonymous], 2011, INTERSPEECH
  • [3] [Anonymous], 2009, Text Mining: Theory and Applications, DOI DOI 10.1201/9781420059458.CH4
  • [4] [Anonymous], 2011, P 12 ANN C INT SPEEC, DOI 10.21437/interspeech.2011-58
  • [5] [Anonymous], 2008, Introduction to information retrieval
  • [6] [Anonymous], P INTERSPEECH
  • [7] [Anonymous], 1998, SIGIR 98 P 21 ANN IN, DOI DOI 10.1145/290941.291008
  • [8] [Anonymous], INT J COMPUTATIONAL
  • [9] [Anonymous], 2000, PROJ TOP DET TRACK
  • [10] Using linear algebra for intelligent information retrieval
    Berry, MW
    Dumais, ST
    OBrien, GW
    [J]. SIAM REVIEW, 1995, 37 (04) : 573 - 595