Leveraging Relevance Cues for Improved Spoken Document Retrieval

被引:0
作者
Chen, Pei-Ning [1 ]
Chen, Kuan-Yu [2 ]
Chen, Berlin [1 ]
机构
[1] Natl Taiwan Normal Univ, Taipei, Taiwan
[2] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
来源
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011年
关键词
spoken document retrieval; language modeling; relevance model; topic model; Kullback-Leibler divergence;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spoken document retrieval (SDR) has emerged as an active area of research in the speech processing community. The fundamental problems facing SDR are generally three-fold: 1) a query is often only a vague expression of an underlying information need, 2) there probably would be word usage mismatch between a query and a spoken document even if they are topically related to each other, and 3) the imperfect speech recognition transcript carries wrong information and thus deviates somewhat from representing the true theme of a spoken document. To mitigate the above problems, in this paper, we study a novel use of a relevance language modeling framework for SDR. It not only inherits the merits of several existing techniques but also provides a flexible but systematic way to render the lexical and topical relationships between a query and a spoken document. Moreover, we also investigate representing the query and documents with different granularities of index features to work in conjunction with the Various relevance cues. Experiments conducted on the TDT SDR task show promise of the methods deduced from our retrieval framework when compared with a few existing retrieval methods.
引用
收藏
页码:936 / +
页数:2
相关论文
共 17 条
[1]  
[Anonymous], 2009, Text Mining: Theory and Applications, DOI DOI 10.1201/9781420059458.CH4
[2]  
[Anonymous], 2000, PROJ TOP DET TRACK
[3]  
[Anonymous], 2011, Modern Information Retrieval: The Concepts and Technology behind Search
[4]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[5]  
Chelba C, 2008, IEEE SIGNAL PROC MAG, V25, P39, DOI 10.1109/MSP.200S.917992
[6]  
Chen B., 2009, ACM T ASIAN LANGUAGE, V8, P2
[7]  
Chen B., 2011, IEEE T AUDIO SPEECH
[8]  
Chen B., P ICASSP 2009
[9]  
Chen K. Y., P ICASSP 2011
[10]   Unsupervised learning by probabilistic latent semantic analysis [J].
Hofmann, T .
MACHINE LEARNING, 2001, 42 (1-2) :177-196