Hierarchical language models for expert finding in enterprise corpora

被引:32
作者
Petkova, Desislava [1 ]
Croft, W. Bruce [1 ]
机构
[1] Univ Massachusetts, Dept Comp Sci, Ctr Intelligent Informat Retrieval, Amherst, MA 01003 USA
关键词
enterprise search; expert finding; language modeling;
D O I
10.1142/S0218213008003741
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Enterprise corpora contain evidence of what employees work on and therefore can be used to automatically find experts on a given topic. We present a general approach for representing the knowledge of a potential expert as a mixture of language models from associated documents. First we retrieve documents given the expert's name using a generative probabilistic technique and weight the retrieved documents according to expert-specific posterior distribution. Then we model the expert indirectly through the set of associated documents, which allows us to exploit their underlying structure and complex language features. Experiments show that our method has excellent performance on the expert search task of the TREC Enterprise track and that it effectively collects and combines evidence for expertise in a heterogeneous collection.
引用
收藏
页码:5 / 18
页数:14
相关论文
共 19 条
[1]   Augmenting organizational memory: A field study of answer garden [J].
Ackerman, MS .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1998, 16 (03) :203-224
[2]  
[Anonymous], INDRI LANGUAGE MODEL
[3]  
[Anonymous], 2003, P 26 ANN INT ACM SIG
[4]  
BALOG K, 2006, SIGIR 06
[5]  
BALOG K, 2006, P 15 INT C WORLD WID, P1035, DOI DOI 10.1145/1135777.1136002
[6]  
CAO Y, 2005, TREC 2005
[7]  
CRASWELL N, 2005, TREC 2005
[8]  
CRASWELL N, 2001, AUSW POST P
[9]  
Croft B, 2003, LANGUAGE MODELING IN, V13
[10]  
FU Y, 2005, TREC 2005