Web people search via connection analysis

Cited by: 34
Authors
Kalashnikov, Dmitri V. [1]
Chen, Zhaoqi [1]
Mehrotra, Sharad [1]
Nuray-Turan, Rabia [1]
Affiliations
[1] Univ Calif Irvine, Dept Comp Sci, Donald Bren Sch Informat & Comp Sci, Irvine, CA 92697 USA
Funding
National Science Foundation (USA)
Keywords
web people search; entity resolution; graph-based disambiguation; social network analysis; clustering
DOI
10.1109/TKDE.2008.78
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Nowadays, searches for the web pages of a person with a given name constitute a notable fraction of queries to Web search engines. Such a query normally returns web pages related to several namesakes who happen to share the queried name, leaving the burden of disambiguating and collecting the pages relevant to a particular person (from among the namesakes) on the user. In this paper, we develop a Web People Search approach that clusters web pages based on their association with different people. Our method exploits a variety of semantic information extracted from web pages, such as named entities and hyperlinks, to disambiguate among namesakes referred to on the web pages. We demonstrate the effectiveness of our approach by testing the efficacy of the disambiguation algorithms and their impact on person search.
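As a rough illustration of the clustering idea described in the abstract, the sketch below links any two pages returned for one queried name when they share enough extracted features (named entities or hyperlink targets) and treats each connected group as one person. This is a simplified stand-in, not the paper's algorithm: the function name `cluster_pages`, the `min_shared` threshold, and the toy data are all assumptions made for illustration.

```python
from itertools import combinations


def cluster_pages(page_features, min_shared=2):
    """Group pages whose extracted features overlap.

    page_features: dict mapping a page id to a set of features
    (named entities, hyperlink targets) extracted from that page.
    min_shared: hypothetical threshold on the number of shared
    features required to link two pages.
    Returns a list of sets of page ids, one set per presumed person.
    """
    # Union-find structure: every page starts in its own cluster.
    parent = {page: page for page in page_features}

    def find(page):
        # Walk up to the cluster representative, halving the path as we go.
        while parent[page] != page:
            parent[page] = parent[parent[page]]
            page = parent[page]
        return page

    # Merge the clusters of any two pages that share enough features.
    for a, b in combinations(page_features, 2):
        if len(page_features[a] & page_features[b]) >= min_shared:
            parent[find(a)] = find(b)

    # Collect pages by their final representative.
    clusters = {}
    for page in page_features:
        clusters.setdefault(find(page), set()).add(page)
    return list(clusters.values())


# Toy input: all features below are invented for illustration.
pages = {
    "p1": {"UC Irvine", "databases", "example.edu/lab"},
    "p2": {"UC Irvine", "databases", "entity resolution"},
    "p3": {"jazz", "saxophone", "New Orleans"},
    "p4": {"saxophone", "jazz", "example.org/band"},
}
clusters = cluster_pages(pages)
```

Raising `min_shared` makes the grouping more conservative, so fewer pages are merged. A real system would likely weight features rather than count them equally.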
Pages: 1550-1565
Page count: 16