Learning Topic-Oriented Word Embedding for Query Classification

Cited by: 10
Authors
Yang, Hebin [1, 2]
Hu, Qinmin [1, 2]
He, Liang [1, 2]
Affiliations
[1] E China Normal Univ, Dept Comp Sci & Technol, Shanghai 200241, Peoples R China
[2] E China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai 200241, Peoples R China
Source
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART I | 2015 / Vol. 9077
Keywords
Query classification; Word embedding; Word2vec; Supervised learning
DOI
10.1007/978-3-319-18038-0_15
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose a topic-oriented word embedding approach to the query classification problem. First, topic information is encoded to generate query categories. Then, user click-through information is incorporated into the modified word embedding algorithms. Next, the short and ambiguous queries are enriched and classified in a supervised manner. Our main contribution is a set of four neural network strategies based on the proposed model. Experiments are conducted on two open data sets from Baidu and Sogou, two well-known commercial search companies. The evaluation results show that the proposed approach is promising on both large data sets. Under the four proposed strategies, we achieve performance as high as 95.73% in Precision and 97.79% in F1 measure.
Pages: 188-198
Page count: 11