A learning-to-rank method for information updating task

被引:4
作者
Minh Quang Nhat Pham [1 ]
Minh Le Nguyen [1 ]
Bach Xuan Ngo [1 ]
Shimazu, Akira [1 ]
机构
[1] Japan Adv Inst Sci & Technol, Nomi, Ishikawa 9231292, Japan
关键词
Learning-to-rank; Information updating; Online hierarchical ranking; Legal domain;
D O I
10.1007/s10489-012-0343-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Our paper addresses the information updating task which is to determine the most appropriate location in an existing document to place a new piece of related information. We propose a new learning-to-rank method for the information updating task. The updating task is formalized as a learning-to-rank problem, and in training, a heuristic method of automatically assigning labels for training examples is proposed to exploit structural information of documents. With the proposed formulation, state-of-the-art learning-to-rank algorithms can be applied to the task. We deal with the problem of the lack of semantic information by incorporating semantic features derived from word clusters to further improve the performance of information updating. The proposed method is applied in updating Wikipedia biographical articles and Legal documents. Experimental results achieved on both Wikipedia biographical data set and Legal data set showed that our proposed learning-to-rank method with cluster-based features outperforms previously reported methods for information updating task.
引用
收藏
页码:499 / 510
页数:12
相关论文
共 28 条
[1]  
[Anonymous], 2005, INT C MACH LEARN
[2]  
[Anonymous], 2007, P 1 INT WORKSH JURIS
[3]  
[Anonymous], 2008, P 25 INT C MACH LEAR, DOI [10.1145/1390156.1390306, DOI 10.1145/1390156.1390306]
[4]  
[Anonymous], 1998, PRELIMINARY RECOMMEN
[5]  
Baker L. D., 1998, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P96, DOI 10.1145/290941.290970
[6]  
Bekkerman R., 2003, Journal of Machine Learning Research, V3, P1183, DOI 10.1162/153244303322753625
[7]  
Brown P. F., 1992, Computational Linguistics, V18, P467
[8]  
Cao Z., 2007, P 24 INT C MACH LEAR, P129, DOI DOI 10.1145/1273496.1273513
[9]  
Caruana R, 1996, ADV NEUR IN, V8, P959
[10]  
Chen E, 2008, THESIS MIT