Structural query expansion based on weighted query term for XML documents

被引:7
作者
School of Information and Technology, Jiangxi University of Finance and Economics, Nanchang 330013, China [1 ]
不详 [2 ]
机构
[1] School of Information and Technology, Jiangxi University of Finance and Economics
[2] Jiangxi Key Laboratory of Data and Knowledge Engineering, Jiangxi University of Finance and Economics
来源
Ruan Jian Xue Bao/Journal of Software | 2008年 / 19卷 / 10期
关键词
Information retrieval; Relevance feedback; Structural query expansion; Structural semantics; XML;
D O I
10.3724/SP.J.1001.2008.02611
中图分类号
学科分类号
摘要
The main reason of low precision in information retrieval (IR) is that it is difficult for the users to submit a precise query expression for their query intensions. Furthermore, XML documents have characteristics not only in the content, but also in its structure. Therefore it is more difficult for users to submit precise query expressions. In order to solve this problem, this paper puts forward a new query expansion method based on relevance feedback. It can help users to construct a content and structure query expression which can satisfy users' intentions. This method includes two steps. The first step is to expand keywords for finding the weighted keyword which can represent the user's intentions. The second step is structural expansion based on the weighted keywords. Finally a full-edged content-structure query is formalized. Experimental results show that the method can obtain better retrieval results. The average precision of prec@10 and prec@20 is 30% higher than the original query.
引用
收藏
页码:2611 / 2619
页数:8
相关论文
共 31 条
[1]  
Voorhees E., Harman D., Overview of the 6th text retrieval Conf, Proc. of the 6th Text Retrieval Conf, pp. 1-24, (1997)
[2]  
Lu B.B., Zhao J., Query expansion based on modeling of relevant documents pool, Journal of Chinese Information, 20, 3, pp. 78-83, (2006)
[3]  
Liu S.H., Wu G.S., Zhang F.Y., Applying relevance feedback to information retrieval using keyword and weight algorithms, Journal of the China Society for Scientific and Technical Information, 21, 6, pp. 668-673, (2002)
[4]  
Huo H., Feng B.Q., Zhao S.S., Retrieval algorithm combining multi-query data fusion with positive relevance feedback, Journal of Xi'an Jiaotong University, 39, 8, pp. 820-823, (2005)
[5]  
Song L.L., Cheng Y., About term ranking methods in relevance feedback, New Technology of Library and Information Service, 8, pp. 44-47, (2004)
[6]  
Mitra M., Singhal A., Buckley C., Improving automatic query expansion, Proc. of the 21st Annual Int'l ACM SIGIR Conf. on Research and Development in Information Retrieval, pp. 206-214, (1998)
[7]  
Amo P., Ferreras F.L., Cruz F., Rosa M., Smoothing functions for automatic relevance feedback in information retrieval, Proc. of the 11th Int'l Workshop on Database and Expert Systems Applications, pp. 115-119, (2000)
[8]  
Lu A., Ayoub M., Dong J., Ad hoc experiments using EUREKA, Proc. of the 5th Text Retrieval Conf., pp. 229-240, (1996)
[9]  
Buckley C., Mitra M., Walz J., Cardie C., Using clustering and SuperConcepts within SMART, Proc. of the 6th Text Retrieval Conf, pp. 107-124, (1997)
[10]  
Claudio C., Giovanni R., Vittorio G., Improving retrieval feedback with multiple Term2Ranking function combination, ACM Trans. on Information Systems, 20, 3, pp. 259-290, (2002)