A Framework for Efficient Document Ranking Using Order and Non Order Based Fitness Function

被引:0
作者
Imran, Hazra [1 ,2 ]
Sharan, Aditi [3 ]
机构
[1] Jamia Hamdard, Dept Comp Sci, New Delhi, India
[2] Jawaharlal Nehru Univ, New Delhi, India
[3] Jawaharlal Nehru Univ, Sch Comp & Syst Sci, New Delhi, India
来源
INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III | 2010年
关键词
Document retrieval; genetic algorithms; similarity measures; information retrieval; vector space Model; RETRIEVAL;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
One central problem of information retrieval is to determine the relevance of documents with respect to the user information needs. The choice of similarity measure is crucial for improving search effectiveness of a retrieval system. Different similarity measures have been suggested to match the query and documents. This study investigates the use of Genetic Algorithm to increase the efficiency of information retrieval by defining a combined similarity measure. Genetic Algorithm has been used for learning weights of the components of the combined similarity measure. We have provided a weight-learning algorithm for the same. We have considered order based and non-order based fitness functions to evaluate the goodness of the solution. A non-order based fitness function is based on recall-precision values only. However, it has been observed that a better fitness function can be obtained if we also consider the order in which relevant documents are retrieved. This leads to an idea of order based fitness functions. We evaluated the efficacy of a genetic algorithm with various fitness functions. Further, we provide a framework for applying generic algorithms to improve the retrieval efficiency by combing various similarity measures. The experiments have been carried out on TREC data collection. The results have been compared with various well-known similarity measures.
引用
收藏
页码:71 / +
页数:3
相关论文
共 12 条
  • [1] BOOKSTEIN A, 1985, ANNU REV INFORM SCI, V20, P117
  • [2] Chang C. T., 1999, THESIS NATL TAIWAN U
  • [3] Goldberg DE., 1989, GENETIC ALGORITHMS S, V13
  • [4] Jose R, 2008, IMPROVING QUERY EXPA
  • [5] Kraft D. H., 1994, Proceedings of the First IEEE Conference on Evolutionary Computation. IEEE World Congress on Computational Intelligence (Cat. No.94TH0650-2), P468, DOI 10.1109/ICEC.1994.349905
  • [6] Kwok KL, 1997, PROCEEDINGS OF THE 20TH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, P34, DOI 10.1145/278459.258531
  • [7] MartinBautista MJ, 1997, PROCEEDINGS OF THE SIXTH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS I - III, P1227, DOI 10.1109/FUZZY.1997.619463
  • [8] Michalewicz Z., 1996, Genetic Algorithms + Data Structures = Evolution Programs, V3rd
  • [9] Perez-Aguera R. Jose, 2007, BCS IRSG S FUT DIR I
  • [10] Robertson S. E., 1997, INFORM PROCESSING MA, V13, P247