Improving Retrieval Performance Of English-Hindi Based Cross-Language Information Retrieval

被引:0
作者
Varshney, Saurabh [1 ]
Bajpai, Jyoti [1 ]
机构
[1] GLA Univ, Dept CEA, Mathura, UP, India
来源
PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE IN MOOC, INNOVATION AND TECHNOLOGY IN EDUCATION (MITE) | 2013年
关键词
Cross Language Information Retrieval; Pre and Post query expansion; query ambiguity; Keyword ranking; Statistical co-occurrence of initial query; FIRE data collection;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The hurdle problem in Cross Language Information Retrieval (CLIR) is the poor performance when compared to monolingual performance in terms of average precision. The main reasons behind the poor performance of CLIR are query term mismatching, multiple representations of query terms and un-translated query terms. In this paper, we are putting our effort to solve the given problem which is discussed in detail. The limitations are needed to be addressed in order to increase the performance of the CLIR system. By analyzing those methods the architecture for English-Hindi CLIR system is proposed. Pre and post query expansion is used to improve the performance of English-Hindi CLIR system using English and Hindi WordNet, Local Expansion using initial query, definition based pre query expansion and keyword ranking. The pre and post query expansion helps to improving the performance of English-Hindi CLIR system and based upon past experiences the proposed approach retrieves more relevant information. All experiments are performed on FIRE 2010 (Forum of Information Retrieval Evaluation) datasets. The experimental results show that the proposed approach gives equal/ better performance of English-Hindi CLIR system compared to monolingual performance and also helps in overcoming existing problems and outperforms the existing English-Hindi CLIR system in terms of average precision.
引用
收藏
页码:300 / 305
页数:6
相关论文
共 9 条
  • [1] Ashwin B., 2012, ADV COMP COMM TECHN, P65
  • [2] BAI J., 2005, Proceedings of ACM CIKM 05, P688, DOI DOI 10.1145/1099554.1099725
  • [3] Ballesteros L., 1998, P 21 ANN INT ACM SIG, P61
  • [4] Chaware S. M, 2011, INT J COMPUTER TECHN, V2, P379
  • [5] Das S., 2010, 2010 International Conference on Industrial Electronics, Control and Robotics (IECR), P53, DOI 10.1109/IECR.2010.5720139
  • [6] Gonzalo J., 2005, SIGIR 2005. Proceedings of the Twenty-Eighth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P603, DOI 10.1145/1076034.1076149
  • [7] Kyung- Soon L., 2002, P 19 INT C COMP LING, V1, P1
  • [8] Ponte T., 1998, P 21 ANN INT ACM SIG, P65
  • [9] Zhou D, 2008, LECT NOTES COMPUT SC, V5152, P64, DOI 10.1007/978-3-540-85760-0_8