A New Hybrid Document Clustering for PRF-Based Automatic Query Expansion Approach for Effective IR

被引:2
作者
Gupta, Yogesh [1 ]
Saini, Ashish [2 ]
机构
[1] BML Munjal Univ, Sch Engn & Technol, Kapriwas, Haryana, India
[2] Dayalbagh Educ Inst, Agra, Uttar Pradesh, India
关键词
Automatic Query Expansion; Document Clustering; F-Measure; Fuzzy Logic; Particle Swarm Optimization; Precision; Pseudo Relevance Feedback; Recall; INFORMATION-RETRIEVAL;
D O I
10.4018/IJeC.2020070105
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic query expansion (AQE) is an effective measure to improve information retrieval performance by including additional terms in a user query. The pseudo relevance feedback (PRF) method employed for AQE so far has suffered from a major problem of query drift. Therefore, keeping it in view, a new hybrid document clustering for PRF based AQE approach is proposed in the present article. In this, Fuzzy logic and Particle Swarm Optimization (PSO) are used to construct document clusters. Further, a new and effective hybrid PSO and Fuzzy logic-based term weighting approach is followed to find more suitable additional query terms using a weighted score of four IR evidences which is considered maximized. Moreover, a combined semantic filtering method along with query terms re-weighting algorithms are also used to remove noisy or irrelevant terms semantically. The performance of the presented approaches in this article is tested and compared with other approaches on three benchmark data sets. The comparative analysis of all the tested approaches illustrates the superior performance of the proposed approach.
引用
收藏
页码:73 / 95
页数:23
相关论文
共 21 条
[1]  
[Anonymous], INT J COMPUT APPL
[2]  
[Anonymous], 2015 IEEE WORKSH COM
[3]   LOCAL FEEDBACK IN FULL-TEXT RETRIEVAL SYSTEMS [J].
ATTAR, R ;
FRAENKEL, AS .
JOURNAL OF THE ACM, 1977, 24 (03) :397-417
[4]   Query expansion techniques for information retrieval: A survey [J].
Azad, Hiteshwar Kumar ;
Deepak, Akshay .
INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (05) :1698-1735
[5]  
Buckley C., 1995, Text REtrieval Conference (TREC-3) (NIST SP 500-225), P69
[6]  
Fall CJ, 2003, SIGIR FORUM, V37, P10, DOI DOI 10.1145/945546.945547
[7]  
Gupta Y., 2019, INT J ENG ADV TECHNO, V8, P130
[8]   A new swarm-based efficient data clustering approach using KHM and fuzzy logic [J].
Gupta, Yogesh ;
Saini, Ashish .
SOFT COMPUTING, 2019, 23 (01) :145-162
[9]   A novel Fuzzy-PSO term weighting automatic query expansion approach using combined semantic filtering [J].
Gupta, Yogesh ;
Saini, Ashish .
KNOWLEDGE-BASED SYSTEMS, 2017, 136 :97-120
[10]  
Imani, 2019, P SMAR 5 C SMART MON, P1, DOI DOI 10.1007/978-3-030-15719-7_26