Efficient query clustering and information retrieval using Sequenced User Search Pattern Query Optimization

被引:0
作者
Surya S. [1 ]
Sumitra P. [1 ]
机构
[1] Department of Computer Science and Computer Application, Vivekanandha College of Arts and Sciences for Women (Autonomous), Elayampalayam, Tiruchengode, TamilNadu, Namakkal(DT)
关键词
Density-based clustering; Information retrieval; Pattern query search; Pre-processing; Ranking; SOA-SUSPQO; Web page;
D O I
10.1007/s11042-024-19463-7
中图分类号
学科分类号
摘要
The rapid growth of the internet has ushered in an era of unprecedented information availability. However, this expansion has also given rise to the formidable challenge of efficiently extracting relevant data from the vast expanse of web pages. Conventional methods have often fallen short, inundating users with an abundance of irrelevant content. In response to this pressing issue, this study introduces the innovative Spider Optimization Algorithm (SOA) built upon the foundation of Sequenced User Search Pattern Query Optimization (SUSPQO). SOA-SUSPQO represents a pioneering approach that harnesses the power of density-based clustering for the thorough analysis of web page content. By doing so, it significantly enhances the efficiency and precision of information retrieval processes. In a rigorous evaluation involving a user base ranging from 100 to 500 individuals, the results were nothing short of remarkable. The SOA-SUSPQO framework yielded exceptional outcomes, boasting a 95.6% information retrieval performance rate, a precision rate of 93.4%, and an impressive recall rate of 92.1% for 500 users. By showcasing the demonstrated superiority of the SOA-SUSPQO framework over existing methodologies, it contributes significantly to the advancement of information retrieval technologies. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
引用
收藏
页码:16033 / 16055
页数:22
相关论文
共 36 条
  • [1] Gudivada V.N., Rao D.L., Amogh R., Information retrieval: Concepts, models, and systems, In: Handbook of Statistics, 38, pp. 331-401, (2018)
  • [2] Jones S., Shao L., Content-based retrieval of human actions from realistic video databases, Inf Sci, 236, pp. 56-65, (2013)
  • [3] Delen D., Demirkan H., Data, information and analytics as services, Decis Support Syst, 55, 1, pp. 359-363, (2013)
  • [4] Bedi P., Thukral A., Banati H., Focused crawling of tagged web resources using ontology, Comput Electr Eng, 39, 2, pp. 613-628, (2013)
  • [5] Jin X., Wah B.W., Cheng X., Wang Y., Significance and challenges of big data research, Big Data Res, 2, 2, pp. 59-64, (2015)
  • [6] Palangi H., Deng L., Shen Y., Gao J., He X., Chen J., Song X., Ward R., Deep sentence embedding using long short-term memory networks: analysis and application to information retrieval, IEEE/ACM Trans Audio Speech Lang Process, 24, 4, pp. 694-707, (2016)
  • [7] Azad H.K., Deepak A., Query expansion techniques for information retrieval: a survey, Inf Process Manag, 56, 5, pp. 1698-1735, (2019)
  • [8] Sahlol A.T., Ahmed A.M., Hassanien A.E., Automatic acute lymphoblastic leukemia classification model using social spider optimization algorithm, Soft Comput, 23, pp. 6345-6360, (2019)
  • [9] Kalendar R., Khassenov B., Ramankulov Y., Samuilova O., Ivanov I.K., FastPCR: an in silico tool for fast primer and probe design and advanced sequence analysis, Genomics, 109, 3-4, pp. 312-319, (2017)
  • [10] Marshall I.J., Wallace B.C., Toward systematic review automation: a practical guide to using machine learning tools in research synthesis, Syst Reviews, 8, pp. 1-10, (2019)