Web Page Ranking Using Web Mining Techniques: A Comprehensive Survey

被引:6
作者
Sharma, Prem Sagar [1 ]
Yadav, Divakar [2 ]
Thakur, R. N. [3 ]
机构
[1] Uttarakhand Tech Univ, Dehra Dun, Uttaranchal, India
[2] Natl Inst Technol, Hamirpur, Himachal Prades, India
[3] LBEF Campus, Kathmandu, Nepal, India
关键词
SEARCH-ENGINE; ALGORITHM; SERVICES; RESOURCES;
D O I
10.1155/2022/7519573
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the exponential growth of Internet users and traffic, information seekers depend highly on search engines to extract relevant information. Due to the accessibility of a large amount of textual, audio, video etc., contents, the responsibility of search engines has increased. The search engine provides relevant information to Internet users concerning their query, based on content, link structure, etc. However, it does not guarantee the correctness of the information. The performance of a search engine is highly dependent upon the ranking module. The performance of the ranking module is dependent upon the link structure of web pages, which analyze through Web structure mining (WSM) and their content, which analyzes through Web content mining (WCM). Web mining plays a vital role in computing the rank of web pages. This article presents web mining types, techniques, tools, algorithms, and their challenges. Further, it provides a critical comprehensive survey for the researchers by presenting different features of web pages, which are essential to check their quality. In this work, authors presented different approaches/techniques, algorithms and evaluation approaches in previous researches and identified some critical issues in page ranking and web mining, which provide future directions for the researchers working in the area.
引用
收藏
页数:19
相关论文
共 113 条
  • [1] Movie Revenue Prediction Based on Purchase Intention Mining Using YouTube Trailer Reviews
    Ahmad, Ibrahim Said
    Abu Bakar, Azuraliza
    Yaakub, Mohd Ridzwan
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (05)
  • [2] Ahmed A., 2021, P INT C GEN EV COMP, P493
  • [3] Alphy A., 2011, 2011 International Conference on Recent Trends in Information Technology (ICRTIT 2011), P1271, DOI 10.1109/ICRTIT.2011.5972369
  • [4] Sampling search-engine results
    Anagnostopoulos, Aris
    Broder, Andrei Z.
    Carmel, David
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2006, 9 (04): : 397 - 429
  • [5] Local Explanations of Global Rankings: Insights for Competitive Rankings
    Anahideh, Hadis
    Mohabbati-Kalejahi, Nasrin
    [J]. IEEE ACCESS, 2022, 10 : 30676 - 30693
  • [6] [Anonymous], DETECTING HIERARCHIC
  • [7] [Anonymous], 2019, ACM Trans. Intell. Syst. Technol., DOI DOI 10.1145/3298981
  • [8] Evaluation of information retrieval: precision and recall
    Arora, Monika
    Kanjilal, Uma
    Varshney, Dinesh
    [J]. INTERNATIONAL JOURNAL OF INDIAN CULTURE AND BUSINESS MANAGEMENT, 2016, 12 (02) : 224 - 236
  • [9] Awad M., 2015, Efficient Learning Machines, P19
  • [10] Baeza-Yates R., 2004, PROC 13 INT WWW C AL, P328, DOI DOI 10.1145/1013367.1013459