A proposed multi criteria indexing and ranking model for documents and web pages on large scale data

被引:8
作者
Attia, Mohamed [1 ]
Abdel-Fattah, Manal A. [2 ]
Khedr, Ayman E. [1 ]
机构
[1] Future Univ Egypt, Cairo, Egypt
[2] Helwan Univ, Helwan, Egypt
关键词
Page ranking; User preferences; Crawling; Multi -criteria index; Retrieval process; Search Engine; SEARCH ENGINE OPTIMIZATION; USER;
D O I
10.1016/j.jksuci.2021.10.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the expansion of data, search engines encounter different obstacles for retrieving better relevant content to user's search queries. Consequently, various retrieval and ranking algorithms have been applied to satisfy the result's relevancy according to user's needs. Unfortunately, indexing and ranking processes face several challenges to achieve highly accurate results, since most of the existing indexes and ranking algorithms crawl documents and web pages based on limited number of criteria that satisfy user needs. So, this research studies and observes how search engines work and which factors contribute to higher rankings results. The research also proposes a Multi Criteria Indexing and Ranking Model (MCIR) based on weighted documents and pages which depend on one or more ranking factors, aiming at building a model that achieves high performance, better relevant pages, and the ability to index and rank both online/offline pages and documents. The MCIR model was applied on three different experi-ments to compare documents and pages results in terms of ranking scores, based on one or more criteria of user's preferences. The results of applying MCIR model proved that final pages ranking results depend-ing on multi-criteria are better than using only one criterion, and some criteria have great effect on rank-ing results than other criteria. It was also observed that the MCIR model achieved high performance on indexing and ranking dataset up to 100 gigabytes.(c) 2021 The Authors. Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:8702 / 8715
页数:14
相关论文
共 48 条
  • [1] Alhaidari F., 2020, 3 INT C COMPUTER APP, P1
  • [2] Badr M, 2013, IEEE IMAGE PROC, P2901, DOI 10.1109/ICIP.2013.6738597
  • [3] Brin S., 1998, Technical report, V98, P161
  • [4] Chatterjee S., 2017, International Journal of Innovative Research in Computer and Communication Engineering, P6607
  • [5] Chowdhary A., 2019, ACTA INFORM MALAYSIA, V3, P01
  • [6] Chu H., 2018, IEEE INT C CONSUMER, P1
  • [7] Dean B., 2020, WE ANAL 11 8 MILLION
  • [8] Dode Albi., 2017, IOSR-JCE, V19, P01, DOI DOI 10.9790/0661-1901030107
  • [9] The Role of Search Engine Optimization on Keeping the User on the Site
    Egri, Gokhan
    Bayrak, Coskun
    [J]. COMPLEX ADAPTIVE SYSTEMS, 2014, 36 : 335 - +
  • [10] A Literature Review of Indexing and Searching Techniques Implementation in Educational Search Engines
    El Guemmat, Kamal
    Ouahabi, Sara
    [J]. INTERNATIONAL JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY EDUCATION, 2018, 14 (02) : 72 - 83