Search engine intelligent algorithm for big data

Cited by: 0
Authors
Li C.H. [1]
Affiliation
[1] Hunan Mass Media Vocational and Technical College, Changsha, Hunan
Source
Telecommunications and Radio Engineering (English translation of Elektrosvyaz and Radiotekhnika) | 2020, Vol. 79, No. 10
Keywords
Big data; Canopy algorithm; Clustering algorithm; Search engine
DOI
10.1615/TelecomRadEng.v79.i10.50
Abstract
Improving the efficiency of search engines in a big data environment has become a hot issue. In this study, for clustering-based search, keywords were extracted using Term Frequency-Inverse Document Frequency (TF-IDF), and the shortcomings of the K-means algorithm were addressed by combining it with the Canopy algorithm to obtain a Canopy-K-means (CKM) algorithm, whose retrieval performance was then tested. The results showed that, when searching different keywords, the performance of the algorithm improved as the data volume increased: the search time shortened, and the recall factor and the pertinency factor improved. The CKM algorithm performed well in big data processing and outperformed the LDA and K-means algorithms. A comparison with the clustering performance of the K-means algorithm demonstrated that the number of clusters produced by the CKM algorithm was closer to the actual number of clusters and that its clustering accuracy was higher, indicating that the CKM algorithm was effective in retrieval. The experimental results of this study contribute to improving the efficiency of data retrieval and meeting the needs of users, which is conducive to the better development of search engines. © 2020 Begell House Inc. All rights reserved.
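The pipeline named in the abstract (TF-IDF keyword weighting, Canopy pre-clustering, then K-means) can be illustrated with a minimal, standard-library-only sketch. This is not the paper's implementation: the function names, the smoothed IDF variant, the Euclidean distance on L2-normalised vectors, the thresholds `t1` and `t2`, and the toy documents are all assumptions chosen for illustration. The idea shown is the commonly described one, in which Canopy supplies both the number of clusters and the initial centers, so K-means does not need a hand-picked `k`.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return L2-normalised TF-IDF vectors for a list of tokenised documents."""
    vocab = sorted({t for d in docs for t in d})
    n = len(docs)
    df = Counter(t for d in docs for t in set(d))
    # Assumption: idf = ln(N / df) + 1, one of several common variants.
    idf = {t: math.log(n / df[t]) + 1.0 for t in vocab}
    vectors = []
    for d in docs:
        tf = Counter(d)
        v = [tf[t] / len(d) * idf[t] for t in vocab]
        norm = math.sqrt(sum(x * x for x in v)) or 1.0
        vectors.append([x / norm for x in v])
    return vectors


def dist(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def canopy(points, t1, t2):
    """Canopy pre-clustering with a loose threshold t1 and a tight threshold t2 (t1 > t2).

    Returns a list of (center, member_indices) pairs.
    """
    remaining = list(range(len(points)))
    canopies = []
    while remaining:
        c = remaining[0]
        # Every point within t1 of the center belongs to this canopy.
        members = [i for i in range(len(points)) if dist(points[i], points[c]) <= t1]
        canopies.append((points[c], members))
        # Points within t2 of the center can no longer become centers.
        remaining = [i for i in remaining if dist(points[i], points[c]) > t2]
    return canopies


def kmeans(points, centers, max_iter=100):
    """Plain K-means, seeded with the given initial centers."""
    centers = [list(c) for c in centers]
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(len(centers)), key=lambda k: dist(p, centers[k]))
            for p in points
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for k in range(len(centers)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # keep the old center if a cluster is empty
                centers[k] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centers


# Toy corpus (hypothetical): two documents about search, two about clustering.
docs = [
    "search engine query index".split(),
    "search engine index ranking".split(),
    "cluster centroid distance kmeans".split(),
    "cluster centroid kmeans canopy".split(),
]
vectors = tfidf_vectors(docs)
canopies = canopy(vectors, t1=1.2, t2=1.0)
labels, centers = kmeans(vectors, [center for center, _ in canopies])
print(len(canopies), labels)
```

In this sketch the thresholds control how many canopies, and therefore how many K-means clusters, are produced; on real data they would need tuning, and the values used here only suit the toy corpus.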
Pages: 883-890
Number of pages: 7
Related papers
50 in total
  • [21] Critical analysis of Big Data challenges on similar sequential Algorithm based data Search
    Peng, Yuhua
    Xu, Wenli
    Xiong, Yan
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2021, 24 (04): : 653 - 659
  • [22] Design of Intelligent Search Engine with Multiple Agents
    Jin Hongying
    FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE IV, PTS 1-5, 2014, 496-500 : 1937 - 1940
  • [23] An intelligent optimization algorithm for sentiment classification of scene images in big data era
    Cao, Jianfang
    Chen, Junjie
    Chen, Lichao
    Yao, Huiting
    Sensor Letters, 2014, 12 (02) : 369 - 373
  • [24] Intelligent information recommendation algorithm under background of big data land cultivation
    Tang, Haoxiang
    Yang, Wei
    Zheng, Susheng
    MICROPROCESSORS AND MICROSYSTEMS, 2021, 81
  • [25] Data Mining Engine based on Big Data
    Song, Guo
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT, COMPUTER AND SOCIETY, 2016, 37 : 264 - 267
  • [26] K-RBBSO Algorithm: A Result-Based Stochastic Search Algorithm in Big Data
    Park, Sungjin
    Kim, Sangkyun
    APPLIED SCIENCES-BASEL, 2022, 12 (23):
  • [27] Big data and open data for an intelligent governance
    Cerrillo-Martinez, Agusti
    PROFESIONAL DE LA INFORMACION, 2018, 27 (05): : 1128 - 1135
  • [28] Exploratory search on big data
    MOE Key Laboratory of Data Engineering and Knowledge Engineering, Renmin University of China, Beijing 100872, China
    Tongxin Xuebao, 12
  • [29] Fast search of art culture resources based on big data and cuckoo algorithm
    Xia, Xuewen
    PERSONAL AND UBIQUITOUS COMPUTING, 2020, 24 (01) : 127 - 138