Search engine intelligent algorithm for big data

被引:0
作者
Li C.H. [1 ]
机构
[1] Hunan Mass Media Vocational and Technical College, Changsha, Hunan
来源
Telecommunications and Radio Engineering (English translation of Elektrosvyaz and Radiotekhnika) | 2020年 / 79卷 / 10期
关键词
Big data; Canopy algorithm; Clustering algorithm; Search engine;
D O I
10.1615/TelecomRadEng.v79.i10.50
中图分类号
学科分类号
摘要
How to improve the efficiency of search engine in big data environment has become a hot issue. In this study, for clustering search, key words were extracted by Term Frequency-Inverse Document Frequency (TF-IDF), the defect of K-means algorithm was improved by combining Canopy algorithm to obtain a Canopy-K-means (CKM) algorithm, and its retrieval performance was tested. The results showed that the performance of the algorithm increased with the increase of data volume in searching different key words, the search time shortened, and the recall factor and the pertinency factor improved. The CKM algorithm showed an excellent performance in big data processing and better performance compared to LDA and K-means algorithms. The comparison with the clustering performance of K-means algorithm demonstrated that the clustering result of the CKM algorithm was more similar to the actual number of clusters and its clustering accuracy was higher, indicating that the CKM algorithm was effective in retrieval. The experimental results of this study make some contributions to improve the efficiency of data retrieval and meet the needs of users, which is conducive to the better development of search engines. © 2020 Begell House Inc.. All rights reserved.
引用
收藏
页码:883 / 890
页数:7
相关论文
共 50 条
  • [31] Application of genetic algorithm in search engine
    Li, WF
    Xu, BW
    Yang, HJ
    Chu, WCC
    Lu, CW
    INTERNATIONAL SYMPOSIUM ON MULTIMEDIA SOFTWARE ENGINEERING, PROCEEDINGS, 2000, : 366 - 371
  • [32] The study of key techniques in intelligent XML search engine
    Yuan, F
    Hao, YN
    Yu, G
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1194 - 1197
  • [33] Design Scheme for Intelligent English Translating Search Engine
    Wei, Li
    PROCEEDINGS OF 2017 9TH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA), 2017, : 431 - 434
  • [34] Research and Application of Intelligent Algorithm for Architectural Design Based on Big Data and CAD Technology
    Teng Y.
    Ju X.
    Computer-Aided Design and Applications, 2024, 21 (S21): : 242 - 258
  • [35] Evaluation model and algorithm of intelligent manufacturing system based on pattern recognition and big data
    Yuan Guo
    Qiang Qin
    Weitang Zhang
    Yun Wei
    Wei Li
    Soft Computing, 2023, 27 : 4195 - 4208
  • [36] Big data and intelligent software systems
    Jalal, Ahmed Adeeb
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2018, 22 (03) : 177 - 193
  • [37] Big Data Driven Intelligent Manufacturing
    Zhang J.
    Wang J.
    Lyu Y.
    Bao J.
    Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2019, 30 (02): : 127 - 133and158
  • [38] Intelligent services for Big Data science
    Dobre, C.
    Xhafa, F.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 37 : 267 - 281
  • [39] Intelligent Network Storage On The Big Data
    Li Haixia
    Lu Chuiwei
    Sun Sheng
    PROCESSING OF 2014 INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INFORMATION INTEGRATION FOR INTELLIGENT SYSTEMS (MFI), 2014,
  • [40] Evaluation model and algorithm of intelligent manufacturing system based on pattern recognition and big data
    Guo, Yuan
    Qin, Qiang
    Zhang, Weitang
    Wei, Yun
    Li, Wei
    SOFT COMPUTING, 2023, 27 (07) : 4195 - 4208