An optimization approach of known-item search on large-scale graph data

被引:0
|
作者
机构
[1] Zhong, Ming
[2] Wang, Sheng
[3] Liu, Mengchi
来源
Zhong, M. (clock@whu.edu.cn) | 1600年 / Science Press卷 / 51期
关键词
Query processing - Data mining - Optimization - Social sciences computing - Graph theory - Search engines - Economic and social effects;
D O I
10.7544/issn1000-1239.2014.20130692
中图分类号
学科分类号
摘要
Recently, a large amount of naturally graph-structured data are generated in various fields, such as social network, bioinformatics, software engineering, knowledge engineering, and etc. Consequently, querying, searching and mining such graph data have become the popular research topics. However, due to the extremely high computational complexity, existing keyword search approaches for graph data do not scale well on large graphs. This paper is motivated by a novel study of the user information needs of graph search. We discuss some possible types of graph search and their potentials of being optimized, and propose an idea that we should adopt customized optimization strategies according to the features of different types of graph search. Based on that, we propose an effective heuristic optimization approach for an important and usual particular type of search called known-item search. We construct an index to capture the local topology information in the graph. For large-scale graph data, we can use the MapReduce technique to handle the indexing task. By using the index, we can prune matched vertices before the search, and thereby significantly reduce the search space with a little possible loss of the top-k answers as trade-offs. Our experiments testify that the approach can decrease the response time of the known-item search dramatically.
引用
收藏
相关论文
共 50 条
  • [21] Efficient Task Scheduling for Large-scale Graph Data Processing in Cloud Computing: A Particle Swarm Optimization Approach
    Shang, Rui
    Journal of Combinatorial Mathematics and Combinatorial Computing, 2024, 122 : 135 - 148
  • [22] Graph Optimization Methods for Large-Scale Crowdsourced Mapping
    Stoven-Dubois, Alexis
    Dziri, Aziz
    Leroy, Bertrand
    Chapuis, Roland
    PROCEEDINGS OF 2020 23RD INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2020), 2020, : 980 - 987
  • [23] Large-Scale Visual Search with Binary Distributed Graph at Alibaba
    Zhao, Kang
    Pan, Pan
    Zheng, Yun
    Zhang, Yanhao
    Wang, Changxu
    Zhang, Yingya
    Xu, Yinghui
    Jin, Rong
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2567 - 2575
  • [24] Heterogeneous Graph Propagation for Large-Scale Web Image Search
    Xie, Lingxi
    Tian, Qi
    Zhou, Wengang
    Zhang, Bo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (11) : 4287 - 4298
  • [25] Known-item searches and search tactics in library search systems: Results from four transaction log analysis studies
    Schultheiss, Sebastian
    Linhart, Alexandra
    Behnert, Christiane
    Rulik, Imke
    Lewandowski, Dirk
    JOURNAL OF ACADEMIC LIBRARIANSHIP, 2020, 46 (05):
  • [26] Large-Scale Optimization for Evaluation Functions with Minimax Search
    Hoki, Kunihito
    Kaneko, Tomoyuki
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2014, 49 : 527 - 568
  • [27] Large-scale var optimization and planning by tabu search
    Gan, DQ
    Qu, ZH
    Cai, HZ
    ELECTRIC POWER SYSTEMS RESEARCH, 1996, 39 (03) : 195 - 204
  • [28] Large-scale graph signal denoising: A heuristic approach
    Fattahi, Mohammadreza
    Saeedi-Sourck, Hamid
    Abootalebi, Vahid
    DIGITAL SIGNAL PROCESSING, 2025, 158
  • [29] A Novel Clustering Algorithm on Large-Scale Graph Data
    Zhang, Hao
    Zhou, Wei
    Wan, Xiaoyu
    Fu, Ge
    Xu, Zhiyong
    Han, Jizhong
    2014 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD), 2014, : 47 - 54
  • [30] Visualizing Large-scale Linked Data with Memo Graph
    Ghorbel, Fatma
    Hamdi, Faycal
    Ellouze, Nebrasse
    Metais, Elisabeth
    Gargouri, Faiez
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS, 2017, 112 : 854 - 863