Querying large graphs in biomedicine with colored graphs and decomposition

被引:1
|
作者
Chou, Chung-Hsien [1 ]
Sheu, Phillip [1 ]
Hayakawa, Masahiro [2 ]
Kitazawa, Atsushi [2 ]
机构
[1] Univ Calif Irvine, Dept EECS, Irvine, CA 92617 USA
[2] NEC Solut Innovators Ltd, Tokyo, Japan
关键词
Large graphs; OLAP; Coloredgraphs; Betweenness; CENTRALITY; RESOURCE;
D O I
10.1016/j.jbi.2020.103503
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In graph networks, graph structural analytics such as betweenness centrality has played an important role in finding the most central vertices in graph data. Hence, betweenness centrality has been heavily applied to discover the most important genes with respect to multiple diseases in biomedicine research. Considering color as a property of graph data to represent different categories for the nodes and edges in the graph, we may investigate the betweenness centrality of each colored subgraph composed of a specific color. However, as investigators may be interested in querying betweenness centrality on multiple combinations of the colored subgraphs, the total execution time on all the subgraphs may be excessively long, considering all the possible combinations. In addition, the performance could be worse when the size of the graph grows larger. In this research, we propose an approach to computing betweenness centrality by incorporating node colors and edge colors. We propose that the node with the highest betweenness centrality can be computed for a very large and colored graph by decomposing the graph into colored subgraphs and merging the result from the base cases. Furthermore, we compare our approach with the conventional approaches in the experiments, and we demonstrate that our scalable approach is more efficient when finding the global backbone node with the highest betweenness centrality.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Visual Querying of Large Multilayer Graphs
    Cuenca, Erick
    Sallaberry, Arnaud
    Ienco, Dino
    Poncelet, Pascal
    30TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2018), 2018,
  • [2] Scalable SPARQL Querying of Large RDF Graphs
    Huang, Jiewen
    Abadi, Daniel J.
    Ren, Kun
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (11): : 1123 - 1134
  • [3] EMBEDDING GRAPHS INTO COLORED GRAPHS
    HAJNAL, A
    KOMJATH, P
    TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY, 1988, 307 (01) : 395 - 409
  • [4] Span-reachability querying in large temporal graphs
    Wen, Dong
    Yang, Bohua
    Zhang, Ying
    Qin, Lu
    Cheng, Dawei
    Zhang, Wenjie
    VLDB JOURNAL, 2022, 31 (04): : 629 - 647
  • [5] Span-reachability querying in large temporal graphs
    Dong Wen
    Bohua Yang
    Ying Zhang
    Lu Qin
    Dawei Cheng
    Wenjie Zhang
    The VLDB Journal, 2022, 31 : 629 - 647
  • [6] Querying Influential Maximum Connected Community in Large Graphs
    Xu, Yuehao
    Sun, Renjie
    Wu, Yanping
    Chen, Chen
    Wang, Xiaoyang
    2020 IEEE 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2020), 2020, : 757 - 758
  • [7] Towards Prescriptive Analyses of Querying Large Knowledge Graphs
    Ragab, Mohamed
    NEW TRENDS IN DATABASE AND INFORMATION SYSTEMS, ADBIS 2022, 2022, 1652 : 639 - 647
  • [8] Knowledge Graphs Querying
    Khan, Arijit
    SIGMOD RECORD, 2023, 52 (02) : 18 - 29
  • [9] Querying Graphs with Preferences
    Fionda, Valeria
    Pirro, Giuseppe
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 929 - 938
  • [10] Querying complex graphs
    Liu, YHA
    Stoller, SD
    PRACTICAL ASPECTS OF DECLARATIVE LANGUAGES, 2006, 3819 : 199 - 214