FSM: Fast and scalable network motif discovery for exploring higher-order network organizations

被引:20
作者
Wang, Tao [1 ]
Peng, Jiajie [2 ]
Peng, Qidi [1 ]
Wang, Yadong [1 ]
Chen, Jin [3 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian, Peoples R China
[3] Univ Kentucky, Inst Biomed Informat, Lexington, KY USA
关键词
Biological network; Network motif; Higher-order organization; ALGORITHM; TOOL;
D O I
10.1016/j.ymeth.2019.07.008
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Networks exhibit rich and diverse higher-order organizational structures. Network motifs, which are recurring significant patterns of inter-connections, are recognized as fundamental units to study the higher-order organizations of networks. However, the principle of selecting representative network motifs for local motif based clustering remains largely unexplored. We present a scalable algorithm called FSM for network motif discovery. FSM is advantageous in twofold. First, it accelerates the motif discovery process by effectively reducing the number of times for subgraph isomorphism labeling. Second, FSM adopts multiple heuristic optimizations for subgraph enumeration and classification to further improve its performance. Experimental results on biological networks show that, comparing with the existing network motif discovery algorithm, FSM is more efficient on computational efficiency and memory usage. Furthermore, with the large, frequent, and sparse network motifs discovered by FSM, the higher-order organizational structures of biological networks were successfully revealed, indicating that FSM is suitable to select network representative network motifs for exploring high-order network organizations.
引用
收藏
页码:83 / 93
页数:11
相关论文
共 43 条
[1]   Taking the mystery out of biological networks [J].
Aloy, P ;
Russell, RB .
EMBO REPORTS, 2004, 5 (04) :349-350
[2]   Classes of small-world networks [J].
Amaral, LAN ;
Scala, A ;
Barthélémy, M ;
Stanley, HE .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (21) :11149-11152
[3]  
[Anonymous], 2010, P 2010 ACM S APPL CO, DOI DOI 10.1145/1774088.1774422
[4]  
[Anonymous], 2006, NEMOFINDER DISSECTIN
[5]   An automated method for finding molecular complexes in large protein interaction networks [J].
Bader, GD ;
Hogue, CW .
BMC BIOINFORMATICS, 2003, 4 (1)
[6]  
Batagelj V, 2002, LECT NOTES COMPUT SC, V2265, P477
[7]   Higher-order organization of complex networks [J].
Benson, Austin R. ;
Gleich, David F. ;
Leskovec, Jure .
SCIENCE, 2016, 353 (6295) :163-166
[8]   Topological structure analysis of the protein-protein interaction network in budding yeast [J].
Bu, DB ;
Zhao, Y ;
Cai, L ;
Xue, H ;
Zhu, XP ;
Lu, HC ;
Zhang, JF ;
Sun, SW ;
Ling, LJ ;
Zhang, N ;
Li, GJ ;
Chen, RS .
NUCLEIC ACIDS RESEARCH, 2003, 31 (09) :2443-2450
[9]   Canonical forms for labelled trees and their applications in frequent subtree mining [J].
Chi, Y ;
Yang, YR ;
Muntz, RR .
KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 8 (02) :203-234
[10]   Structure and promoter activity of the 5′ flanking region of ace-1, the gene encoding acetylcholinesterase of class a in Caenorhabditis elegans [J].
Culetto, E ;
Combes, D ;
Fedon, Y ;
Roig, A ;
Toutant, JP ;
Arpagaus, M .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 290 (05) :951-966