Revealing the predictability of intrinsic structure in complex networks

被引:32
作者
Sun, Jiachen [1 ,2 ]
Feng, Ling [3 ,4 ]
Xie, Jiarong [1 ]
Ma, Xiao [1 ]
Wang, Dashun [5 ,6 ,7 ]
Hu, Yanqing [1 ,8 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China
[2] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Peoples R China
[3] ASTAR, Inst High Performance Comp, Singapore 138632, Singapore
[4] Natl Univ Singapore, Dept Phys, Singapore 117551, Singapore
[5] Northwestern Univ, Ctr Sci Sci & Innovat, Evanston, IL USA
[6] Northwestern Univ, Kellogg Sch Management, Evanston, IL USA
[7] Northwestern Univ, McCormick Sch Engn, Evanston, IL USA
[8] Southern Marine Sci & Engn Guangdong Lab, Zhuhai 519082, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
LINK-PREDICTION;
D O I
10.1038/s41467-020-14418-6
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Structure prediction is an important and widely studied problem in network science and machine learning, finding its applications in various fields. Despite the significant progress in prediction algorithms, the fundamental predictability of structures remains unclear, as networks' complex underlying formation dynamics are usually unobserved or difficult to describe. As such, there has been a lack of theoretical guidance on the practical development of algorithms for their absolute performances. Here, for the first time, we find that the normalized shortest compression length of a network structure can directly assess the structure predictability. Specifically, shorter binary string length from compression leads to higher structure predictability. We also analytically derive the origin of this linear relationship in artificial random networks. In addition, our finding leads to analytical results quantifying maximum prediction accuracy, and allows the estimation of the network dataset potential values through the size of the compressed network data file.
引用
收藏
页数:10
相关论文
共 39 条
[1]   Friends and neighbors on the Web [J].
Adamic, LA ;
Adar, E .
SOCIAL NETWORKS, 2003, 25 (03) :211-230
[2]  
Al Hasan M., 2006, SDM06, V30, P798
[3]  
[Anonymous], 2011, ACM SIGKDD
[4]  
[Anonymous], 2001, RANDOM GRAPHS
[5]   Network link prediction by global silencing of indirect correlations [J].
Barzel, Baruch ;
Barabasi, Albert-Laszlo .
NATURE BIOTECHNOLOGY, 2013, 31 (08) :720-725
[6]   E-commerce recommendation applications [J].
Ben Schafer, J ;
Konstan, JA ;
Riedl, J .
DATA MINING AND KNOWLEDGE DISCOVERY, 2001, 5 (1-2) :115-153
[7]   SUBMODEL SELECTION AND EVALUATION IN REGRESSION - THE X-RANDOM CASE [J].
BREIMAN, L ;
SPECTOR, P .
INTERNATIONAL STATISTICAL REVIEW, 1992, 60 (03) :291-319
[8]   Network-based prediction of drug combinations [J].
Cheng, Feixiong ;
Kovacs, Istvan A. ;
Barabasi, Albert-Laszlo .
NATURE COMMUNICATIONS, 2019, 10 (1)
[9]   Compression of Graphical Structures: Fundamental Limits, Algorithms, and Experiments [J].
Choi, Yongwook ;
Szpankowski, Wojciech .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2012, 58 (02) :620-638
[10]   Hierarchical structure and the prediction of missing links in networks [J].
Clauset, Aaron ;
Moore, Cristopher ;
Newman, M. E. J. .
NATURE, 2008, 453 (7191) :98-101