Identifying Protein Complexes From Protein-Protein Interaction Networks Based on Fuzzy Clustering and GO Semantic Information

被引:18
作者
Pan, Xiangyu [1 ]
Hu, Lun [2 ]
Hu, Pengwei [2 ]
You, Zhu-Hong [3 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Technol, Wuhan 430070, Peoples R China
[2] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
[3] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
关键词
Proteins; Semantics; Clustering algorithms; Task analysis; Topology; Ontologies; Search problems; Protein complex identification; fuzzy clustering; protein-protein interaction network; gene ontology; FUNCTIONAL MODULES; ONTOLOGY; IDENTIFICATION; SIMILARITY; DISCOVERY; ALGORITHM; DATABASE; TOOL;
D O I
10.1109/TCBB.2021.3095947
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein complexes are of great significance to provide valuable insights into the mechanisms of biological processes of proteins. A variety of computational algorithms have thus been proposed to identify protein complexes in a protein-protein interaction network. However, few of them can perform their tasks by taking into account both network topology and protein attribute information in a unified fuzzy-based clustering framework. Since proteins in the same complex are similar in terms of their attribute information and the consideration of fuzzy clustering can also make it possible for us to identify overlapping complexes, we target to propose such a novel fuzzy-based clustering framework, namely FCAN-PCI, for an improved identification accuracy. To do so, the semantic similarity between the attribute information of proteins is calculated and we then integrate it into a well-established fuzzy clustering model together with the network topology. After that, a momentum method is adopted to accelerate the clustering procedure. FCAN-PCI finally applies a heuristical search strategy to identify overlapping protein complexes. A series of extensive experiments have been conducted to evaluate the performance of FCAN-PCI by comparing it with state-of-the-art identification algorithms and the results demonstrate the promising performance of FCAN-PCI.
引用
收藏
页码:2882 / 2893
页数:12
相关论文
共 47 条
[41]   A Fast Hierarchical Clustering Algorithm for Functional Modules Discovery in Protein Interaction Networks [J].
Wang, Jianxin ;
Li, Min ;
Chen, Jianer ;
Pan, Yi .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (03) :607-620
[42]   Identifying protein complexes based on an edge weight algorithm and core-attachment structure [J].
Wang, Rongquan ;
Liu, Guixia ;
Wang, Caixia .
BMC BIOINFORMATICS, 2019, 20 (01)
[43]   A core-attachment based method to detect protein complexes in PPI networks [J].
Wu, Min ;
Li, Xiaoli ;
Kwoh, Chee-Keong ;
Ng, See-Kiong .
BMC BIOINFORMATICS, 2009, 10
[44]   DIP: the Database of Interacting Proteins [J].
Xenarios, I ;
Rice, DW ;
Salwinski, L ;
Baron, MK ;
Marcotte, EM ;
Eisenberg, D .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :289-291
[45]   Improving GO semantic similarity measures by exploring the ontology beneath the terms and modelling uncertainty [J].
Yang, Haixuan ;
Nepusz, Tamas ;
Paccanaro, Alberto .
BIOINFORMATICS, 2012, 28 (10) :1383-1389
[46]  
Zhang XF, 2014, BMC BIOINFORMATICS, V15, DOI [10.1186/1471-2105-15-186, 10.1186/1471-2105-15-335]
[47]   Protein Complex Prediction in Large Ontology Attributed Protein-Protein Interaction Networks [J].
Zhang, Yijia ;
Lin, Hongfei ;
Yang, Zhihao ;
Wang, Jian ;
Li, Yanpeng ;
Xu, Bo .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2013, 10 (03) :729-741