A multi-objective evolutionary algorithm for detecting protein complexes in PPI networks using gene ontology

被引:0
作者
Abbas, Mustafa N. [1 ]
Broneske, David [2 ]
Saake, Gunter [1 ]
机构
[1] Otto von Guericke Univ, Databases & Software Engn, Magdeburg, Germany
[2] German Ctr Higher Educ Res & Sci Studies, Hannover, Germany
关键词
Evolutionary algorithm; Multi-objective optimization; Heuristic perturbation operator; Protein-protein interaction network; Gene ontology; Protein complexes; SEMANTIC SIMILARITY; PREDICTION; DATABASE;
D O I
10.1038/s41598-025-01667-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Detecting protein complexes is crucial in computational biology for understanding cellular mechanisms and facilitating drug discovery. Evolutionary algorithms (EAs) have proven effective in uncovering protein complexes within networks of protein-protein interactions (PPIs). However, their integration with functional insights from gene ontology (GO) annotations remains underexplored. This paper presents two primary contributions: First, it proposes a novel multi-objective optimization model for detecting protein complexes, conceptualizing the task as a problem with inherently conflicting objectives based on biological data. Second, it introduces an innovative gene ontology-based mutation operator, termed the Functional Similarity-Based Protein Translocation Operator (\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$FS-PTO$$\end{document}). This operator enhances collaboration between the canonical model and the GO-informed mutation strategy, thereby improving the algorithm's performance. As far as we know, this is the initial effort to incorporate the biological characteristics of PPIs into both the problem formulation and the development of intricate perturbation strategies. We assess the effectiveness of the proposed multi-objective evolutionary algorithm through experiments conducted on two widely recognized PPI networks and two standard complex datasets provided by the Munich Information Center for Protein Sequences (MIPS). To further assess the robustness of our algorithm, we create artificial networks by introducing different noise levels into the original Saccharomyces cerevisiae (yeast) PPI networks. This allows us to evaluate how perturbations in protein interactions affect the algorithm's performance compared to other approaches. The experimental results highlight that our algorithm outperforms several state-of-the-art methods in accurately identifying protein complexes. Moreover, the findings emphasize the substantial advantages of incorporating our heuristic perturbation operator, which significantly improves the quality of the detected complexes over other evolutionary algorithm-based methods.
引用
收藏
页数:31
相关论文
共 59 条
[1]   An Evolutionary Algorithm With Heuristic Operator for Detecting Protein Complexes in Protein Interaction Networks With Negative Controls [J].
Abbas, Mustafa N. ;
Attea, Bara'a A. ;
Broneske, David ;
Saake, Gunter .
IEEE ACCESS, 2024, 12 :28873-28897
[2]   Mass-spectrometric exploration of proteome structure and function [J].
Aebersold, Ruedi ;
Mann, Matthias .
NATURE, 2016, 537 (7620) :347-355
[3]   Assessment of Discrete BAT-Modified (DBAT-M) Optimization Algorithm for Community Detection in Complex Network [J].
Aggarwal, Kirti ;
Arora, Anuja .
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (02) :2277-2296
[4]   Link communities reveal multiscale complexity in networks [J].
Ahn, Yong-Yeol ;
Bagrow, James P. ;
Lehmann, Sune .
NATURE, 2010, 466 (7307) :761-U11
[5]   Comparative Analysis of Protein Networks: Hard Problems, Practical Solutions [J].
Atias, Nir ;
Sharan, Roded .
COMMUNICATIONS OF THE ACM, 2012, 55 (05) :88-97
[6]   Improving the performance of evolutionary multi-objective co-clustering models for community detection in complex social networks [J].
Attea, Bara'a A. ;
Hariz, Wisam A. ;
Abdulhalim, Mayyadah F. .
SWARM AND EVOLUTIONARY COMPUTATION, 2016, 26 :137-156
[7]   An automated method for finding molecular complexes in large protein interaction networks [J].
Bader, GD ;
Hogue, CW .
BMC BIOINFORMATICS, 2003, 4 (1)
[8]   A multiobjective approach for identifying protein complexes and studying their association in multiple disorders [J].
Bandyopadhyay, Sanghamitra ;
Ray, Sumanta ;
Mukhopadhyay, Anirban ;
Maulik, Ujjwal .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2015, 10
[9]   Network biology:: Understanding the cell's functional organization [J].
Barabási, AL ;
Oltvai, ZN .
NATURE REVIEWS GENETICS, 2004, 5 (02) :101-U15
[10]   The GOA database in 2009-an integrated Gene Ontology Annotation resource [J].
Barrell, Daniel ;
Dimmer, Emily ;
Huntley, Rachael P. ;
Binns, David ;
O'Donovan, Claire ;
Apweiler, Rolf .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D396-D403