Modifying the DPClus algorithm for identifying protein complexes based on new topological structures

被引:193
作者
Li, Min [1 ]
Chen, Jian-er [1 ,2 ]
Wang, Jian-xin [1 ]
Hu, Bin [1 ]
Chen, Gang [1 ]
机构
[1] Cent S Univ, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China
[2] Texas A&M Univ, Dept Comp Sci, College Stn, TX 77843 USA
基金
美国国家科学基金会;
关键词
D O I
10.1186/1471-2105-9-398
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Identification of protein complexes is crucial for understanding principles of cellular organization and functions. As the size of protein-protein interaction set increases, a general trend is to represent the interactions as a network and to develop effective algorithms to detect significant complexes in such networks. Results: Based on the study of known complexes in protein networks, this paper proposes a new topological structure for protein complexes, which is a combination of subgraph diameter (or average vertex distance) and subgraph density. Following the approach of that of the previously proposed clustering algorithm DPClus which expands clusters starting from seeded vertices, we present a clustering algorithm IPCA based on the new topological structure for identifying complexes in large protein interaction networks. The algorithm IPCA is applied to the protein interaction network of Sacchromyces cerevisiae and identifies many well known complexes. Experimental results show that the algorithm IPCA recalls more known complexes than previously proposed clustering algorithms, including DPClus, CFinder, LCMA, MCODE, RNSC and STM. Conclusion: The proposed algorithm based on the new topological structure makes it possible to identify dense subgraphs in protein interaction networks, many of which correspond to known protein complexes. The algorithm is robust to the known high rate of false positives and false negatives in data from high-throughout interaction techniques. The program is available at http://netlab.csu.edu.cn/bioinformatics/limin/ IPCA.
引用
收藏
页数:16
相关论文
共 31 条
  • [1] Proteomics: The society of proteins
    Abbott, A
    [J]. NATURE, 2002, 417 (6892) : 894 - 896
  • [2] CFinder:: locating cliques and overlapping modules in biological networks
    Adamcsek, B
    Palla, G
    Farkas, IJ
    Derényi, I
    Vicsek, T
    [J]. BIOINFORMATICS, 2006, 22 (08) : 1021 - 1023
  • [3] Development and implementation of an algorithm for detection of protein complexes in large interaction networks
    Altaf-Ul-Amin, Md
    Shinbo, Yoko
    Mihara, Kenji
    Kurokawa, Ken
    Kanaya, Shigehiko
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [4] An automated method for finding molecular complexes in large protein interaction networks
    Bader, GD
    Hogue, CW
    [J]. BMC BIOINFORMATICS, 2003, 4 (1)
  • [5] Affinity purification-mass spectrometry - Powerful tools for the characterization of protein complexes
    Bauer, A
    Kuster, B
    [J]. EUROPEAN JOURNAL OF BIOCHEMISTRY, 2003, 270 (04): : 570 - 578
  • [6] Evaluation of clustering algorithms for protein-protein interaction networks
    Brohee, Sylvain
    van Helden, Jacques
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [7] Topology of small-world networks of protein-protein complex structures
    del Sol, A
    Fujihashi, H
    O'Meara, P
    [J]. BIOINFORMATICS, 2005, 21 (08) : 1311 - 1315
  • [8] Small-world network approach to identify key residues in protein-protein interaction
    del Sol, A
    O'Meara, P
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 58 (03) : 672 - 682
  • [9] Bioinformatics analysis of experimentally determined protein complexes in the yeast Saccharomyces cerevisiae
    Dezso, Z
    Oltvai, ZN
    Barabási, AL
    [J]. GENOME RESEARCH, 2003, 13 (11) : 2450 - 2454
  • [10] Cryo-electron microscopy as an investigative tool: the ribosome as an example
    Frank, J
    [J]. BIOESSAYS, 2001, 23 (08) : 725 - 732