An algorithm to cluster orthologous proteins across multiple genomes

Cited by: 0
Authors
Kim, Sunshin [1 ]
Rhee, Chung Sei [1 ]
Choi, Jung-Do [2 ]
Affiliations
[1] Chungbuk Natl Univ, Sch Elect Engn & Comp Engn, Cheongju, South Korea
[2] Chungbuk Natl Univ, Dept Biochem, Cheongju, South Korea
DOI
10.1109/ICISS.2008.42
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
In GOLD (Genomes OnLine Database), the re [...] Constructing OPCs (Orthologous Protein Clusters) from the best reciprocal BLAST hits among multiple complete genomes has been a successful technique for research on genome evolution and for functional annotation of newly sequenced genomes. However, building the OPCs by hand and through biological analysis is time-consuming and labor-intensive. To reduce this load, we propose an automatic parallel computing method that clusters OPs (Orthologous Proteins) from multiple complete genomes. For a systematic representation of OP clustering, a mathematical framework based on a vector is suggested. On the hypercube model, the algorithm starts by parallelizing the clustering of pairwise genomes (CPG) and then parallelizes all the processes of clustering multiple genomes (CMG). In CPG, all pairwise comparisons are divided into sub-pairwise comparisons, and the local results of clustering OPs between two genomes are integrated and broadcast to each processor. In CMG, all the clustering processes are split into sub-clustering processes, and the local results of OPCs among multiple genomes are broadcast to each processor and integrated there.
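The pipeline described in the abstract can be illustrated with a small serial sketch. This is not the authors' implementation: the function names, the toy protein identifiers, and the scores below are all hypothetical, the hypercube exchange is only simulated in a single process, and union-find is swapped in as a simple way to merge pairwise results into multi-genome clusters (the abstract's vector representation is not reproduced here).

```python
def best_hits(scores):
    """Map each query protein to its highest-scoring subject protein."""
    best = {}
    for (query, subject), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b, b_vs_a):
    """Pairwise step (CPG analogue): keep pairs that are each other's best hit."""
    forward = best_hits(a_vs_b)
    reverse = best_hits(b_vs_a)
    return {(a, b) for a, b in forward.items() if reverse.get(b) == a}


def hypercube_allgather(local_results):
    """Simulate an all-gather on a hypercube of p = 2**d nodes.

    In round k, node i exchanges everything it holds with node i XOR 2**k,
    so after d rounds every node holds the union of all local results.
    """
    p = len(local_results)
    assert p > 0 and p & (p - 1) == 0, "node count must be a power of two"
    held = [set(result) for result in local_results]
    step = 1
    while step < p:
        held = [held[i] | held[i ^ step] for i in range(p)]
        step <<= 1
    return held


def cluster(pairs):
    """Multi-genome step (CMG analogue): merge linked proteins with union-find."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b
    groups = {}
    for protein in parent:
        groups.setdefault(find(protein), set()).add(protein)
    return sorted(sorted(group) for group in groups.values())


# Hypothetical BLAST-like scores for three toy genomes A, B and C.
hits = {
    ("A", "B"): {("A1", "B1"): 200, ("A1", "B2"): 50, ("A2", "B2"): 180},
    ("B", "A"): {("B1", "A1"): 195, ("B2", "A2"): 170, ("B2", "A1"): 40},
    ("A", "C"): {("A1", "C1"): 150, ("A2", "C1"): 90},
    ("C", "A"): {("C1", "A1"): 140},
    ("B", "C"): {("B1", "C1"): 130},
    ("C", "B"): {("C1", "B1"): 120},
}

rbh_ab = reciprocal_best_hits(hits[("A", "B")], hits[("B", "A")])
rbh_ac = reciprocal_best_hits(hits[("A", "C")], hits[("C", "A")])
rbh_bc = reciprocal_best_hits(hits[("B", "C")], hits[("C", "B")])

# Three comparisons on a 4-node hypercube; the fourth node starts empty.
gathered = hypercube_allgather([rbh_ab, rbh_ac, rbh_bc, set()])
clusters = cluster(gathered[0])
print(clusters)
```

With this toy input, each simulated node ends up holding the same four reciprocal pairs, and the merge step groups A1, B1 and C1 into one cluster and A2 and B2 into another. Merging by connected components is only one possible integration rule; a stricter scheme could require every pair within a cluster to be a reciprocal best hit.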
Pages: 32 / +
Page count: 3