Automatic Parameter Learning for Multiple Local Network Alignment

被引:54
作者
Flannick, Jason [1 ]
Novak, Antal
Do, Chuong B.
Srinivasan, Balaji S. [2 ]
Batzoglou, Serafim
机构
[1] Stanford Univ, Dept Comp Sci, James H Clark Ctr, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
关键词
algorithms; computational molecular biology; PROTEIN-INTERACTION; GENETIC INTERACTIONS; FUNCTIONAL MODULES; IDENTIFICATION; COMPLEXES; DATABASE; SEARCH; YEAST; TOOL;
D O I
10.1089/cmb.2009.0099
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We developed Graemlin 2.0, a new multiple network aligner with ( 1) a new multi-stage approach to local network alignment; ( 2) a novel scoring function that can use arbitrary features of a multiple network alignment, such as protein deletions, protein duplications, protein mutations, and interaction losses; ( 3) a parameter learning algorithm that uses a training set of known network alignments to learn parameters for our scoring function and thereby adapt it to any set of networks; and ( 4) an algorithm that uses our scoring function to find approximate multiple network alignments in linear time. We tested Graemlin 2.0's accuracy on protein interaction networks from IntAct, DIP, and the Stanford Network Database. We show that, on each of these datasets, Graemlin 2.0 has higher sensitivity and specificity than existing network aligners. Graemlin 2.0 is available under the GNU public license at http://graemlin.stanford.edu.
引用
收藏
页码:1001 / 1022
页数:22
相关论文
共 47 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   WEIGHTS FOR DATA RELATED BY A TREE [J].
ALTSCHUL, SF ;
CARROLL, RJ ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1989, 207 (04) :647-653
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]   Local graph alignment and motif search in biological networks [J].
Berg, J ;
Lässig, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (41) :14689-14694
[5]   Cross-species analysis of biological networks by Bayesian alignment [J].
Berg, Johannes ;
Lassig, Michael .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (29) :10967-10972
[6]   Identification and analysis of evolutionarily cohesive functional modules in protein networks [J].
Campillos, M ;
von Mering, C ;
Jensen, LJ ;
Bork, P .
GENOME RESEARCH, 2006, 16 (03) :374-382
[7]   Interactome: gateway into systems biology [J].
Cusick, ME ;
Klitgord, N ;
Vidal, M ;
Hill, DE .
HUMAN MOLECULAR GENETICS, 2005, 14 :R171-R181
[8]  
Cusick ME, 2009, NAT METHODS, V6, P39, DOI [10.1038/NMETH.1284, 10.1038/nmeth.1284]
[9]  
DO C, 2006, P 10 ANN INT C COMP, P160
[10]   CONTRAfold: RNA secondary structure prediction without physics-based models [J].
Do, Chuong B. ;
Woods, Daniel A. ;
Batzoglou, Serafim .
BIOINFORMATICS, 2006, 22 (14) :E90-E98