Topology-Free Querying of Protein Interaction Networks

被引:50
作者
Bruckner, Sharon [1 ]
Hueffner, Falk [1 ]
Karp, Richard M. [2 ]
Shamir, Ron [1 ]
Sharan, Roded [1 ]
机构
[1] Tel Aviv Univ, Sch Comp Sci, IL-69978 Tel Aviv, Israel
[2] Int Comp Sci Inst, Berkeley, CA 94704 USA
基金
以色列科学基金会;
关键词
algorithms; gene expression; gene networks; genetic variation; sequence analysis; INTERACTION MAP; YEAST; IDENTIFICATION; PATHWAYS; ALGORITHMS; COMPLEXES; CYTOSCAPE; ALIGNMENT; SOFTWARE; RESOURCE;
D O I
10.1089/cmb.2009.0170
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In the network querying problem, one is given a protein complex or pathway of species A and a protein-protein interaction network of species B; the goal is to identify subnetworks of B that are similar to the query in terms of sequence, topology, or both. Existing approaches mostly depend on knowledge of the interaction topology of the query in the network of species A; however, in practice, this topology is often not known. To address this problem, we develop a topology-free querying algorithm, which we call TORQUE. Given a query, represented as a set of proteins, Torque seeks a matching set of proteins that are sequence-similar to the query proteins and span a connected region of the network, while allowing both insertions and deletions. The algorithm uses alternatively dynamic programming and integer linear programming for the search task. We test Torque with queries from yeast, fly, and human, where we compare it to the QNet topology-based approach, and with queries from less studied species, where only topology-free algorithms apply. TORQUE detects many more matches than QNet, while giving results that are highly functionally coherent.
引用
收藏
页码:237 / 252
页数:16
相关论文
共 48 条
[1]   COLOR-CODING [J].
ALON, N ;
YUSTER, R ;
ZWICK, U .
JOURNAL OF THE ASSOCIATION FOR COMPUTING MACHINERY, 1995, 42 (04) :844-856
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]   Analyzing yeast protein-protein interaction data obtained from different sources [J].
Bader, GD ;
Hogue, CWV .
NATURE BIOTECHNOLOGY, 2002, 20 (10) :991-997
[4]   NetGrep: fast network schema searches in interactomes [J].
Banks, Eric ;
Nabieva, Elena ;
Peterson, Ryan ;
Singh, Mona .
GENOME BIOLOGY, 2008, 9 (09)
[5]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[6]  
Betzler N, 2008, LECT NOTES COMPUT SC, V5029, P31, DOI 10.1007/978-3-540-69068-9_6
[7]   Fourier Meets Mobius: Fast Subset Convolution [J].
Bjorklund, Andreas ;
Husfeldt, Thore ;
Kaski, Petteri ;
Koivisto, Mikko .
STOC 07: PROCEEDINGS OF THE 39TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING, 2007, :67-74
[8]   GO::TermFinder - open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes [J].
Boyle, EI ;
Weng, SA ;
Gollub, J ;
Jin, H ;
Botstein, D ;
Cherry, JM ;
Sherlock, G .
BIOINFORMATICS, 2004, 20 (18) :3710-3715
[9]   Torque: topology-free querying of protein interaction networks [J].
Bruckner, Sharon ;
Huffner, Falk ;
Karp, Richard M. ;
Shamir, Ron ;
Sharan, Roded .
NUCLEIC ACIDS RESEARCH, 2009, 37 :W106-W108
[10]  
Bruckner S, 2009, LECT NOTES COMPUT SC, V5541, P74, DOI 10.1007/978-3-642-02008-7_6