Associating Gene Ontology Terms with Pfam Protein Domains

Cited by: 2
Authors
Alborzi, Seyed Ziaeddin [1,3]
Devignes, Marie-Dominique [2]
Ritchie, David W. [3]
Affiliations
[1] Univ Lorraine, LORIA, UMR 7503, F-54506 Vandoeuvre Les Nancy, France
[2] CNRS, LORIA, UMR 7503, F-54506 Vandoeuvre Les Nancy, France
[3] Inria Nancy Grand Est, F-54600 Villers Les Nancy, France
Source
BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2017, PT II | 2017, Vol. 10209
Keywords
Protein structure; Protein function; Gene Ontology; Content-based filtering
DOI
10.1007/978-3-319-56154-7_13
Chinese Library Classification number
R318 [Biomedical Engineering]
Subject classification code
0831
Abstract
With the growing number of three-dimensional protein structures in the Protein Data Bank (PDB), there is a need to annotate these structures at the domain level in order to relate protein structure to protein function. Thanks to the SIFTS database, many PDB chains are now cross-referenced with Pfam domains and Gene Ontology (GO) terms. However, these annotations do not include any explicit relationship between individual Pfam domains and GO terms. Creating a direct mapping between GO terms and Pfam domains will therefore provide a new and more detailed level of protein structure annotation. This article presents a novel content-based filtering method called GODM that automatically infers associations between GO terms and Pfam domains directly from existing GO-chain/Pfam-chain associations in the SIFTS database and GO-sequence/Pfam-sequence associations in the UniProt databases. Overall, GODM finds a total of 20,318 non-redundant GO-Pfam associations with an F-measure of 0.98 with respect to the InterPro database, which is treated here as a "Gold Standard". These associations could be used to annotate thousands of PDB chains or protein sequences whose domain composition is known but which currently lack any GO annotation. The GODM database is publicly available at http://godm.loria.fr/.
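As an illustration of the evaluation figure quoted in the abstract, the following minimal sketch shows how an F-measure can be computed for a set of predicted GO-Pfam pairs against a gold-standard set. It assumes the balanced F-measure (harmonic mean of precision and recall); this record does not state which variant or evaluation protocol the authors used. The function name `f_measure` and the identifiers in the toy data are hypothetical placeholders, not real GO or Pfam accessions and not taken from GODM or InterPro.

```python
from typing import Set, Tuple

# A (GO term, Pfam domain) association, represented as a pair of strings.
Pair = Tuple[str, str]


def f_measure(predicted: Set[Pair], gold: Set[Pair]) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) of predicted pairs against a gold set.

    Assumption: balanced F-measure, F1 = 2PR / (P + R).
    """
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy data with placeholder identifiers (not real accessions).
gold = {("GO_a", "PF_x"), ("GO_b", "PF_y"), ("GO_c", "PF_z")}
predicted = {("GO_a", "PF_x"), ("GO_b", "PF_y"), ("GO_d", "PF_w")}

precision, recall, f1 = f_measure(predicted, gold)
print(f"precision={precision:.2f} recall={recall:.2f} F1={f1:.2f}")
```

In this toy case two of the three predicted pairs appear in the gold set, so precision, recall, and F1 are all 2/3.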
Pages: 127-138
Number of pages: 12