FROG - Fingerprinting Genomic Variation Ontology

被引:2
作者
Abinaya, E. [1 ]
Narang, Pankaj [2 ]
Bhardwaj, Anshu [3 ]
机构
[1] SASTRA Univ, Dept Bioinformat, Thanjavur, Tamil Nadu, India
[2] Jawaharlal Nehru Univ, Sch Computat & Integrat Sci, New Delhi 110067, India
[3] CSIR, Open Source Drug Discovery Unit, New Delhi 110001, India
关键词
GENE ONTOLOGY; DATABASE;
D O I
10.1371/journal.pone.0134693
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genetic variations play a crucial role in differential phenotypic outcomes. Given the complexity in establishing this correlation and the enormous data available today, it is imperative to design machine-readable, efficient methods to store, label, search and analyze this data. A semantic approach, FROG: "FingeRprinting Ontology of Genomic variations" is implemented to label variation data, based on its location, function and interactions. FROG has six levels to describe the variation annotation, namely, chromosome, DNA, RNA, protein, variations and interactions. Each level is a conceptual aggregation of logically connected attributes each of which comprises of various properties for the variant. For example, in chromosome level, one of the attributes is location of variation and which has two properties, allosomes or autosomes. Another attribute is variation kind which has four properties, namely, indel, deletion, insertion, substitution. Likewise, there are 48 attributes and 278 properties to capture the variation annotation across six levels. Each property is then assigned a bit score which in turn leads to generation of a binary fingerprint based on the combination of these properties (mostly taken from existing variation ontologies). FROG is a novel and unique method designed for the purpose of labeling the entire variation data generated till date for efficient storage, search and analysis. A web-based platform is designed as a test case for users to navigate sample datasets and generate fingerprints. The platform is available at http://ab-openlab.csir.res.in/frog.
引用
收藏
页数:11
相关论文
共 24 条
[1]  
Adzhubei Ivan, 2013, Curr Protoc Hum Genet, VChapter 7, DOI 10.1002/0471142905.hg0720s76
[2]   An integrated map of genetic variation from 1,092 human genomes [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Schmidt, Jeanette P. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Dinh, Huyen ;
Kovar, Christie ;
Lee, Sandra ;
Lewis, Lora ;
Muzny, Donna ;
Reid, Jeff ;
Wang, Min ;
Wang, Jun ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Li, Zhuo ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Su, Zhe ;
Tai, Shuaishuai ;
Tang, Meifang .
NATURE, 2012, 491 (7422) :56-65
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]  
Bastos HP, 2011, METHODS MOL BIOL, V760, P141, DOI 10.1007/978-1-61779-176-5_9
[5]   GWAS Central: a comprehensive resource for the comparison and interrogation of genome-wide association studies [J].
Beck, Tim ;
Hastings, Robert K. ;
Gollapudi, Sirisha ;
Free, Robert C. ;
Brookes, Anthony J. .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2014, 22 (07) :949-952
[6]  
Becker KG, 2004, NAT GENET, V36, P431, DOI 10.1038/ng0504-431
[7]   The Phenotype and Genotype Experiment Object Model (PaGE-OM): A Robust Data Structure for Information Related to DNA Variation [J].
Brookes, Anthony J. ;
Lehvaslaiho, Heikki ;
Muilu, Juha ;
Shigemoto, Yasumasa ;
Oroguchi, Takashige ;
Tomiki, Takeshi ;
Mukaiyama, Atsuhiro ;
Konagaya, Akihiko ;
Kojima, Toshio ;
Inoue, Ituro ;
Kuroda, Masako ;
Mizushima, Hiroshi ;
Thorisson, Gudmundur A. ;
Dash, Debasis ;
Rajeevan, Haseena ;
Darlison, Matthew W. ;
Woon, Mark ;
Fredman, David ;
Smith, Albert V. ;
Senger, Martin ;
Naito, Kimitoshi ;
Sugawara, Hideaki .
HUMAN MUTATION, 2009, 30 (06) :968-977
[8]   Predicting the insurgence of human genetic diseases associated to single point protein mutations with support vector machines and evolutionary information [J].
Capriotti, E. ;
Calabrese, R. ;
Casadio, R. .
BIOINFORMATICS, 2006, 22 (22) :2729-2734
[9]   Chem2Bio2RDF: a semantic framework for linking and data mining chemogenomic and systems chemical biology data [J].
Chen, Bin ;
Dong, Xiao ;
Jiao, Dazhi ;
Wang, Huijun ;
Zhu, Qian ;
Ding, Ying ;
Wild, David J. .
BMC BIOINFORMATICS, 2010, 11
[10]   Intra- and interindividual epigenetic variation in human germ cells [J].
Flanagan, James M. ;
Popendikyte, Violeta ;
Pozdniakovaite, Natalija ;
Sobolev, Martha ;
Assadzadeh, Abbas ;
Schumacher, Axel ;
Zangeneh, Masood ;
Lau, Lynette ;
Virtanen, Carl ;
Wang, Sun-Chong ;
Petronis, Arturas .
AMERICAN JOURNAL OF HUMAN GENETICS, 2006, 79 (01) :67-84