Introducing glycomics data into the Semantic Web

Cited by: 21
Authors
Aoki-Kinoshita, Kiyoko F. [1]
Bolleman, Jerven [2]
Campbell, Matthew P. [3]
Kawano, Shin [4]
Kim, Jin-Dong [4]
Luetteke, Thomas [5]
Matsubara, Masaaki [6]
Okuda, Shujiro [7, 8]
Ranzinger, Rene [9]
Sawaki, Hiromichi [10]
Shikanai, Toshihide [10]
Shinmachi, Daisuke [10]
Suzuki, Yoshinori [10]
Toukach, Philip [11]
Yamada, Issaku [6]
Packer, Nicolle H. [3]
Narimatsu, Hisashi [10]
Affiliations
[1] Soka Univ, Dept Bioinformat, Fac Engn, Hachioji, Tokyo 1928577, Japan
[2] Swiss Inst Bioinformat, CH-1211 Geneva 4, Switzerland
[3] Macquarie Univ, Biomol Frontiers Res Ctr, Sydney, NSW 2109, Australia
[4] Res Org Informat & Syst, Database Ctr Life Sci, Bunkyo Ku, Tokyo 1130032, Japan
[5] Univ Giessen, Inst Vet Physiol & Biochem, D-35392 Giessen, Germany
[6] Noguchi Inst, Lab Glycoorgan Chem, Itabashi Ku, Tokyo 1730003, Japan
[7] Ritsumeikan Univ, Dept Bioinformat, Coll Life Sci, Kusatsu, Shiga 5258577, Japan
[8] Niigata Univ, Grad Sch Med & Dent Sci, Chuo Ku, Niigata 9518510, Japan
[9] Univ Georgia, Complex Carbohydrate Res Ctr, Athens, GA 30602 USA
[10] Natl Inst Adv Ind Sci & Technol, Res Ctr Med Glycosci, Tsukuba, Ibaraki 3058568, Japan
[11] ND Zelinskii Inst Organ Chem, NMR Lab, Moscow 119991, Russia
Funding
Russian Foundation for Basic Research; Japan Science and Technology Agency
Keywords
BioHackathon; Carbohydrate; Data integration; Glycan; Glycoconjugate; SPARQL; RDF standard; Carbohydrate structure database; PROTEIN DATA-BANK; CARBOHYDRATE STRUCTURES; GLYCOSCIENCES.DE; RESOURCES
DOI
10.1186/2041-1480-4-39
Chinese Library Classification number
Q [Biological sciences]
Discipline classification code
07; 0710; 09
Abstract
Background: Glycoscience is a research field focusing on complex carbohydrates (otherwise known as glycans), which can, for example, serve as "switches" that toggle between different functions of a glycoprotein or glycolipid. Owing to advances in the glycomics technologies used to characterize glycan structures, many glycomics databases are now publicly available and provide useful information for glycoscience research. However, these databases have almost no links to other life science databases.
Results: To implement Semantic Web support for glycomics research as efficiently as possible, the developers of major glycomics databases agreed on a minimal standard for representing glycan structure and annotation information using RDF (Resource Description Framework). All of the participants then implemented this prototype standard and generated preliminary RDF versions of their data. To test the utility of the converted data, all of the data sets were uploaded into a Virtuoso triple store, and several SPARQL queries were run as proofs of concept to show that the Semantic Web supports queries across databases that were previously difficult to implement.
Conclusions: We successfully retrieved our target information by linking UniCarbKB, GlycomeDB and JCGGDB in a single SPARQL query. We also tested queries linking UniProt with GlycoEpitope, as well as lectin data with GlycomeDB through PDB. As a result, we were able to link proteomics data with glycomics data through Semantic Web technologies, allowing for more flexible queries across these domains.
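The cross-database linking described in the abstract can be illustrated with a toy example. The sketch below is not SPARQL and does not use Virtuoso or the authors' RDF standard; it is a plain-Python stand-in that joins triple patterns over a small in-memory list, roughly the way a basic graph pattern is matched. All identifiers, prefixes, and predicates (`dbA:`, `dbB:`, `dbC:`, `ex:sameStructureAs`, `ex:attachedTo`, `protein:P00001`) are hypothetical and chosen only for illustration.

```python
# Toy triples standing in for three RDF data sets. Every identifier and
# predicate here is hypothetical; none comes from the actual databases.
TRIPLES = [
    ("dbA:structure1", "ex:sameStructureAs", "dbB:glycan42"),
    ("dbB:glycan42", "ex:sameStructureAs", "dbC:entry7"),
    ("dbC:entry7", "ex:attachedTo", "protein:P00001"),
    ("dbA:structure2", "ex:sameStructureAs", "dbB:glycan99"),
]


def is_var(term):
    """Variables are written with a leading '?', as in SPARQL."""
    return term.startswith("?")


def match_pattern(pattern, triple, bindings):
    """Unify one triple pattern with one triple; return extended bindings or None."""
    new = dict(bindings)
    for p, t in zip(pattern, triple):
        if is_var(p):
            # Bind the variable, or check it against an earlier binding.
            if new.setdefault(p, t) != t:
                return None
        elif p != t:
            return None
    return new


def query(patterns, triples):
    """Evaluate a conjunction of triple patterns by joining them one at a time."""
    solutions = [{}]
    for pattern in patterns:
        solutions = [
            extended
            for bindings in solutions
            for triple in triples
            if (extended := match_pattern(pattern, triple, bindings)) is not None
        ]
    return solutions


# Follow a structure from data set A through B to C, then to a protein.
results = query(
    [
        ("?a", "ex:sameStructureAs", "?b"),
        ("?b", "ex:sameStructureAs", "?c"),
        ("?c", "ex:attachedTo", "?protein"),
    ],
    TRIPLES,
)
print(results)
```

Because the shared variables `?b` and `?c` must take the same value in consecutive patterns, only the chain that runs through all three toy data sets survives the join. A real triple store would do the same kind of matching over URIs, with indexing and query planning that this sketch omits.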
Pages: 7