Comprehensive analysis of the N-glycan biosynthetic pathway using bioinformatics to generate UniCorn: A theoretical N-glycan structure database

被引:22
作者
Akune, Yukie [1 ,2 ]
Lin, Chi-Hung [1 ]
Abrahams, Jodie L. [1 ]
Zhang, Jingyu [1 ]
Packer, Nicolle H. [1 ]
Aoki-Kinoshita, Kiyoko F. [2 ]
Campbell, Matthew P. [1 ]
机构
[1] Macquarie Univ, Fac Sci & Engn, Dept Chem & Biomol Sci, Balaclava Rd, N Ryde, NSW 2109, Australia
[2] Soka Univ, Grad Sch Engn, Dept Bioinformat, 1-236 Tangi, Hachioji, Tokyo 1928577, Japan
关键词
Glycoinformatics; N-glycan synthetic pathway; Human glycosyltransferases; SYMBOL NOMENCLATURE; MATHEMATICAL-MODEL; PROTEIN GLYCOSYLATION; LINKED GLYCANS; RESOURCE; GLYCOPROTEOMICS; KNOWLEDGE; UNICARBKB; PLATFORM; LIBRARY;
D O I
10.1016/j.carres.2016.05.012
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Glycan structures attached to proteins are comprised of diverse monosaccharide sequences and linkages that are produced from precursor nucleotide-sugars by a series of glycosyltransferases. Databases of these structures are an essential resource for the interpretation of analytical data and the development of bioinformatics tools. However, with no template to predict what structures are possible the human glycan structure databases are incomplete and rely heavily on the curation of published, experimentally determined, glycan structure data. In this work, a library of 45 human glycosyltransferases was used to generate a theoretical database of N-glycan structures comprised of 15 or less monosaccharide residues. Enzyme specificities were sourced from major online databases including Kyoto Encyclopedia of Genes and Genomes (KEGG) Glycan, Consortium for Functional Glycomics (CFG), Carbohydrate-Active enZymes (CAZy), GlycoGene DataBase (GGDB) and BRENDA. Based on the known activities, more than 1.1 million theoretical structures and 4.7 million synthetic reactions were generated and stored in our database called UniCorn. Furthermore, we analyzed the differences between the predicted glycan structures in UniCorn and those contained in UniCarbKB (www.unicarbkb.org), a database which stores experimentally described glycan structures reported in the literature, and demonstrate that UniCorn can be used to aid in the assignment of ambiguous structures whilst also serving as a discovery database. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:56 / 63
页数:8
相关论文
共 55 条
[1]   GlycoPattern: a web platform for glycan array mining [J].
Agravat, Sanjay B. ;
Saltz, Joel H. ;
Cummings, Richard D. ;
Smith, David F. .
BIOINFORMATICS, 2014, 30 (23) :3417-3418
[2]   The RINGS Resource for Glycome Informatics Analysis and Data Mining on the Web [J].
Akune, Yukie ;
Hosoda, Masae ;
Kaiya, Sakiko ;
Shinmachi, Daisuke ;
Aoki-Kinoshita, Kiyoko F. .
OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2010, 14 (04) :475-486
[3]  
[Anonymous], GLYCOCONJ J
[4]   GlyTouCan 1.0-The international glycan structure repository [J].
Aoki-Kinoshita, Kiyoko ;
Agravat, Sanjay ;
Aoki, Nobuyuki P. ;
Arpinar, Sena ;
Cummings, Richard D. ;
Fujita, Akihiro ;
Fujita, Noriaki ;
Hart, Gerald M. ;
Haslam, Stuart M. ;
Kawasaki, Toshisuke ;
Matsubara, Masaaki ;
Moreman, Kelley W. ;
Okuda, Shujiro ;
Pierce, Michael ;
Ranzinger, Rene ;
Shikanai, Toshihide ;
Shinmachi, Daisuke ;
Solovieva, Elena ;
Suzuki, Yoshinori ;
Tsuchiya, Shinichiro ;
Yamada, Issaku ;
York, William S. ;
Zaia, Joseph ;
Narimatsu, Hisashi .
NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) :D1237-D1242
[5]   A novel Linear Code® nomenclature for complex carbohydrates [J].
Banin, E ;
Neuberger, Y ;
Altshuler, Y ;
Halevi, A ;
Inbar, O ;
Nir, D ;
Dukler, A .
TRENDS IN GLYCOSCIENCE AND GLYCOTECHNOLOGY, 2002, 14 (77) :127-137
[6]   UniProt: a hub for protein information [J].
Bateman, Alex ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Apweiler, Rolf ;
Alpi, Emanuele ;
Antunes, Ricardo ;
Arganiska, Joanna ;
Bely, Benoit ;
Bingley, Mark ;
Bonilla, Carlos ;
Britto, Ramona ;
Bursteinas, Borisas ;
Chavali, Gayatri ;
Cibrian-Uhalte, Elena ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Fazzini, Francesco ;
Gane, Paul ;
Cas-tro, Leyla Garcia ;
Garmiri, Penelope ;
Hatton-Ellis, Emma ;
Hieta, Reija ;
Huntley, Rachael ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
MacDougall, Alistair ;
Mutowo, Prudence ;
Nightin-gale, Andrew ;
Orchard, Sandra ;
Pichler, Klemens ;
Poggioli, Diego ;
Pundir, Sangya ;
Pureza, Luis ;
Qi, Guoying ;
Rosanoff, Steven ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Turner, Edward ;
Volynkin, Vladimir ;
Wardell, Tony ;
Watkins, Xavier ;
Zellner, Hermann ;
Cowley, Andrew ;
Figueira, Luis ;
Li, Weizhong ;
McWilliam, Hamish .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D204-D212
[7]   GlycoFly: A Database of Drosophila N-linked Glycoproteins Identified Using SPEG-MS Techniques [J].
Baycin-Hizal, Deniz ;
Tian, Yuan ;
Akan, Ilhan ;
Jacobson, Elena ;
Clark, Dean ;
Chu, Jeffrey ;
Palter, Karen ;
Zhang, Hui ;
Betenbaugh, Michael J. .
JOURNAL OF PROTEOME RESEARCH, 2011, 10 (06) :2777-2784
[8]   GlycoBase and autoGU: tools for HPLC-based glycan analysis [J].
Campbell, Matthew P. ;
Royle, Louise ;
Radcliffe, Catherine M. ;
Dwek, Raymond A. ;
Rudd, Pauline M. .
BIOINFORMATICS, 2008, 24 (09) :1214-1216
[9]   UniCarbKB: New database features for integrating glycan structure abundance, compositional glycoproteomics data, and disease associations [J].
Campbell, Matthew P. ;
Packer, Nicolle H. .
BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 2016, 1860 (08) :1669-1675
[10]   UniCarbKB: building a knowledge platform for glycoproteomics [J].
Campbell, Matthew P. ;
Peterson, Robyn ;
Mariethoz, Julien ;
Gasteiger, Elisabeth ;
Akune, Yukie ;
Aoki-Kinoshita, Kiyoko F. ;
Lisacek, Frederique ;
Packer, Nicolle H. .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D215-D221