Correcting for sparsity and interdependence in glycomics by accounting for glycan biosynthesis

被引:25
作者
Bao, Bokan [1 ,2 ,3 ]
Kellman, Benjamin P. [1 ,2 ,3 ]
Chiang, Austin W. T. [1 ,4 ]
Zhang, Yujie [1 ]
Sorrentino, James T. [1 ,2 ,3 ]
York, Austin K. [1 ]
Mohammad, Mahmoud A. [5 ]
Haymond, Morey W. [5 ]
Bode, Lars [1 ]
Lewis, Nathan E. [1 ,3 ,4 ]
机构
[1] Univ Calif San Diego, Dept Pediat, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Bioinformat & Syst Biol Grad Program, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Dept Bioengn, La Jolla, CA 92093 USA
[4] Univ Calif San Diego, Novo Nordisk Fdn Ctr Biosustainabil, La Jolla, CA 92093 USA
[5] Agr Res Serv, USDA, Childrens Nutr Res Ctr, Baylor Coll Med,Dept Pediatr, Houston, TX USA
关键词
HIGH-THROUGHPUT; HUMAN-MILK; N-GLYCOSYLATION; ASSOCIATION; DIVERSITY; CELLS; CODE; DB;
D O I
10.1038/s41467-021-25183-5
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Glycans are fundamental cellular building blocks, involved in many organismal functions. Advances in glycomics are elucidating the essential roles of glycans. Still, it remains challenging to properly analyze large glycomics datasets, since the abundance of each glycan is dependent on many other glycans that share many intermediate biosynthetic steps. Furthermore, the overlap of measured glycans can be low across samples. We address these challenges with GlyCompare, a glycomic data analysis approach that accounts for shared biosynthetic steps for all measured glycans to correct for sparsity and non-independence in glycomics, which enables direct comparison of different glycoprofiles and increases statistical power. Using GlyCompare, we study diverse N-glycan profiles from glycoengineered erythropoietin. We obtain biologically meaningful clustering of mutant cell glycoprofiles and identify knockout-specific effects of fucosyltransferase mutants on tetra-antennary structures. We further analyze human milk oligosaccharide profiles and find mother's fucosyltransferase-dependent secretor-status indirectly impact the sialylation. Finally, we apply our method on mucin-type O-glycans, gangliosides, and site-specific compositional glycosylation data to reveal tissues and disease-specific glycan presentations. Our substructure-oriented approach will enable researchers to take full advantage of the growing power and size of glycomics data. Glycomics can uncover important molecular changes but measured glycans are highly interconnected and incompatible with common statistical methods, introducing pitfalls during analysis. Here, the authors develop an approach to identify glycan dependencies across samples to facilitate comparative glycomics.
引用
收藏
页数:14
相关论文
共 56 条
[1]   Associations between human milk oligosaccharides and infant body composition in the first 6 mo of life [J].
Alderete, Tanya L. ;
Autran, Chloe ;
Brekke, Benjamin E. ;
Knight, Rob ;
Bode, Lars ;
Goran, Michael I. ;
Fields, David A. .
AMERICAN JOURNAL OF CLINICAL NUTRITION, 2015, 102 (06) :1381-1388
[2]   Understanding the glycome: an interactive view of glycosylation from glycocompositions to glycoepitopes [J].
Alocci, Davide ;
Ghraichy, Marie ;
Barletta, Elena ;
Gastaldello, Alessandra ;
Mariethoz, Julien ;
Lisacek, Frederique .
GLYCOBIOLOGY, 2018, 28 (06) :349-362
[3]  
Angel PM, 2018, METHODS MOL BIOL, V1788, P225, DOI 10.1007/7651_2017_81
[4]   GlyTouCan 1.0-The international glycan structure repository [J].
Aoki-Kinoshita, Kiyoko ;
Agravat, Sanjay ;
Aoki, Nobuyuki P. ;
Arpinar, Sena ;
Cummings, Richard D. ;
Fujita, Akihiro ;
Fujita, Noriaki ;
Hart, Gerald M. ;
Haslam, Stuart M. ;
Kawasaki, Toshisuke ;
Matsubara, Masaaki ;
Moreman, Kelley W. ;
Okuda, Shujiro ;
Pierce, Michael ;
Ranzinger, Rene ;
Shikanai, Toshihide ;
Shinmachi, Daisuke ;
Solovieva, Elena ;
Suzuki, Yoshinori ;
Tsuchiya, Shinichiro ;
Yamada, Issaku ;
York, William S. ;
Zaia, Joseph ;
Narimatsu, Hisashi .
NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) :D1237-D1242
[5]   On the frequency of protein glycosylation, as deduced from analysis of the SWISS-PROT database [J].
Apweiler, R ;
Hermjakob, H ;
Sharon, N .
BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 1999, 1473 (01) :4-8
[6]   Standardization of PGC-LC-MS-based glycomics for sample specific glycotyping [J].
Ashwood, Christopher ;
Pratt, Brian ;
MacLean, Brendan X. ;
Gundry, Rebekah L. ;
Packer, Nicolle H. .
ANALYST, 2019, 144 (11) :3601-3612
[7]   Human Milk Oligosaccharide Concentrations Are Associated with Multiple Fixed and Modifiable Maternal Characteristics, Environmental Factors, and Feeding Practices [J].
Azad, Meghan B. ;
Robertson, Bianca ;
Atakora, Faisal ;
Becker, Allan B. ;
Subbarao, Padmaja ;
Moraes, Theo J. ;
Mandhane, Piushkumar J. ;
Turvey, Stuart E. ;
Lefebvre, Diana L. ;
Sears, Malcolm R. ;
Bode, Lars .
JOURNAL OF NUTRITION, 2018, 148 (11) :1733-1742
[8]   Systematic Evaluation of Normalization Methods for Glycomics Data Based on Performance of Network Inference [J].
Benedetti, Elisa ;
Gerstner, Nathalie ;
Pucic-Bakovic, Maja ;
Keser, Toma ;
Reiding, Karli R. ;
Ruhaak, L. Renee ;
Stambuk, Tamara ;
Selman, Maurice H. J. ;
Rudan, Igor ;
Polasek, Ozren ;
Hayward, Caroline ;
Beekman, Marian ;
Slagboom, Eline ;
Wuhrer, Manfred ;
Dunlop, Malcolm G. ;
Lauc, Gordan ;
Krumsiek, Jan .
METABOLITES, 2020, 10 (07) :1-16
[9]   Network inference from glycoproteomics data reveals new reactions in the IgG glycosylation pathway [J].
Benedetti, Elisa ;
Pucic-Bakovic, Maja ;
Keser, Toma ;
Wahl, Annika ;
Hassinen, Antti ;
Yang, Jeong-Yeh ;
Liu, Lin ;
Trbojevic-Akmacic, Irena ;
Razdorov, Genadij ;
Stambuk, Jerko ;
Klaric, Lucija ;
Ugrina, Ivo ;
Selman, Maurice H. J. ;
Wuhrer, Manfred ;
Rudan, Igor ;
Polasek, Ozren ;
Hayward, Caroline ;
Grallert, Harald ;
Strauch, Konstantin ;
Peters, Annette ;
Meitinger, Thomas ;
Gieger, Christian ;
Vilaj, Marija ;
Boons, Geert-Jan ;
Moremen, Kelley W. ;
Ovchinnikova, Tatiana ;
Bovin, Nicolai ;
Kellokumpu, Sakari ;
Theis, Fabian J. ;
Lauc, Gordan ;
Krumsiek, Jan .
NATURE COMMUNICATIONS, 2017, 8
[10]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300