IntLIM: integration using linear models of metabolomics and gene expression data

被引:28
作者
Siddiqui, Jalal K. [1 ]
Baskin, Elizabeth [1 ]
Liu, Mingrui [1 ]
Cantemir-Stone, Carmen Z. [1 ]
Zhang, Bofei [1 ,5 ]
Bonneville, Russell [2 ,3 ]
McElroy, Joseph P. [4 ]
Coombes, Kevin R. [1 ]
Mathe, Ewy A. [1 ]
机构
[1] Ohio State Univ, Coll Med, Dept Biomed Informat, Columbus, OH 43210 USA
[2] Ohio State Univ, Biomed Sci Grad Program, Columbus, OH 43210 USA
[3] Ohio State Univ, Dept Internal Med, Comprehens Canc Ctr, Columbus, OH 43210 USA
[4] Ohio State Univ, Ctr Biostat, Columbus, OH 43210 USA
[5] Ohio State Univ, Biomed Engn Undegrad Program, Columbus, OH 43210 USA
来源
BMC BIOINFORMATICS | 2018年 / 19卷
关键词
Metabolomics; Transcriptomics; Linear Modeling; Integration; PROSTATE-CANCER; R PACKAGE; BIOMARKERS; BREAST; TRANSCRIPTOMICS; METAANALYSIS; METABOLISM; CHALLENGES; DISCOVERY; MIGRATION;
D O I
10.1186/s12859-018-2085-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Integration of transcriptomic and metabolomic data improves functional interpretation of disease-related metabolomic phenotypes, and facilitates discovery of putative metabolite biomarkers and gene targets. For this reason, these data are increasingly collected in large (> 100 participants) cohorts, thereby driving a need for the development of user-friendly and open-source methods/tools for their integration. Of note, clinical/translational studies typically provide snapshot (e.g. one time point) gene and metabolite profiles and, oftentimes, most metabolites measured are not identified. Thus, in these types of studies, pathway/network approaches that take into account the complexity of transcript-metabolite relationships may neither be applicable nor readily uncover novel relationships. With this in mind, we propose a simple linear modeling approach to capture disease-(or other phenotype) specific gene-metabolite associations, with the assumption that co-regulation patterns reflect functionally related genes and metabolites. Results: The proposed linear model, metabolite similar to gene + phenotype + gene: phenotype, specifically evaluates whether gene-metabolite relationships differ by phenotype, by testing whether the relationship in one phenotype is significantly different from the relationship in another phenotype (via a statistical interaction gene: phenotype p-value). Statistical interaction p-values for all possible gene-metabolite pairs are computed and significant pairs are then clustered by the directionality of associations (e.g. strong positive association in one phenotype, strong negative association in another phenotype). We implemented our approach as an R package, IntLIM, which includes a user-friendly R Shiny web interface, thereby making the integrative analyses accessible to non-computational experts. We applied IntLIM to two previously published datasets, collected in the NCI-60 cancer cell lines and in human breast tumor and non-tumor tissue, for which transcriptomic and metabolomic data are available. We demonstrate that IntLIM captures relevant tumor-specific gene-metabolite associations involved in known cancer-related pathways, including glutamine metabolism. Using IntLIM, we also uncover biologically relevant novel relationships that could be further tested experimentally. Conclusions: IntLIM provides a user-friendly, reproducible framework to integrate transcriptomic and metabolomic data and help interpret metabolomic data and uncover novel gene-metabolite relationships. The IntLIM R package is publicly available in GitHub (https://github. com/mathelab/IntLIM) and includes a user-friendly web application, vignettes, sample data and data/code to reproduce results.
引用
收藏
页数:12
相关论文
共 70 条
  • [1] Affymetrix I, 2002, AFFYMETRIX 1 STAT AL
  • [2] Nucleotide metabolism, oncogene-induced senescence and cancer
    Aird, Katherine M.
    Zhang, Rugang
    [J]. CANCER LETTERS, 2015, 356 (02) : 204 - 210
  • [3] From Krebs to clinic: glutamine metabolism to cancer therapy
    Altman, Brian J.
    Stine, Zachary E.
    Dang, Chi V.
    [J]. NATURE REVIEWS CANCER, 2016, 16 (10) : 619 - 634
  • [4] [Anonymous], 2009, CALC INT P VAL FUNCT
  • [5] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
  • [6] Circadian clocks and breast cancer
    Blakeman, Victoria
    Williams, Jack L.
    Meng, Qing-Jun
    Streuli, Charles H.
    [J]. BREAST CANCER RESEARCH, 2016, 18
  • [7] Letter to the Editor: On the term 'interaction' and related phrases in the literature on Random Forests
    Boulesteix, Anne-Laure
    Janitza, Silke
    Hapfelmeier, Alexander
    Van Steen, Kristel
    Strobl, Carolin
    [J]. BRIEFINGS IN BIOINFORMATICS, 2015, 16 (02) : 338 - 345
  • [8] Coordinated Concentration Changes of Transcripts and Metabolites in Saccharomyces cerevisiae
    Bradley, Patrick H.
    Brauer, Matthew J.
    Rabinowitz, Joshua D.
    Troyanskaya, Olga G.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (01)
  • [9] Integrated Metabolite and Gene Expression Profiles Identify Lipid Biomarkers Associated With Progression of Hepatocellular Carcinoma and Patient Outcomes
    Budhu, Anuradha
    Roessler, Stephanie
    Zhao, Xuelian
    Yu, Zhipeng
    Forgues, Marshonna
    Ji, Junfang
    Karoly, Edward
    Qin, Lun-Xiu
    Ye, Qing-Hai
    Jia, Hu-Liang
    Fan, Jia
    Sun, Hui-Chuan
    Tang, Zhao-You
    Wang, Xin Wei
    [J]. GASTROENTEROLOGY, 2013, 144 (05) : 1066 - +
  • [10] Buescher JM, 2016, CANCER METAB, V4, DOI 10.1186/s40170-016-0143-y