Analyse multiple disease subtypes and build associated gene networks using genome-wide expression profiles

被引:17
作者
Aibar, Sara [1 ]
Fontanillo, Celia [1 ,3 ]
Droste, Conrad [1 ]
Roson-Burgo, Beatriz [1 ]
Campos-Laborie, Francisco J. [1 ]
Hernandez-Rivas, Jesus M. [2 ]
Rivas, Javier De Las [1 ]
机构
[1] USAL, CSIC, IMBCC,IBSAL, Canc Res Ctr, Salamanca 37007, Spain
[2] USAL, Hosp Univ Salamanca, HUS,IBSAL, Dept Hematol, Salamanca 37007, Spain
[3] CITRE, Seville 41092, Spain
来源
BMC GENOMICS | 2015年 / 16卷
关键词
DOWN-REGULATION; CLASSIFICATION; PACKAGE; MEIS1;
D O I
10.1186/1471-2164-16-S5-S3
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Despite the large increase of transcriptomic studies that look for gene signatures on diseases, there is still a need for integrative approaches that obtain separation of multiple pathological states providing robust selection of gene markers for each disease subtype and information about the possible links or relations between those genes. Results: We present a network-oriented and data-driven bioinformatic approach that searches for association of genes and diseases based on the analysis of genome-wide expression data derived from microarrays or RNA-Seq studies. The approach aims to (i) identify gene sets associated to different pathological states analysed together; (ii) identify a minimum subset within these genes that unequivocally differentiates and classifies the compared disease subtypes; (iii) provide a measurement of the discriminant power of these genes and (iv) identify links between the genes that characterise each of the disease subtypes. This bioinformatic approach is implemented in an R package, named geNetClassifier, available as an open access tool in Bioconductor. To illustrate the performance of the tool, we applied it to two independent datasets: 250 samples from patients with four major leukemia subtypes analysed using expression arrays; another leukemia dataset analysed with RNA-Seq that includes a subtype also present in the previous set. The results show the selection of key deregulated genes recently reported in the literature and assigned to the leukemia subtypes studied. We also show, using these independent datasets, the selection of similar genes in a network built for the same disease subtype. Conclusions: The construction of gene networks related to specific disease subtypes that include parameters such as gene-to-gene association, gene disease specificity and gene discriminant power can be very useful to draw gene-disease maps and to unravel the molecular features that characterize specific pathological states. The application of the bioinformatic tool here presented shows a neat way to achieve such molecular characterization of the diseases using genome-wide expression data.
引用
收藏
页数:10
相关论文
共 30 条
  • [1] Selection bias in gene extraction on the basis of microarray gene-expression data
    Ambroise, C
    McLachlan, GJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) : 6562 - 6566
  • [2] [Anonymous], IN MAN
  • [3] [Anonymous], EBARRAYS UNIFIED APP
  • [4] [Anonymous], SYSTEMS BIOMEDICINE
  • [5] STRUCTURE AND PRE-B LYMPHOCYTE RESTRICTED EXPRESSION OF THE VPREB GENE IN HUMANS AND CONSERVATION OF ITS STRUCTURE IN OTHER MAMMALIAN-SPECIES
    BAUER, SR
    KUDO, A
    MELCHERS, F
    [J]. EMBO JOURNAL, 1988, 7 (01) : 111 - 116
  • [6] Use of gene-expression profiling to identify prognostic subclasses in adult acute myeloid leukemia
    Bullinger, L
    Döhner, K
    Bair, E
    Fröhling, S
    Schlenk, RF
    Tibshirani, R
    Döhner, H
    Pollack, JR
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2004, 350 (16) : 1605 - 1616
  • [7] Cruz JA, 2006, CANCER INFORM, V2, P59
  • [8] GeneSigDB: a manually curated database and resource for analysis of gene expression signatures
    Culhane, Aedin C.
    Schroeder, Markus S.
    Sultana, Razvan
    Picard, Shaita C.
    Martinelli, Enzo N.
    Kelly, Caroline
    Haibe-Kains, Benjamin
    Kapushesky, Misha
    St Pierre, Anne-Alyssa
    Flahive, William
    Picard, Kermshlise C.
    Gusenleitner, Daniel
    Papenhausen, Gerald
    O'Connor, Niall
    Correll, Mick
    Quackenbush, John
    [J]. NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) : D1060 - D1066
  • [9] Pattern recognition in bioinformatics
    de Ridder, Dick
    de Ridder, Jeroen
    Reinders, Marcel J. T.
    [J]. BRIEFINGS IN BIOINFORMATICS, 2013, 14 (05) : 633 - 647
  • [10] Fontanillo C, 2010, PLOS COMPUT BIOL, V6