Discovering Biological Progression Underlying Microarray Samples

被引:39
|
作者
Qiu, Peng [1 ,2 ]
Gentles, Andrew J. [1 ]
Plevritis, Sylvia K. [1 ]
机构
[1] Stanford Univ, Dept Radiol, Stanford, CA 94305 USA
[2] Univ Texas MD Anderson Canc Ctr, Dept Bioinformat & Computat Biol, Houston, TX 77030 USA
关键词
EXPRESSION; CLASSIFICATION; CANCER; NETWORKS;
D O I
10.1371/journal.pcbi.1001123
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In biological systems that undergo processes such as differentiation, a clear concept of progression exists. We present a novel computational approach, called Sample Progression Discovery (SPD), to discover patterns of biological progression underlying microarray gene expression data. SPD assumes that individual samples of a microarray dataset are related by an unknown biological process (i.e., differentiation, development, cell cycle, disease progression), and that each sample represents one unknown point along the progression of that process. SPD aims to organize the samples in a manner that reveals the underlying progression and to simultaneously identify subsets of genes that are responsible for that progression. We demonstrate the performance of SPD on a variety of microarray datasets that were generated by sampling a biological process at different points along its progression, without providing SPD any information of the underlying process. When applied to a cell cycle time series microarray dataset, SPD was not provided any prior knowledge of samples' time order or of which genes are cell-cycle regulated, yet SPD recovered the correct time order and identified many genes that have been associated with the cell cycle. When applied to B-cell differentiation data, SPD recovered the correct order of stages of normal B-cell differentiation and the linkage between preB-ALL tumor cells with their cell origin preB. When applied to mouse embryonic stem cell differentiation data, SPD uncovered a landscape of ESC differentiation into various lineages and genes that represent both generic and lineage specific processes. When applied to a prostate cancer microarray dataset, SPD identified gene modules that reflect a progression consistent with disease stages. SPD may be best viewed as a novel tool for synthesizing biological hypotheses because it provides a likely biological progression underlying a microarray dataset and, perhaps more importantly, the candidate genes that regulate that progression.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Discovering Pair-wise Synergies in Microarray Data
    Chen, Yuan
    Cao, Dan
    Gao, Jun
    Yuan, Zheming
    SCIENTIFIC REPORTS, 2016, 6
  • [2] Discovering monotonic stemness marker genes from time-series stem cell microarray data
    Wang, Hsei-Wei
    Sun, Hsing-Jen
    Chang, Ting-Yu
    Lo, Hung-Hao
    Cheng, Wei-Chung
    Tseng, George C.
    Lin, Chin-Teng
    Chang, Shing-Jyh
    Pal, Nikhil Ranjan
    Chung, I-Fang
    BMC GENOMICS, 2015, 16
  • [3] The Genetic and Immunologic Landscape Underlying the Risk of Malignant Progression in Laryngeal Dysplasia
    Chu, Francesco
    Maffini, Fausto
    Lepanto, Daniela
    Vacirca, Davide
    Taormina, Sergio Vincenzo
    De Berardinis, Rita
    Gandini, Sara
    Vignati, Silvano
    Ranghiero, Alberto
    Rappa, Alessandra
    Chiocca, Susanna
    Barberis, Massimo
    Tagliabue, Marta
    Ansarin, Mohssen
    CANCERS, 2023, 15 (04)
  • [4] The derivation of diagnostic markers of chronic myeloid leukemia progression from microarray data
    Oehler, Vivian G.
    Yeung, Ka Yee
    Choi, Yongjae E.
    Bumgarner, Roger E.
    Raftery, Adrian E.
    Radich, Jerald P.
    BLOOD, 2009, 114 (15) : 3292 - 3298
  • [5] Identifying Genes Relevant to Specific Biological Conditions in Time Course Microarray Experiments
    Singh, Nitesh Kumar
    Repsilber, Dirk
    Liebscher, Volkmar
    Taher, Leila
    Fuellen, Georg
    PLOS ONE, 2013, 8 (10):
  • [6] Causal Path of COPD Progression-Associated Genes in Different Biological Samples
    Mostafaei, Shayan
    Borna, Hojat
    Emamvirdizadeh, Alireza
    Arabfard, Masoud
    Ahmadi, Ali
    Salimian, Jafar
    Salesi, Mahmood
    Jamalkandi, Sadegh Azimzadeh
    COPD-JOURNAL OF CHRONIC OBSTRUCTIVE PULMONARY DISEASE, 2022, 19 (01) : 290 - 299
  • [7] Gene network modular-based classification of microarray samples
    Hu, Pingzhao
    Bull, Shelley B.
    Jiang, Hui
    BMC BIOINFORMATICS, 2012, 13 : S17
  • [8] Detecting Outlier Samples in Microarray Data
    Shieh, Albert D.
    Hung, Yeung Sam
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2009, 8 (01)
  • [9] Microarray Data Integration: Frameworks and a List of Underlying Issues
    Sarmah, Chintanu Kumar
    Samarasinghe, Sandhya
    CURRENT BIOINFORMATICS, 2010, 5 (04) : 280 - 289
  • [10] Significance of gene ranking for classification of microarray samples
    Zhang, Chaolin
    Lu, Xuesong
    Zhang, Xuegong
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2006, 3 (03) : 312 - 320