Discovering Biological Progression Underlying Microarray Samples

被引:39
作者
Qiu, Peng [1 ,2 ]
Gentles, Andrew J. [1 ]
Plevritis, Sylvia K. [1 ]
机构
[1] Stanford Univ, Dept Radiol, Stanford, CA 94305 USA
[2] Univ Texas MD Anderson Canc Ctr, Dept Bioinformat & Computat Biol, Houston, TX 77030 USA
关键词
EXPRESSION; CLASSIFICATION; CANCER; NETWORKS;
D O I
10.1371/journal.pcbi.1001123
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In biological systems that undergo processes such as differentiation, a clear concept of progression exists. We present a novel computational approach, called Sample Progression Discovery (SPD), to discover patterns of biological progression underlying microarray gene expression data. SPD assumes that individual samples of a microarray dataset are related by an unknown biological process (i.e., differentiation, development, cell cycle, disease progression), and that each sample represents one unknown point along the progression of that process. SPD aims to organize the samples in a manner that reveals the underlying progression and to simultaneously identify subsets of genes that are responsible for that progression. We demonstrate the performance of SPD on a variety of microarray datasets that were generated by sampling a biological process at different points along its progression, without providing SPD any information of the underlying process. When applied to a cell cycle time series microarray dataset, SPD was not provided any prior knowledge of samples' time order or of which genes are cell-cycle regulated, yet SPD recovered the correct time order and identified many genes that have been associated with the cell cycle. When applied to B-cell differentiation data, SPD recovered the correct order of stages of normal B-cell differentiation and the linkage between preB-ALL tumor cells with their cell origin preB. When applied to mouse embryonic stem cell differentiation data, SPD uncovered a landscape of ESC differentiation into various lineages and genes that represent both generic and lineage specific processes. When applied to a prostate cancer microarray dataset, SPD identified gene modules that reflect a progression consistent with disease stages. SPD may be best viewed as a novel tool for synthesizing biological hypotheses because it provides a likely biological progression underlying a microarray dataset and, perhaps more importantly, the candidate genes that regulate that progression.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Multiclass classification of microarray data samples with a reduced number of genes
    Tapia, Elizabeth
    Ornella, Leonardo
    Bulacio, Pilar
    Angelone, Laura
    BMC BIOINFORMATICS, 2011, 12
  • [22] Regulatory mechanisms underlying sepsis progression in patients with tumor necrosis factor-α genetic variations
    Liu, Yangzhou
    Han, Ning
    Li, Qinchuan
    Li, Zengchun
    EXPERIMENTAL AND THERAPEUTIC MEDICINE, 2016, 12 (01) : 323 - 328
  • [23] Progression model tissue microarray (TMA) for the study of uterine carcinomas
    Arafa, Mohammad
    Boniver, Jacques
    Delvenne, Philippe
    DISEASE MARKERS, 2010, 28 (05) : 267 - 272
  • [24] The Application of Artificial Intelligence to Microarray Data: Identification of a Novel Gene Signature to Identify Bladder Cancer Progression
    Catto, James W. F.
    Abbod, Maysam F.
    Wild, Peter J.
    Linkens, Derek A.
    Pilarsky, Christian
    Rehman, Ishtiaq
    Rosario, Derek J.
    Denzinger, Stefan
    Burger, Maximilian
    Stoehr, Robert
    Knuechel, Ruth
    Hartmann, Arndt
    Hamdy, Freddie C.
    EUROPEAN UROLOGY, 2010, 57 (03) : 398 - 406
  • [25] Pathological role of RAGE underlying progression of various diseases: its potential as biomarker and therapeutic target
    Sarkar, Sinjini
    NAUNYN-SCHMIEDEBERGS ARCHIVES OF PHARMACOLOGY, 2024, : 3467 - 3487
  • [26] Stepwise progression of invasive mucinous adenocarcinoma based on radiological and biological characteristics
    Goto, Eisuke
    Takamochi, Kazuya
    Kishikawa, Satsuki
    Hayashi, Takuo
    Ueda, Takuya
    Hattori, Aritoshi
    Fukui, Mariko
    Matsunaga, Takeshi
    Suzuki, Kenji
    LUNG CANCER, 2023, 184
  • [27] Underlying Genes Involved in Atherosclerotic Macrophages: Insights from Microarray Data Mining
    Wang, Weihan
    Zhang, Kai
    Zhang, Hao
    Li, Mengqi
    Zhao, Yan
    Wang, Bangyue
    Xin, Wenqiang
    Yang, Weidong
    Zhang, Jianning
    Yue, Shuyuan
    Yang, Xinyu
    MEDICAL SCIENCE MONITOR, 2019, 25 : 9949 - 9962
  • [28] Molecular mechanisms underlying endometriosis pathogenesis revealed by bioinformatics analysis of microarray data
    Ping, Shengmin
    Ma, Chengbin
    Liu, Ping
    Yang, Longtao
    Yang, Xiaoer
    Wu, Qiongwei
    Zhao, Xuejun
    Gong, Bo
    ARCHIVES OF GYNECOLOGY AND OBSTETRICS, 2016, 293 (04) : 797 - 804
  • [29] Cancer initiation and progression: a biological system
    Manuel Valdespino-Gomez, Victor
    Edmundo Valdespino-Castillo, Victor
    GACETA MEXICANA DE ONCOLOGIA, 2011, 10 (06): : 358 - 365
  • [30] Robust gene signatures from microarray data using genetic algorithms enriched with biological pathway keywords
    Luque-Baena, R. M.
    Urda, D.
    Gonzalo Claros, M.
    Franco, L.
    Jerez, J. M.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 49 : 32 - 44