Automated multidimensional phenotypic profiling using large public microarray repositories

被引:20
作者
Xu, Min [1 ]
Li, Wenyuan [1 ]
James, Gareth M. [2 ]
Mehan, Michael R. [1 ]
Zhou, Xianghong Jasmine [1 ]
机构
[1] Univ So Calif, Dept Biol Sci, Los Angeles, CA 90089 USA
[2] Univ So Calif, Marshall Sch Business, Los Angeles, CA 90089 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
genotype-phenotype association; phenotype prediction; phenotype profiling; REFRACTORY-ANEMIA; PHENOME; LEUKEMIA; NETWORK;
D O I
10.1073/pnas.0900883106
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Phenotypes are complex, and difficult to quantify in a high-throughput fashion. The lack of comprehensive phenotype data can prevent or distort genotype-phenotype mapping. Here, we describe "PhenoProfiler,'' a computational method that enables in silico phenotype profiling. Drawing on the principle that similar gene expression patterns are likely to be associated with similar phenotype patterns, PhenoProfiler supplements the missing quantitative phenotype information for a given microarray dataset based on other well-characterized microarray datasets. We applied our method to 587 human microarray datasets covering >14,000 samples, and confirmed that the predicted phenotype profiles are highly consistent with true phenotype descriptions. PhenoProfiler offers several unique capabilities: (i) automated, multidimensional phenotype profiling, facilitating the analysis and treatment design of complex diseases; (ii) the extrapolation of phenotype profiles beyond provided classes; and (iii) the detection of confounding phenotype factors that could otherwise bias biological inferences. Finally, because no direct comparisons are made between gene expression values from different datasets, the method can use the entire body of cross-platform microarray data. This work has produced a compendium of phenotype profiles for the National Center for Biotechnology Information GEO datasets, which can facilitate an unbiased understanding of the transcriptome-phenome mapping. The continued accumulation of microarray data will further increase the power of PhenoProfiler, by increasing the variety and the quality of phenotypes to be profiled.
引用
收藏
页码:12323 / 12328
页数:6
相关论文
共 18 条
[1]   Differences between refractory anemia with excess blasts in transformation and acute myeloid leukemia [J].
Albitar, M ;
Beran, M ;
O'Brien, S ;
Kantarjian, H ;
Frieriech, E ;
Keating, M ;
Estey, E .
BLOOD, 2000, 96 (01) :372-373
[2]  
ARONSON AR, 2001, P AMIA S, V17, P17
[3]   Clinical importance of transforming growth factor-β but not of tumor necrosis factor-α gene polymorphisms in patients with the myelodysplastic syndrome belonging to the refractory anemia subtype [J].
Balog, A ;
Borbényi, Z ;
Gyulai, Z ;
Molnár, L ;
Mándi, Y .
PATHOBIOLOGY, 2005, 72 (03) :165-170
[4]   NCBI GEO: mining tens of millions of expression profiles - database and tools update [J].
Barrett, Tanya ;
Troup, Dennis B. ;
Wilhite, Stephen E. ;
Ledoux, Pierre ;
Rudnev, Dmitry ;
Evangelista, Carlos ;
Kim, Irene F. ;
Soboleva, Alexandra ;
Tomashevsky, Maxim ;
Edgar, Ron .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D760-D765
[5]  
Bazaraa MS., 2013, Nonlinear programming: theory and algorithms
[6]   The Unified Medical Language System (UMLS): integrating biomedical terminology [J].
Bodenreider, O .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D267-D270
[7]   Metagenes and molecular pattern discovery using matrix factorization [J].
Brunet, JP ;
Tamayo, P ;
Golub, TR ;
Mesirov, JP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (12) :4164-4169
[8]   From syndrome families to functional genomics [J].
Brunner, HG ;
van Driel, MA .
NATURE REVIEWS GENETICS, 2004, 5 (07) :545-551
[9]   Creation and implications of a phenome-genome network [J].
Butte, AJ ;
Kohane, IS .
NATURE BIOTECHNOLOGY, 2006, 24 (01) :55-62
[10]   Acute promyelocytic leukemia with t(15;17): frequency of additional clonal chromosome abnormalities and FLT3 mutations [J].
Chauffaille, Maria De Lourdes ;
Borri, Daniela ;
Proto-Siqueira, Rodrigo ;
Moreira, Eloisa S. ;
Alberto, Fernando L. .
LEUKEMIA & LYMPHOMA, 2008, 49 (12) :2387-2389