Pharmacogenomic analysis: Correlating molecular substructure classes with microarray gene expression data

被引:81
作者
Blower P.E. [1 ]
Yang C. [1 ]
Fligner M.A. [2 ]
Verducci J.S. [2 ]
Yu L. [1 ]
Richman S. [3 ]
Weinstein J.N. [3 ]
机构
[1] Leadscope Inc., Columbus, OH
[2] Department of Statistics, Ohio State University, Columbus, OH
[3] Laboratory of Molecular Pharmacology, Division of Basic Sciences, National Cancer Institute, National Institutes of Health, Bethesda, MD
关键词
Molecular class; NCI cell lines; Statistical correlation;
D O I
10.1038/sj.tpj.6500116
中图分类号
学科分类号
摘要
Genomic studies are producing large databases of molecular information on and other cell and tissue types. Hence, we have the opportunity to link these accumulating data to the drug discovery processes. Our previous efforts at 'information-intensive' molecular pharmacology have focused on the relationship between patterns of gene expression and patterns of drug activity. In the present study, we take the process a step further-relating gene expression patterns, not just to the drugs as entities, but to ∼27 000 substructures and other chemical features within the drugs. This coupling of genomic information with structure-based data mining can be used to identify classes of compounds for which detailed experimental structure-activity studies may be fruitful. Using a systematic substructure analysis coupled with statistical correlations of compound activity with differential gene expression, we have identified two subclasses of quinones whose patterns of activity in the National Cancer Institute's 60-cell line screening panel (NCI-60) correlate strongly with the expression patterns of particular genes: (i) The growth inhibitory patterns of an electron-withdrawing subclass of benzodithiophenedione-containing compounds over the NCI-60 are highly correlated with the expression patterns of Rab7 and other melanoma-specific genes; (ii) the inhibitory patterns of indolonaphthoquinone-containing compounds are highly correlated with the expression patterns of the hematopoietic lineage-specific gene HS1 and other leukemia genes. As illustrated by these proof-of-principle examples, we introduce here a set of conceptual tools and fluent computational methods for projecting directly from gene expression patterns to drug substructures and vice versa. The analysis is presented in terms of the NCI-60 cell lines and microarray- based gene expression patterns, but the concept and methods are broadly applicable to other large-scale pharmacogenomic database sets as well. The approach (SAT for Structure-Activity-Target) provides a systematic way to mine databases for the design of further structure-activity studies, particularly to aid in target and lead identification.
引用
收藏
页码:259 / 271
页数:12
相关论文
共 59 条
  • [1] Ross D.T., Scherf U., Eisen M.B., Perou C.M., Rees C., Spellman P., Et al., Systematic variation in gene expression patterns in human cancer cell lines, Nat. Genet, 24, pp. 227-235, (2000)
  • [2] Scherf U., Ross D.T., Waltham M., Smith L.H., Lee J.K., Tanabe L., Et al., A gene expression database for the molecular pharmacology of cancer, Nat. Genet, 24, pp. 236-244, (2000)
  • [3] Boyd M.R., Paull K.D., Some practical consideration and applications of the National Cancer Institute in vitro anti-cancer drug discovery screen, Drug Dev. Des, 34, pp. 91-109, (1995)
  • [4] Monks A.P., Scudiero D.A., Johnson G.S., Paull K.D., Sausville E.A., The NCI anti-cancer drug screen: A smart screen to identify effectors of novel targets, Anti-Cancer Drug Des, 12, pp. 533-541, (1997)
  • [5] Paull K.D., Shoemaker R.H., Hodes L., Monks A., Scudiero D.A., Rubinstein L., Et al., Display and analysis of patterns of differential activity of drugs against human tumor cell lines: Development of mean graph and COMPARE algorithm, J. Natl. Cancer Inst, 81, pp. 1088-1092, (1989)
  • [6] Weinstein J.N., Myers T.G., O'Connor P.M., Friend S.H., Fornace A.J., Kohn K.W., Et al., An information intensive approach to the molecular pharmacology of cancer, Science, 275, pp. 343-349, (1997)
  • [7] Weinstein J.N., Kohn K.W., Grever M.R., Viswanadhan V.N., Rubinstein L.V., Monks A.P., Et al., Neural computing in cancer drug development: Predicting mechanism of action, Science, 258, pp. 447-451, (1992)
  • [8] Paull K.D., Hamel E., Malspeis L., Prediction of biochemical mechanism of action from the in vitro antitumor screen of the National Cancer Institute, Cancer Chemotherapeutic Agents, pp. 1574-1581, (1993)
  • [9] Weinstein I.N., Myers T.G., Buolamwini J.K., Raghavan K., Van Osdol W., Licht J., Et al., Predictive statistics and artificial intelligence in the US National Cancer Institutes drug discovery program for cancer and AIDS, Stem. Cells, 12, pp. 13-22, (1994)
  • [10] Shi L.M., Myers T.G., Fan Y., O'Connor P.M., Paull K.D., Friend S.H., Et al., Mining the National Cancer Institute anticancer drug discovery database: Cluster analysis of ellipticine analogs with p53-inverse and central nervous system-selective patterns of activity, Mol. Pharmacol, 53, pp. 241-251, (1998)