PREDICTIVE STATISTICS AND ARTIFICIAL-INTELLIGENCE IN THE US-NATIONAL-CANCER-INSTITUTES-DRUG-DISCOVERY-PROGRAM-FOR-CANCER-AND-AIDS

被引:55
作者
WEINSTEIN, JN
MYERS, T
BUOLAMWINI, J
RAGHAVAN, K
VANOSDOL, W
LICHT, J
VISWANADHAN, VN
KOHN, KW
RUBINSTEIN, LV
KOUTSOUKOS, AD
MONKS, A
SCUDIERO, DA
ANDERSON, NL
ZAHAREVITZ, D
CHABNER, BA
GREVER, MR
PAULL, KD
机构
[1] NCI,DCT,CANC THERAPY EVALUAT PROGRAM,BIOMETR RES BRANCH,FREDERICK,MD
[2] NCI,FCRDC,PROGRAM RESOURCES INC,DYNCORP,FREDERICK,MD
[3] LARGE SCALE BIOL INC,ROCKVILLE,MD
[4] NCI,DCT,DTP,INFORMAT TECHNOL BRANCH,SAN DIEGO,CA
关键词
CANCER; AIDS; HIV; THERAPY; DRUG DISCOVERY; ARTIFICIAL INTELLIGENCE; NEURAL NETWORK; STATISTICS; DISCOVERY; CYTOTOXICITY;
D O I
10.1002/stem.5530120106
中图分类号
Q813 [细胞工程];
学科分类号
摘要
The National Cancer Institute's drug discovery program screens more than 20,000 chemical compounds and natural products a year for activity against a panel of 60 tumor cell lines in vitro. The result is an information-rich database of patterns that form the basis for what we term an ''information-intensive'' approach to the process of drug discovery. The first step was a demonstration, both by statistical methods (including the program COMPARE) and by neural networks, that patterns of activity in the screen can be used to predict a compound's mechanism of action. Given this finding, the overall plan has been to develop three large matrices of information: the first (designated A) gives the pattern of activity for each compound tested against each cell line in the screen; the second (S) encodes any of a number of types of 2-D or 3-D structural motifs for each compound; the third (T) indicates each cell's expression of molecular targets (e.g., from 2-dimensional protein gel electrophoresis). Construction and updating of these matrices is an ongoing process. The matrices can be concatenated in various ways to test a variety of specific hypotheses about compounds screened, as well as to ''prioritize'' candidate compounds for testing. To aid in these efforts, we have developed the DISCOVERY program package, which integrates the matrix data for visual pattern recognition. The ''information-intensive'' approach summarized here in some senses serves to bridge the perceived gap between screening and structure-based drug design.
引用
收藏
页码:13 / 22
页数:10
相关论文
共 20 条
[1]  
ALLEY MC, 1988, CANCER RES, V48, P589
[2]  
ALVAREZ M, UNPUB J BIOL CHEM
[3]   A 2-DIMENSIONAL GEL DATABASE OF RAT-LIVER PROTEINS USEFUL IN GENE-REGULATION AND DRUG EFFECTS STUDIES [J].
ANDERSON, NL ;
ESQUERBLASCO, R ;
HOFMANN, JP ;
ANDERSON, NG .
ELECTROPHORESIS, 1991, 12 (11) :907-930
[4]  
BAI R, 1991, J BIOL CHEM, V266, P15882
[5]  
Boyd M.K., 1989, CANCER PRINCIPLES PR, V3, P1
[6]  
CHABNER BA, IN PRESS P ANTICANCE
[7]  
Dayhoff JE., 1990, NEURAL NETWORK ARCHI
[8]  
HODES I, 1992, J BIOPHARM STATISTIC, V2, P31
[9]  
KHANNA T, 1991, F NEURAL NETWORKS
[10]  
KOUTSOUKOS AD, IN PRESS STATISTICS