Mining and visualizing large anticancer drug discovery databases

被引:81
|
作者
Shi, LM
Fan, Y
Lee, JK
Waltham, M
Andrews, DT
Scherf, U
Paull, KD
Weinstein, JN
机构
[1] NCI, Mol Pharmacol Lab, Div Basic Sci, NIH, Bethesda, MD 20892 USA
[2] NCI, Informat Technol Branch, Div Canc Treatment Diag & Ctr, NIH, Bethesda, MD 20892 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2000年 / 40卷 / 02期
关键词
D O I
10.1021/ci990087b
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In order to find more effective anticancer drugs, the U.S. National Cancer Institute (NCI) screens a large number of compounds in vitro against 60 human cancer cell lines from different organs of origin. About 70 000 compounds have been tested in the program since 1990, and each tested compound can be characterized by a vector (i.e., "fingerprint") of 60 anticancer activity, or -[log(GI(50))], values. GI(50) is the concentration required to inhibit cell growth by 50% compared with untreated controls. Although cell growth inhibitory activity for a single cell line is not very informative, activity patterns across the 60 cell lines can provide incisive information on the mechanisms of action of screened compounds and also on molecular targets and modulators of activity within the cancer cells. Various statistical and artificial intelligence methods, including principal component analysis, hierarchical cluster analysis,stepwise linear regression, multidimensional scaling, neural network modeling, and genetic function approximation, among others, can be used to analyze this large activity database. Mining the database can provide useful information: (a) for the development of anticancer drugs; (b) for a better understanding of the molecular pharmacology of cancer; and (c) for improvement of the drug discovery process.
引用
收藏
页码:367 / 379
页数:13
相关论文
共 50 条
  • [21] Large compound databases for structure-activity relationships studies in drug discovery
    Scior, Thomas
    Bemard, Philippe
    Medina-Franco, Jose Luis
    Maggiora, Gerald. M.
    MINI-REVIEWS IN MEDICINAL CHEMISTRY, 2007, 7 (08) : 851 - 860
  • [22] Anticancer drug discovery and development
    Colotta, Francesco
    TARGETED THERAPIES IN CANCER: MYTH OR REALITY?, 2008, 610 : 19 - 42
  • [23] Visualizing Bibliographic Databases as Graphs and Mining Potential Research Synergies
    Varlamis, Iraklis
    Tsatsaronis, George
    2011 INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2011), 2011, : 53 - 60
  • [24] Data mining and knowledge discovery in databases - An overview
    MacKinnon, MJ
    Glick, N
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 1999, 41 (03) : 255 - 275
  • [25] Data mining and knowledge discovery in databases: An overview
    Zhu, W
    RESEARCH QUARTERLY FOR EXERCISE AND SPORT, 2002, 73 (01) : A37 - A37
  • [26] From data mining to knowledge discovery in databases
    Fayyad, U
    PiatetskyShapiro, G
    Smyth, P
    AI MAGAZINE, 1996, 17 (03) : 37 - 54
  • [27] Knowledge discovery and data mining in biological databases
    Brusic, V
    Zeleznikow, J
    KNOWLEDGE ENGINEERING REVIEW, 1999, 14 (03) : 257 - 277
  • [28] Using glycome databases for drug discovery
    Aoki-Kinoshita, Kiyoko F.
    EXPERT OPINION ON DRUG DISCOVERY, 2008, 3 (08) : 877 - 890
  • [29] Mining constrained gradients in large databases
    Dong, GZ
    Han, JW
    Lam, JMW
    Pei, JA
    Wang, K
    Zou, W
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (08) : 922 - 938
  • [30] Case mining from large databases
    Yang, Q
    Cheng, H
    CASE-BASED REASONING RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2003, 2689 : 691 - 702