PubChem and ChEMBL beyond Lipinski

被引:37
作者
Capecchi, Alice [1 ]
Awale, Mahendra [1 ]
Probst, Daniel [1 ]
Reymond, Jean-Louis [1 ]
机构
[1] Univ Bern, Dept Chem & Biochem, Freiestr 3, CH-3012 Bern, Switzerland
基金
瑞士国家科学基金会;
关键词
chemical space; visualization; fingerprints; databases; biomolecules; CHEMICAL SPACE; VISUALIZATION; DATABASE; DISCOVERY; DRUGS; RULE; CLASSIFICATION; EXPLORATION; MOLECULES; PEPTIDES;
D O I
10.1002/minf.201900016
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Seven million of the currently 94 million entries in the PubChem database break at least one of the four Lipinski constraints for oral bioavailability, 183,185 of which are also found in the ChEMBL database. These non-Lipinski PubChem (NLP) and ChEMBL (NLC) subsets are interesting because they contain new modalities that can display biological properties not accessible to small molecule drugs. Unfortunately, the current search tools in PubChem and ChEMBL are designed for small molecules and are not well suited to explore these subsets, which therefore remain poorly appreciated. Herein we report MXFP (macromolecule extended atom-pair fingerprint), a 217-D fingerprint tailored to analyze large molecules in terms of molecular shape and pharmacophores. We implement MXFP in two web-based applications, the first one to visualize NLP and NLC interactively using Faerun (http://faerun.gdb.tools/), the second one to perform MXFP nearest neighbor searches in NLP and NLC (http://similaritysearch.gdb.tools/). We show that these tools provide a meaningful insight into the diversity of large molecules in NLP and NLC. The interactive tools presented here are publicly available at http://gdb.unibe.ch and can be used freely to explore and better understand the diversity of non-Lipinski molecules in PubChem and ChEMBL.
引用
收藏
页数:11
相关论文
共 56 条
[1]   WebMolCS: A Web-Based Interface for Visualizing Molecules in Three-Dimensional Chemical Spaces [J].
Awale, Mahendra ;
Probst, Daniel ;
Reymond, Jean-Louis .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2017, 57 (04) :643-649
[2]   The polypharmacology browser: a web-based multi-fingerprint target prediction tool using ChEMBL bioactivity data [J].
Awale, Mahendra ;
Reymond, Jean-Louis .
JOURNAL OF CHEMINFORMATICS, 2017, 9
[3]   Web-based 3D-visualization of the DrugBank chemical space [J].
Awale, Mahendra ;
Reymond, Jean-Louis .
JOURNAL OF CHEMINFORMATICS, 2016, 8
[4]   Similarity Mapplet: Interactive Visualization of the Directory of Useful Decoys and ChEMBL in High Dimensional Chemical Spaces [J].
Awale, Mahendra ;
Reymond, Jean-Louis .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (08) :1509-1516
[5]   Stereoselective virtual screening of the ZINC database using atom pair 3D-fingerprints [J].
Awale, Mahendra ;
Jin, Xian ;
Reymond, Jean-Louis .
JOURNAL OF CHEMINFORMATICS, 2015, 7
[6]   A multi-fingerprint browser for the ZINC database [J].
Awale, Mahendra ;
Reymond, Jean-Louis .
NUCLEIC ACIDS RESEARCH, 2014, 42 (W1) :W234-W239
[7]   MQN-Mapplet: Visualization of Chemical Space with Interactive Maps of Drug Bank, ChEMBL, Pub Chem, GDB-11, and GDB-13 [J].
Awale, Mahendra ;
van Deursen, Ruud ;
Reymond, Jean-Louis .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (02) :509-518
[8]   Similarity analysis, synthesis, and bioassay of antibacterial cyclic peptidomimetics [J].
Berhanu, Workalemahu M. ;
Ibrahim, Mohamed A. ;
Pillai, Girinath G. ;
Oliferenko, Alexander A. ;
Khelashvili, Levan ;
Jabeen, Farukh ;
Mirza, Bushra ;
Ansari, Farzana Latif ;
ul-Haq, Ihsan ;
El-Feky, Said A. ;
Katritzky, Alan R. .
BEILSTEIN JOURNAL OF ORGANIC CHEMISTRY, 2012, 8 :1146-1160
[9]   The exopolysaccharide properties and structures database: EPS-DB. Application to bacterial exopolysaccharides [J].
Birch, Johnny ;
Van Calsteren, Marie-Rose ;
Perez, Serge ;
Svensson, Birte .
CARBOHYDRATE POLYMERS, 2019, 205 :565-570
[10]   Visualisation and subsets of the chemical universe database GDB-13 for virtual screening [J].
Blum, Lorenz C. ;
van Deursen, Ruud ;
Reymond, Jean-Louis .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2011, 25 (07) :637-647