Review on natural products databases: where to find data in 2020

被引:277
作者
Sorokina, Maria [1 ]
Steinbeck, Christoph [1 ]
机构
[1] Univ Friedrich Schiller, Lessing Str 8, D-07743 Jena, Germany
关键词
Natural products; Databases; Traditional medicines; Drug discovery; TRADITIONAL CHINESE MEDICINE; 3-DIMENSIONAL STRUCTURE DATABASE; DRUG DISCOVERY; CURATED DATABASE; A DATABASE; RESOURCE; PLANTS; CHEMISTRY; DEREPLICATION; PLATFORM;
D O I
10.1186/s13321-020-00424-9
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Natural products (NPs) have been the centre of attention of the scientific community in the last decencies and the interest around them continues to grow incessantly. As a consequence, in the last 20 years, there was a rapid multiplication of various databases and collections as generalistic or thematic resources for NP information. In this review, we establish a complete overview of these resources, and the numbers are overwhelming: over 120 different NP databases and collections were published and re-used since 2000. 98 of them are still somehow accessible and only 50 are open access. The latter include not only databases but also big collections of NPs published as supplementary material in scientific publications and collections that were backed up in the ZINC database for commercially-available compounds. Some databases, even published relatively recently are already not accessible anymore, which leads to a dramatic loss of data on NPs. The data sources are presented in this manuscript, together with the comparison of the content of open ones. With this review, we also compiled the open-access natural compounds in one single dataset a COlleCtion of Open NatUral producTs (COCONUT), which is available on Zenodo and contains structures and sparse annotations for over 400,000 non-redundant NPs, which makes it the biggest open collection of NPs available to this date.
引用
收藏
页数:51
相关论文
共 140 条
[1]   SuperSweet-a resource on natural and artificial sweetening agents [J].
Ahmed, Jessica ;
Preissner, Saskia ;
Dunkel, Mathias ;
Worth, Catherine L. ;
Eckert, Andreas ;
Preissner, Robert .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D377-D382
[2]   A systematic comparison of the MetaCyc and KEGG pathway databases [J].
Altman, Tomer ;
Travers, Michael ;
Kothari, Anamika ;
Caspi, Ron ;
Karp, Peter D. .
BMC BIOINFORMATICS, 2013, 14
[3]   BIOFACQUIM: A Mexican Compound Database of Natural Products [J].
Angelica Pilon-Jimenez, B. ;
Saldivar-Gonzalez, Fernanda, I ;
Diaz-Eufracio, Barbara, I ;
Medina-Franco, Jose L. .
BIOMOLECULES, 2019, 9 (01)
[4]  
[Anonymous], ARXIV11117183QBIO
[5]  
[Anonymous], MOLECULES, DOI DOI 10.3390/MOLECULES21050573
[6]  
[Anonymous], PRESTW PHYT LIB COLL
[7]  
[Anonymous], AMB GREENPH NAT COMP
[8]  
[Anonymous], MTBLS999 DATABASE HI
[9]  
[Anonymous], LOPAC1280 LIB PHARM
[10]  
[Anonymous], LIST NATURAL PRODUCT