IMPPAT: A curated database of Indian Medicinal Plants, Phytochemistry And Therapeutics

被引:381
作者
Mohanraj, Karthikeyan [1 ]
Karthikeyan, Bagavathy Shanmugam [1 ]
Vivek-Ananth, R. P. [1 ]
Chand, R. P. Bharath [1 ]
Aparna, S. R. [2 ]
Mangalapandi, Pattulingam [1 ]
Samal, Areejit [1 ]
机构
[1] HBNI, Inst Math Sci IMSc, Madras 600113, Tamil Nadu, India
[2] Stella Maris Coll, Madras 600086, Tamil Nadu, India
关键词
NATURAL-PRODUCTS; DRUG DISCOVERY; SMALL MOLECULES; CLASSIFICATION; PREDICTION; TOOL; VISUALIZATION; PERMEABILITY; DERIVATIVES; GENERATION;
D O I
10.1038/s41598-018-22631-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Phytochemicals of medicinal plants encompass a diverse chemical space for drug discovery. India is rich with a flora of indigenous medicinal plants that have been used for centuries in traditional Indian medicine to treat human maladies. A comprehensive online database on the phytochemistry of Indian medicinal plants will enable computational approaches towards natural product based drug discovery. In this direction, we present, IMPPAT, a manually curated database of 1742 Indian Medicinal Plants, 9596 Phytochemicals, And 1124 Therapeutic uses spanning 27074 plant-phytochemical associations and 11514 plant-therapeutic associations. Notably, the curation effort led to a non-redundant in silico library of 9596 phytochemicals with standard chemical identifiers and structure information. Using cheminformatic approaches, we have computed the physicochemical, ADMET (absorption, distribution, metabolism, excretion, toxicity) and drug-likeliness properties of the IMPPAT phytochemicals. We show that the stereochemical complexity and shape complexity of IMPPAT phytochemicals differ from libraries of commercial compounds or diversity-oriented synthesis compounds while being similar to other libraries of natural products. Within IMPPAT, we have filtered a subset of 960 potential druggable phytochemicals, of which majority have no significant similarity to existing FDA approved drugs, and thus, rendering them as good candidates for prospective drugs.
引用
收藏
页数:17
相关论文
共 105 条
[1]   KNApSAcK Family Databases: Integrated Metabolite-Plant Species Databases for Multifaceted Plant Research [J].
Afendi, Farit Mochamad ;
Okada, Taketo ;
Yamazaki, Mami ;
Hirai-Morita, Aki ;
Nakamura, Yukiko ;
Nakamura, Kensuke ;
Ikeda, Shun ;
Takahashi, Hiroki ;
Altaf-Ul-Amin, Md. ;
Darusman, Latifah K. ;
Saito, Kazuki ;
Kanaya, Shigehiko .
PLANT AND CELL PHYSIOLOGY, 2012, 53 (02) :e1
[2]  
Agarwala R, 2018, NUCLEIC ACIDS RES, V46, pD8, DOI [10.1093/nar/gks1189, 10.1093/nar/gkx1095, 10.1093/nar/gkq1172]
[3]   A new ketone and a known anticancer triterpenoid from the leaves of Onosma limitaneum [J].
Ahmad, VU ;
Kousar, F ;
Khan, A ;
Zubair, M ;
Iqbal, S ;
Tareen, RB .
HELVETICA CHIMICA ACTA, 2005, 88 (02) :309-311
[4]  
[Anonymous], 2003, MED PLANTS INDIAN TR
[5]   Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations? [J].
Bajusz, David ;
Racz, Anita ;
Heberger, Kroly .
JOURNAL OF CHEMINFORMATICS, 2015, 7
[6]  
Bickerton GR, 2012, NAT CHEM, V4, P90, DOI [10.1038/NCHEM.1243, 10.1038/nchem.1243]
[7]  
Bird S, 2009, Natural language processing with python, DOI DOI 10.5555/1717171
[8]   The Unified Medical Language System (UMLS): integrating biomedical terminology [J].
Bodenreider, O .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D267-D270
[9]   Selecting Relevant Descriptors for Classification by Bayesian Estimates: A Comparison with Decision Trees and Support Vector Machines Approaches for Disparate Data Sets [J].
Carbon-Mangels, Miriam ;
Hutter, Michael C. .
MOLECULAR INFORMATICS, 2011, 30 (10) :885-895