Prediction of the taxonomical classification of the Ranunculaceae family using a machine learning method

被引:4
作者
Chen, Jiao [1 ]
Yang, Wenlu [2 ]
Tan, Guodong [1 ]
Tian, Chunyao [1 ]
Wang, Hongjun [2 ]
Zhou, Jiayu [1 ]
Liao, Hai [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Life Sci & Engn, Chengdu 610031, Sichuan, Peoples R China
[2] Southwest Jiaotong Univ, Inst Artificial Intelligence, Chengdu 610031, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
EVOLUTION; PLANTS;
D O I
10.1039/d1nj03632g
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Ranunculaceae is a botanical source for various pharmaceutically active compounds, which has been commonly utilized in traditional Chinese medicine. Increasing interest in Ranunculaceae pharmaceutical resources has led to a taxonomical study of this family, which might provide new insight to understand its diversification, relationship and phylogenetic position, and further to find new medicinal resources and promising compounds. In this study, we used the machine learning method to explore the classification of the medicinal Ranunculaceae family. 204 species representing 17 genera of the Ranunculaceae family were collected from the TCMID with their 1280 active compounds composed of structure-based fingerprints. After the construction of species-compound and genus-compound matrices, CNNs and Ext fingerprints were determined as the best machine learning method and fingerprint type using ACC and F-score as clustering criteria, respectively. We found that taxonomical classification within the Ranunculaceae family could be accurately predicted, especially at the genus level with a top ACC of 0.86 and an F-score of 0.85. The top features of compounds that were important for the classification of 17 genera were also identified, and thus some genera with high medicinal values were associated with characteristic cis and (or) trans features. As far as we know, this is the first time that some genera are found to be associated with the structural features of compounds.
引用
收藏
页码:5150 / 5161
页数:12
相关论文
共 51 条
[1]   Diterpenoid alkaloids from Aconitum barbatum var. puberulum Ledeb [J].
Ablajan, Nurfida ;
Zhao, Bo ;
Zhao, Jiang-Yu ;
Wang, Bian-Lin ;
Sagdullaev, Sh Sh ;
Aisa, H. A. .
PHYTOCHEMISTRY, 2021, 181
[2]   Not that kind of tree: Assessing the potential for decision tree-based plant identification using trait databases [J].
Almeida, Brianna K. ;
Garg, Manish ;
Kubat, Miroslav ;
Afkhami, Michelle E. .
APPLICATIONS IN PLANT SCIENCES, 2020, 8 (07)
[3]   Evolution of alkaloid biosynthesis in the genus Narcissus [J].
Berkov, Strahil ;
Martinez-Frances, Vanessa ;
Bastida, Jaume ;
Codina, Caries ;
Rios, Segundo .
PHYTOCHEMISTRY, 2014, 99 :95-106
[4]   Decision Tree and Ensemble Learning Algorithms with Their Applications in Bioinformatics [J].
Che, Dongsheng ;
Liu, Qi ;
Rasheed, Khaled ;
Tao, Xiuping .
SOFTWARE TOOLS AND ALGORITHMS FOR BIOLOGICAL SYSTEMS, 2011, 696 :191-199
[5]   Bioactive triterpenoids from Sambucus java']javanica Blume [J].
Chen, Feilong ;
Liu, Dong-Li ;
Wang, Wei ;
Lv, Xiao-Man ;
Li, Weixi ;
Shao, Li-Dong ;
Wang, Wen-Jing .
NATURAL PRODUCT RESEARCH, 2020, 34 (19) :2816-2821
[6]   Unsupervised learning of Gaussian mixtures based on variational component splitting [J].
Constantinopoulos, Constantinos ;
Likas, Aristidis .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, 18 (03) :745-755
[7]   Genetics of flower development in Ranunculales - a new, basal eudicot model order for studying flower evolution [J].
Damerval, Catherine ;
Becker, Annette .
NEW PHYTOLOGIST, 2017, 216 (02) :361-366
[8]   Evolutionary relationships in the medicinally important genus Fritillaria L. (Liliaceae) [J].
Day, Peter D. ;
Berger, Madeleine ;
Hill, Laurence ;
Fay, Michael F. ;
Leitch, Andrew R. ;
Leitch, Ilia J. ;
Kelly, Laura J. .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 2014, 80 :11-19
[9]   Simultaneous data pre-processing and SVM classification model selection based on a parallel genetic algorithm applied to spectroscopic data of olive oils [J].
Devos, Olivier ;
Downey, Gerard ;
Duponchel, Ludovic .
FOOD CHEMISTRY, 2014, 148 :124-130
[10]   In silico polypharmacology of natural products [J].
Fang, Jiansong ;
Liu, Chuang ;
Wang, Qi ;
Lin, Ping ;
Cheng, Feixiong .
BRIEFINGS IN BIOINFORMATICS, 2018, 19 (06) :1153-1171