A novel Chinese herbal medicine clustering algorithm via artificial bee colony optimization

被引:15
作者
Han, Nan [1 ]
Qiao, Shaojie [2 ]
Yuan, Guan [3 ]
Huang, Ping [1 ]
Liu, Dingxiang [2 ]
Yue, Kun [4 ]
机构
[1] Chengdu Univ Informat Technol, Sch Management, Chengdu 610103, Sichuan, Peoples R China
[2] Chengdu Univ Informat Technol, Sch Software Engn, Chengdu 610225, Sichuan, Peoples R China
[3] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Jiangsu, Peoples R China
[4] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650500, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
Artificial bee colony; Cluster analysis; Parameter optimization; Traditional Chinese medicine; Applied intelligence; GRAPH;
D O I
10.1016/j.artmed.2019.101760
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional Chinese medicine (TCM) has become popular and been viewed as an effective clinical treatment across the world. Accordingly, there is an ever-increasing interest in performing data analysis over TCM data. Aiming to cope with the problem of excessively depending on empirical values when selecting cluster centers by traditional clustering algorithms, an improved artificial bee colony algorithm is proposed by which to automatically select cluster centers and apply it to aggregate Chinese herbal medicines. The proposed method integrates the following new techniques: (1) improving the artificial bee colony algorithm by applying a new searching strategy of neighbour nectar, (2) employing the improved artificial bee colony algorithm to optimize the parameters of the cutoff distance d(c) the local density rho(i) and the minimum distance delta(i) between the element i and any other element with higher density in the cluster algorithm by fast search and finding of density peaks (called DP algorithm) to find the optimal cluster centers, in order to clustering herbal medicines in an accurate fashion with the guarantee of runtime performance. Extensive experiments were conducted on the UCI benchmark datasets and the TCM datasets and the results verify the effectiveness of the proposed method by comparing it with classical clustering algorithms including K-means, K-mediods and DBSCAN in multiple evaluation metrics, that is, Silhouette Coefficient, Entropy, Purity, Precision, Recall and Fl-Measure. The results show that the IABC-DP algorithm outperforms other approaches with good clustering quality and accuracy on the UCI and the TCM datasets as well. In addition, it can be found that the improved artificial bee colony algorithm can effectively reduce the number of iterations when compared to the traditional bee colony algorithm. In particular, the IABC-DP algorithm is applied to cluster multi-dimensional Chinese herbal medicines and the result shows that it outperforms other clustering algorithms in clustering Chinese herbal medicines, which can contribute to a larger effort targeted at advancing the study of discovering composition rules of traditional Chinese prescriptions.
引用
收藏
页数:14
相关论文
共 30 条
[1]   Hybrid clustering analysis using improved krill herd algorithm [J].
Abualigah, Laith Mohammad ;
Khader, Ahamad Tajudin ;
Hanandeh, Essam Said .
APPLIED INTELLIGENCE, 2018, 48 (11) :4047-4071
[2]  
[Anonymous], 2018, APPL INTELLIGENCE
[3]  
[Anonymous], 2004, Ant Colony Optimization
[4]  
[Anonymous], IEEE T CYBERN
[5]  
[Anonymous], 2002, XITONG GONGCHENG LIL
[6]   Fuzzy model-based clustering and its application in image segmentation [J].
Choy, Siu Kai ;
Lam, Shu Yan ;
Yu, Kwok Wai ;
Lee, Wing Yan ;
Leung, King Tai .
PATTERN RECOGNITION, 2017, 68 :141-157
[7]   A comparative analysis of chaotic particle swarm optimizations for detecting single nucleotide polymorphism barcodes [J].
Chuang, Li-Yeh ;
Moi, Sin-Hua ;
Lin, Yu-Da ;
Yang, Cheng-Hong .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2016, 73 :23-33
[8]   SDenPeak: Semi-Supervised Nonlinear Clustering based on Density and Distance [J].
Fan, Wen-Qi ;
Wang, Chang-Dong ;
Lai, Jian-Huang .
PROCEEDINGS 2016 IEEE SECOND INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2016), 2016, :269-275
[9]   On using supervised clustering analysis to improve classification performance [J].
Gan, Haitao ;
Huang, Rui ;
Luo, Zhizeng ;
Xi, Xugang ;
Gao, Yunyuan .
INFORMATION SCIENCES, 2018, 454 :216-228
[10]   A k-means binarization framework applied to multidimensional knapsack problem [J].
Garcia, Jos ;
Crawford, Broderick ;
Soto, Ricardo ;
Castro, Carlos ;
Paredes, Fernando .
APPLIED INTELLIGENCE, 2018, 48 (02) :357-380