FUZZY kNNMODEL APPLIED TO PREDICTIVE TOXICOLOGY DATA MINING

被引:4
作者
Guo, Gongde [1 ]
Neagu, Daniel [1 ]
机构
[1] Univ Bradford, Dept Comp, Bradford BD7 1DP, W Yorkshire, England
基金
英国工程与自然科学研究理事会;
关键词
Fuzzy kNNModel; classification; predictive toxicology;
D O I
10.1142/S1469026805001635
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A robust method, fuzzy kNNModel, for toxicity prediction of chemical compounds is proposed. The method is based on a supervised clustering method, called kNNModel, which employs fuzzy partitioning instead of crisp partitioning to group clusters. The merits of fuzzy kNNModel are two-fold: (1) it overcomes the problems of choosing the parameter e-allowed error rate in a cluster and the parameter N - minimal number of instances covered by a cluster, for each data set; (2) it better captures the characteristics of boundary data by assigning them with different degrees of membership between 0 and 1 to different clusters. The experimental results of fuzzy kNNModel conducted on thirteen public data sets from UCI machine learning repository and seven toxicity data sets from real-world applications, are compared with the results of fuzzy c-means clustering, k-means clustering, kNN, fuzzy kNN, and kNNModel in terms of classification performance. This application shows that fuzzy kNNModel is a promising method for the toxicity prediction of chemical compounds.
引用
收藏
页码:321 / 333
页数:13
相关论文
共 50 条
[21]   Mining fuzzy association rules from uncertain data [J].
Weng, Cheng-Hsiung ;
Chen, Yen-Liang .
KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 23 (02) :129-152
[22]   Caenorhabditis elegans for predictive toxicology [J].
Hunt, Piper Reid ;
Camacho, Jessica A. ;
Sprando, Robert L. .
CURRENT OPINION IN TOXICOLOGY, 2020, 23-24 :23-28
[23]   Emerging Approaches in Predictive Toxicology [J].
Zhang, Luoping ;
McHale, Cliona M. ;
Greene, Nigel ;
Snyder, Ronald D. ;
Rich, Ivan N. ;
Aardema, Marilyn J. ;
Roy, Shambhu ;
Pfuhler, Stefan ;
Venkatactahalam, Sundaresan .
ENVIRONMENTAL AND MOLECULAR MUTAGENESIS, 2014, 55 (09) :679-688
[24]   Data quality in predictive toxicology: Identification of chemical structures and calculation of chemical properties [J].
Helma, C ;
Kramer, S ;
Pfahringer, B ;
Gottmann, E .
ENVIRONMENTAL HEALTH PERSPECTIVES, 2000, 108 (11) :1029-1033
[25]   DATA MINING AND OPTIMIZATION APPLIED TO RAMAN SPECTROSCOPY FOR ONCOLOGY APPLICATIONS [J].
Fenn, Michael B. ;
Pappu, Vijay ;
Xanthopoulos, Petros ;
Pardalos, Panos M. .
BIOMAT 2011: INTERNATIONAL SYMPOSIUM ON MATHEMATICAL AND COMPUTATIONAL BIOLOGY, 2012, :84-108
[26]   Prediction Models Applied to Lung Cancer Using Data Mining [J].
Sousa, Rita ;
Sousa, Regina ;
Peixoto, Hugo ;
Machado, Jose .
INTELLIGENT DISTRIBUTED COMPUTING XV, IDC 2022, 2023, 1089 :195-200
[27]   EVALUATION OF PREDICTIVE DATA MINING ALGORITHMS IN STUDENT ACADEMIC PERFORMANCE [J].
Jidagam, Rohith ;
Rizk, Nouhad .
INTED2016: 10TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE, 2016, :6314-6324
[28]   A continuous ant colony system framework for fuzzy data mining [J].
Wu, Min-Thai ;
Hong, Tzung-Pei ;
Lee, Chung-Nan .
SOFT COMPUTING, 2012, 16 (12) :2071-2082
[29]   Application of fuzzy cluster analysis for medical image data mining [J].
Wang, Shuyan ;
Zhou, Mingquan ;
Geng, Guohua .
2005 IEEE International Conference on Mechatronics and Automations, Vols 1-4, Conference Proceedings, 2005, :631-636
[30]   A novel Neuro-fuzzy classification technique for data mining [J].
Ghosh, Soumadip ;
Biswas, Sushanta ;
Sarkar, Debasree ;
Sarkar, Partha Pratim .
EGYPTIAN INFORMATICS JOURNAL, 2014, 15 (03) :129-147