FUZZY kNNMODEL APPLIED TO PREDICTIVE TOXICOLOGY DATA MINING

被引:4
|
作者
Guo, Gongde [1 ]
Neagu, Daniel [1 ]
机构
[1] Univ Bradford, Dept Comp, Bradford BD7 1DP, W Yorkshire, England
基金
英国工程与自然科学研究理事会;
关键词
Fuzzy kNNModel; classification; predictive toxicology;
D O I
10.1142/S1469026805001635
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A robust method, fuzzy kNNModel, for toxicity prediction of chemical compounds is proposed. The method is based on a supervised clustering method, called kNNModel, which employs fuzzy partitioning instead of crisp partitioning to group clusters. The merits of fuzzy kNNModel are two-fold: (1) it overcomes the problems of choosing the parameter e-allowed error rate in a cluster and the parameter N - minimal number of instances covered by a cluster, for each data set; (2) it better captures the characteristics of boundary data by assigning them with different degrees of membership between 0 and 1 to different clusters. The experimental results of fuzzy kNNModel conducted on thirteen public data sets from UCI machine learning repository and seven toxicity data sets from real-world applications, are compared with the results of fuzzy c-means clustering, k-means clustering, kNN, fuzzy kNN, and kNNModel in terms of classification performance. This application shows that fuzzy kNNModel is a promising method for the toxicity prediction of chemical compounds.
引用
收藏
页码:321 / 333
页数:13
相关论文
共 50 条
  • [1] A comparative study of machine learning algorithms applied to predictive toxicology data mining
    Neagu, Daniel C.
    Guo, Gongde
    Trundle, Paul R.
    Cronin, Mark T. D.
    ATLA-ALTERNATIVES TO LABORATORY ANIMALS, 2007, 35 (01): : 25 - 32
  • [3] Predictive toxicology of chemicals and database mining
    Wang, JS
    Lai, LH
    Tang, YQ
    CHINESE SCIENCE BULLETIN, 2000, 45 (12): : 1093 - 1097
  • [4] Predictive toxicology of chemicals and database mining
    WANG Jiansuo
    ChineseScienceBulletin, 2000, (12) : 1093 - 1097
  • [5] A Minimal Coverage-based Classification Method and Its Application in Predictive Toxicology Data Mining
    Guo, Gongde
    Huang, Yu
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 1242 - +
  • [6] Predictive fuzzy reasoning method for time series stock market data mining
    Khokhar, RH
    Sap, BNM
    DATA MINING, INTRUSION DETECTION, INFORMATION ASSURANCE, AND DATA NETWORKS SECURITY 2005, 2005, 5812 : 243 - 254
  • [7] Data preprocessing in predictive data mining
    Alexandropoulos, Stamatios-Aggelos N.
    Kotsiantis, Sotiris B.
    Vrahatis, Michael N.
    KNOWLEDGE ENGINEERING REVIEW, 2019, 34
  • [8] Fuzzy machine learning and data mining
    Huellermeier, Eyke
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (04) : 269 - 283
  • [9] Towards a qAOP framework for predictive toxicology - Linking data to decisions
    Paini, Alicia
    Campia, Ivana
    Cronin, Mark T. D.
    Asturiol, David
    Ceriani, Lidia
    Exner, Thomas E.
    Gao, Wang
    Gomes, Caroline
    Kruisselbrink, Johannes
    Martens, Marvin
    Meek, M. E. Bette
    Pamies, David
    Pletz, Julia
    Scholz, Stefan
    Schuettler, Andreas
    Spinu, Nicoleta
    Villeneuve, Daniel L.
    Wittwehr, Clemens
    Worth, Andrew
    Luijten, Mirjam
    COMPUTATIONAL TOXICOLOGY, 2022, 21
  • [10] Data quality in predictive toxicology: Reproducibility of rodent carcinogenicity experiments
    Gottmann, E
    Kramer, S
    Pfahringer, B
    Helma, C
    ENVIRONMENTAL HEALTH PERSPECTIVES, 2001, 109 (05) : 509 - 514