Efficient Classifier for Classification of Prognostic Breast Cancer Data through Data Mining Techniques

被引:0
|
作者
Jacob, Shomona Gracia [1 ,2 ]
Ramani, R. Geetha [3 ]
机构
[1] Rajalakshmi Engn Coll, Dept Comp Sci & Engn, Madras, Tamil Nadu, India
[2] Anna Univ, Madras, Tamil Nadu, India
[3] Anna Univ, Coll Engn, Dept Informat Sci & Technol, Madras, Tamil Nadu, India
来源
WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, WCECS 2012, VOL I | 2012年
关键词
Breast Cancer Prognosis; Classification; Data mining; Feature Selection; Machine Learning; DIAGNOSIS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data mining involves the process of recovering related, significant and credential information from a large collection of aggregated data. A major area of current research in data mining is the field of clinical investigations that involve disease diagnosis, prognosis and drug therapy. The objective of this paper is to identify an efficient classifier for prognostic breast cancer data. This research work involves designing a data mining framework that incorporates the task of learning patterns and rules that will facilitate the formulation of decisions in new cases. The machine learning techniques employed to train the proposed system are based on feature relevance analysis and classification algorithms. Wisconsin Prognostic Breast Cancer (WPBC) data from the UCI machine learning repository is utilized by means of data mining techniques to completely train the system on 198 individual cases, each comprising of 33 predictor values. This paper highlights the performance of feature reduction and classification algorithms on the training dataset. We evaluate the number of attributes for split in the Random tree algorithm and the confidence level and minimum size of the leaves in the C4.5 algorithm to produce 100 percent classification accuracy. Our results demonstrate that Random Tree and Quinlan's C4.5 classification algorithm produce 100 percent accuracy in the training and test phase of classification with proper evaluation of algorithmic parameters.
引用
收藏
页码:493 / 498
页数:6
相关论文
共 50 条
  • [41] Earthquakes classification using data mining techniques
    Rodriguez-Elizalde, J
    Figueroa-Nazuno, J
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTING TECHNIQUES, 2004, : 257 - 261
  • [42] Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles
    Mosquim Junior, Sergio
    de Oliveira, Juliana
    PROCEEDINGS OF THE 10TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 3: BIOINFORMATICS, 2017, : 168 - 175
  • [43] Talent Knowledge Acquisition using Data Mining Classification Techniques
    Jantan, Hamidah
    Hamdan, Abdul Razak
    Othman, Zulaiha Ali
    2011 3RD CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), 2011, : 32 - 37
  • [44] Data Mining Techniques to Support the Classification of MV Electricity Customers
    Ramos, Sergio
    Vale, Zita
    2008 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, VOLS 1-11, 2008, : 1581 - 1587
  • [45] Intelligent Breast Cancer Prediction Model Using Data Mining Techniques
    Shen, Runjie
    Yang, Yuanyuan
    Shao, Fengfeng
    2014 SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL 1, 2014, : 384 - 387
  • [46] Prediction of benign and malignant breast cancer using data mining techniques
    Chaurasia, Vikas
    Pal, Saurabh
    Tiwari, B. B.
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2018, 12 (02) : 119 - 126
  • [47] Data mining techniques in breast cancer diagnosis at the cellular–molecular level
    Jian Yang
    Dler Hussein Kadir
    Journal of Cancer Research and Clinical Oncology, 2023, 149 : 12605 - 12620
  • [48] Investigation of Website Classification Methods Based on Data Mining Techniques
    Rukavitsyn, Andrey N.
    Kupriyanov, Mikhail S.
    Shorov, Andrey V.
    Petukhov, Ilya V.
    PROCEEDINGS OF THE XIX IEEE INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND MEASUREMENTS (SCM 2016), 2016, : 333 - 336
  • [49] CLASSIFICATION OF THE FINANCIAL SUSTAINABILITY OF HEALTH INSURANCE BENEFICIARIES THROUGH DATA MINING TECHNIQUES
    Dias Pedro Reboucas, Silvia Maria
    Brandao de Oliveira, Daniele Adelaide
    Soares, Romulo Alves
    Dores Maia Ferreira, Eugenia Maria
    dos Reis de Pinto Gouveia, Maria Jose Baltazar
    JOURNAL OF SPATIAL AND ORGANIZATIONAL DYNAMICS, 2016, 4 (03): : 229 - 242
  • [50] Breast Cancer Detection in Mammogram Medical Images with Data Mining Techniques
    Kontos, Konstantinos
    Maragoudakis, Manolis
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2013, 2013, 412 : 336 - 347