Efficient Classifier for Classification of Prognostic Breast Cancer Data through Data Mining Techniques

Cited by: 0
Authors
Jacob, Shomona Gracia [1 ,2 ]
Ramani, R. Geetha [3 ]
Affiliations
[1] Rajalakshmi Engn Coll, Dept Comp Sci & Engn, Madras, Tamil Nadu, India
[2] Anna Univ, Madras, Tamil Nadu, India
[3] Anna Univ, Coll Engn, Dept Informat Sci & Technol, Madras, Tamil Nadu, India
Source
WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, WCECS 2012, VOL I | 2012
Keywords
Breast Cancer Prognosis; Classification; Data mining; Feature Selection; Machine Learning; DIAGNOSIS;
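DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract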
Data mining is the process of recovering relevant, significant and credible information from a large collection of aggregated data. A major area of current data mining research is clinical investigation, covering disease diagnosis, prognosis and drug therapy. The objective of this paper is to identify an efficient classifier for prognostic breast cancer data. The work involves designing a data mining framework that learns patterns and rules to support decisions in new cases. The machine learning techniques used to train the proposed system are based on feature relevance analysis and classification algorithms. Wisconsin Prognostic Breast Cancer (WPBC) data from the UCI Machine Learning Repository is used to train the system on 198 individual cases, each comprising 33 predictor values. This paper highlights the performance of feature reduction and classification algorithms on the training dataset. We evaluate the number of attributes per split in the Random Tree algorithm, and the confidence level and minimum leaf size in the C4.5 algorithm, to produce 100 percent classification accuracy. Our results demonstrate that the Random Tree and Quinlan's C4.5 classification algorithms produce 100 percent accuracy in the training and test phases of classification with proper evaluation of algorithmic parameters.
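The two tree parameters named in the abstract, the number of attributes considered per split and the minimum leaf size, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation, and it reproduces neither a specific Random Tree implementation nor C4.5 (there is no confidence-based pruning). The data is a synthetic stand-in with the same shape as WPBC (198 cases, 33 predictors), and the names `grow`, `k` and `min_leaf` are hypothetical.

```python
import random
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def grow(rows, labels, k, min_leaf, rng):
    """Grow a tree; each node considers k randomly chosen attributes."""
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or len(rows) < 2 * min_leaf:
        return majority  # leaf: pure node, or too small to split
    best, best_score = None, float("inf")
    for f in rng.sample(range(len(rows[0])), k):
        values = sorted({r[f] for r in rows})
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2  # candidate threshold between neighbours
            left = [y for r, y in zip(rows, labels) if r[f] <= t]
            right = [y for r, y in zip(rows, labels) if r[f] > t]
            if min(len(left), len(right)) < min_leaf:
                continue  # respect the minimum leaf size
            score = len(left) * gini(left) + len(right) * gini(right)
            if score < best_score:
                best, best_score = (f, t), score
    if best is None:
        return majority  # no admissible split among the sampled attributes
    f, t = best
    go_left = [r[f] <= t for r in rows]
    left = grow([r for r, g in zip(rows, go_left) if g],
                [y for y, g in zip(labels, go_left) if g], k, min_leaf, rng)
    right = grow([r for r, g in zip(rows, go_left) if not g],
                 [y for y, g in zip(labels, go_left) if not g], k, min_leaf, rng)
    return (f, t, left, right)

def predict(tree, row):
    """Walk from the root to a leaf and return the leaf's class label."""
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if row[f] <= t else right
    return tree

def accuracy(tree, rows, labels):
    return sum(predict(tree, r) == y for r, y in zip(rows, labels)) / len(rows)

# Synthetic stand-in (assumption): 198 cases, 33 predictors, binary outcome.
rng = random.Random(0)
X = [[rng.gauss(0, 1) for _ in range(33)] for _ in range(198)]
y = [int(r[0] + r[1] + rng.gauss(0, 0.5) > 0) for r in X]

full_tree = grow(X, y, k=6, min_leaf=1, rng=rng)     # leaves may hold one case
coarse_tree = grow(X, y, k=6, min_leaf=20, rng=rng)  # leaves hold >= 20 cases
train_acc_full = accuracy(full_tree, X, y)
train_acc_coarse = accuracy(coarse_tree, X, y)
print(train_acc_full, train_acc_coarse)
```

With `min_leaf=1` and distinct continuous attribute values, the tree keeps splitting until every leaf is pure, so it fits the training cases exactly. A larger `min_leaf` can leave impure leaves, so training accuracy may fall below 1. The values `k=6` and `min_leaf=20` are illustrative choices, not settings reported in the paper.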
Pages: 493 - 498
Page count: 6
Related papers
50 in total
  • [1] Comparing Various Classifier Techniques for Efficient Mining of Data
    Pal, Dheeraj
    Jain, Alok
    Saxena, Aradhana
    Agarwal, Vaibhav
    PROCEEDINGS OF THE INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2015, VOL 2, 2016, 439 : 191 - 202
  • [2] Breast Cancer Prediction Using Data Mining Classification Techniques
    Kazi, Abdul Karim
    Waseemullah
    Baig, Mirza Adnan
    Khan, Shahzaib
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (09) : 696 - 704
  • [3] Breast and Colon Cancer Classification from Gene Expression Profiles Using Data Mining Techniques
    AbdElNabi, Mohamed Loey Ramadan
    Jasim, Mohammed Wajeeh
    EL-Bakry, Hazem M.
    Taha, Mohamed Hamed N.
    Khalifa, Nour Eldeen M.
    SYMMETRY-BASEL, 2020, 12 (03)
  • [4] Mining of Classification Patterns in Clinical Data through Data Mining Algorithms
    Jacob, Shomona Gracia
    Ramani, R. Geetha
    PROCEEDINGS OF THE 2012 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI'12), 2012 : 997 - 1003
  • [5] Hybrid model for classification of diseases using data mining and particle swarm optimisation techniques
    Gupta, Rashmi
    Shrivas, Akhilesh Kumar
    Shukla, Ragini
    INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2023, 17 (03) : 295 - 307
  • [6] Discovery of Knowledge Patterns in Lymphographic Clinical Data through Data Mining Methods and Techniques
    Jacob, Shomona Gracia
    Ramani, R. Geetha
    Nancy, P.
    ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, VOL 3, 2013, 178 : 129 - 140
  • [7] Classification of Splice Junction DNA sequence through Data mining techniques
    Jacob, Shomona Gracia
    Ramani, R. Geetha
    Nancy, P.
    2012 INTERNATIONAL CONFERENCE ON FUTURE COMMUNICATION AND COMPUTER TECHNOLOGY (ICFCCT 2012), 2012 : 143 - 148
  • [8] Feature Relevance Analysis and Classification of Road Traffic Accident Data through Data Mining Techniques
    Shanthi, S.
    Ramani, R. Geetha
    WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, WCECS 2012, VOL I, 2012 : 122 - 127
  • [9] Comparative Analysis of Breast Cancer and Hypothyroid Dataset using Data Mining Classification Techniques
    Verma, Deepika
    Mishra, Nidhi
    2017 IEEE INTERNATIONAL CONFERENCE ON POWER, CONTROL, SIGNALS AND INSTRUMENTATION ENGINEERING (ICPCSI), 2017 : 1624 - 1626
  • [10] Survey On Mining Techniques For Breast Cancer Related Data
    Palivela, Hemant
    Yogish, H. K.
    Vijaykumar, S.
    Patil, Kalpana
    2013 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2013 : 540 - 546