Improving Breast Cancer Diagnosis Accuracy by Particle Swarm Optimization Feature Selection

Cited by: 0
Authors
Reihane Kazerani
Affiliation
[1] Shiraz University, Information Technology, Management Information System
Source
International Journal of Computational Intelligence Systems, Vol. 17
Keywords
Breast cancer; Feature selection; Particle swarm optimization; Machine learning; Classification accuracy; UCI repository;
DOI
Not available
Abstract
Breast cancer is one of the leading causes of death among women worldwide. Early detection can save patients' lives and reduce mortality. Because diagnosis involves a large number of features, the diagnostic process can be time-consuming. To reduce cost and time and to improve diagnostic accuracy, this paper proposes a feature selection algorithm based on particle swarm optimization (PSO), combined with machine learning methods, that selects the most effective features for breast cancer diagnosis from the full feature set. To evaluate the proposed method, it was tested on three of the most common breast cancer datasets available in the University of California, Irvine (UCI) repository: the Coimbra dataset (CD), the Wisconsin Diagnostic Breast Cancer dataset (WDBC), and the Wisconsin Prognostic Breast Cancer dataset (WPBC). On the Coimbra dataset, using all 9 features without PSO feature selection, the highest accuracy was 87%, obtained with the Support Vector Machine method; with PSO feature selection, accuracy reached 91% and the number of features was reduced from 9 to 4. On the WDBC dataset, using all 30 features without PSO feature selection, the highest accuracy was 99%, obtained with the Random Forest method; with PSO feature selection, accuracy reached 100% and the number of features was reduced from 30 to 19. On the WPBC dataset, using all 33 features without PSO feature selection, the highest accuracy was 94%, obtained with the Support Vector Machine method; with PSO feature selection, accuracy reached 96% and the number of features was reduced from 33 to 17. These results indicate that the proposed PSO-based feature selection algorithm can improve the accuracy of breast cancer diagnosis while selecting fewer, more effective features than the full feature sets of the original datasets.
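The abstract does not give the details of the PSO variant or the fitness function, so the following is only a minimal sketch of how PSO-driven feature selection is commonly set up, not the paper's implementation. It assumes a binary PSO with a sigmoid transfer function, a nearest-centroid classifier as a lightweight stand-in for the SVM and Random Forest models mentioned above, synthetic data in place of the UCI datasets, and a hypothetical penalty of 0.01 per selected feature. All function names and parameter values (`pso_select`, `w`, `c1`, `c2`, swarm size) are illustrative assumptions.

```python
import math
import random

def make_data(n, n_features, n_informative, rng):
    """Synthetic two-class data: only the first n_informative features carry signal."""
    X, y = [], []
    for i in range(n):
        label = i % 2
        X.append([rng.gauss(2.0 * label if j < n_informative else 0.0, 1.0)
                  for j in range(n_features)])
        y.append(label)
    return X, y

def centroid_accuracy(X_tr, y_tr, X_te, y_te, mask):
    """Held-out accuracy of a nearest-centroid classifier on the masked features."""
    idx = [j for j, m in enumerate(mask) if m]
    if not idx:
        return 0.0  # an empty feature subset cannot classify anything
    cents = {}
    for c in (0, 1):
        rows = [x for x, t in zip(X_tr, y_tr) if t == c]
        cents[c] = [sum(r[j] for r in rows) / len(rows) for j in idx]
    correct = 0
    for x, t in zip(X_te, y_te):
        dist = {c: sum((x[j] - m) ** 2 for j, m in zip(idx, cents[c])) for c in cents}
        correct += (min(dist, key=dist.get) == t)
    return correct / len(y_te)

def pso_select(fitness, n_features, n_particles=12, n_iter=30, seed=0,
               w=0.7, c1=1.5, c2=1.5):
    """Binary PSO: each particle is a 0/1 mask; returns the best mask and its fitness."""
    rng = random.Random(seed)
    pos = [[rng.randint(0, 1) for _ in range(n_features)] for _ in range(n_particles)]
    vel = [[rng.uniform(-1, 1) for _ in range(n_features)] for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_fit = [fitness(p) for p in pos]
    g = max(range(n_particles), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = pbest[g][:], pbest_fit[g]
    for _ in range(n_iter):
        for i in range(n_particles):
            for j in range(n_features):
                # Velocity update: inertia + pull toward personal and global bests.
                v = (w * vel[i][j]
                     + c1 * rng.random() * (pbest[i][j] - pos[i][j])
                     + c2 * rng.random() * (gbest[j] - pos[i][j]))
                vel[i][j] = max(-6.0, min(6.0, v))  # clamp to keep sigmoid stable
                # Sigmoid transfer: velocity becomes the probability of selecting bit j.
                prob = 1.0 / (1.0 + math.exp(-vel[i][j]))
                pos[i][j] = 1 if rng.random() < prob else 0
            f = fitness(pos[i])
            if f > pbest_fit[i]:
                pbest[i], pbest_fit[i] = pos[i][:], f
                if f > gbest_fit:
                    gbest, gbest_fit = pos[i][:], f
    return gbest, gbest_fit

# Usage on synthetic data (10 features, 3 informative); values are illustrative only.
data_rng = random.Random(42)
X, y = make_data(200, 10, 3, data_rng)
X_tr, y_tr, X_te, y_te = X[:140], y[:140], X[140:], y[140:]

def fitness(mask):
    # Assumed objective: held-out accuracy minus a small penalty per selected feature.
    return centroid_accuracy(X_tr, y_tr, X_te, y_te, mask) - 0.01 * sum(mask)

mask, score = pso_select(fitness, n_features=10)
print("selected features:", [j for j, m in enumerate(mask) if m], "fitness:", score)
```

The per-feature penalty is one simple way to express the trade-off the abstract reports, where accuracy rises while the feature count falls; a real study would replace the stand-in classifier and synthetic data with the actual models and cross-validated accuracy on the datasets.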