Breast Cancer Subtypes Classification with Hybrid Machine Learning Model

Cited by: 5
Authors
Sarkar, Suvobrata [1]
Mali, Kalyani [2]
Affiliations
[1] Dr BC Roy Engn Coll, Dept Comp Sci & Engn, Durgapur 713206, W Bengal, India
[2] Univ Kalyani, Dept Comp Sci & Engn, Kalyani, W Bengal, India
Keywords
triple-negative breast cancer; clinicopathological parameters; hybrid machine learning models; classification; genetic algorithm; support vector machine; SUPPORT VECTOR MACHINES; FEATURE-SELECTION; GENE SELECTION; PREDICTION; ALGORITHM; WOMEN;
DOI
10.1055/s-0042-1751043
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Background: Breast cancer is the most prevalent heterogeneous disease among women, characterized by distinct molecular subtypes and varied clinicopathological features. With the emergence of artificial intelligence techniques, especially machine learning, breast cancer research has attained new heights in cancer detection and prognosis.
Objective: Recent developments in computer-driven diagnostic systems have enabled clinicians to improve the accuracy of detecting various types of breast tumors. Our study aims to develop such a system.
Methods: In this article, we propose a breast cancer classification model based on the hybridization of machine learning approaches for classifying triple-negative breast cancer and non-triple-negative breast cancer patients, using clinicopathological features collected from multiple tertiary care hospitals/centers.
Results: The results of the genetic algorithm and support vector machine (GA-SVM) hybrid model were compared with classic feature-selection SVM hybrid models such as support vector machine-recursive feature elimination (SVM-RFE), LASSO-SVM, Grid-SVM, and linear SVM. The GA-SVM hybrid model outperformed the other models when applied to two distinct hospital-based datasets of patients investigated for breast cancer in the North West of the African subcontinent. To validate predictive accuracy, 10-fold cross-validation was applied to all models with the same multicentered datasets. Model performance was evaluated with well-known metrics: mean squared error, logarithmic loss, F1-score, area under the ROC curve, and the precision-recall curve.
Conclusion: The hybrid machine learning model can be employed for breast cancer subtype classification, which could help medical practitioners improve treatment planning and disease outcomes.
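The GA-SVM idea named in the abstract (a genetic algorithm searching over feature subsets, with a cross-validated SVM as the fitness function) can be illustrated roughly as follows. This is a minimal sketch and not the authors' implementation: the data are synthetic, the SVM is a plain linear one trained by stochastic subgradient descent on the regularized hinge loss, accuracy is assumed as the fitness measure, and every function name and parameter value (`make_toy_data`, `train_linear_svm`, `cv_accuracy`, `ga_select`, population size, mutation rate) is hypothetical.

```python
import random


def make_toy_data(n=80, n_features=6, seed=1):
    """Synthetic stand-in data: only features 0 and 1 determine the label."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        row = [rng.gauss(0, 1) for _ in range(n_features)]
        X.append(row)
        y.append(1 if row[0] + row[1] > 0 else -1)
    return X, y


def train_linear_svm(X, y, lam=0.01, lr=0.05, epochs=10):
    """Linear SVM via stochastic subgradient descent on the L2-regularized hinge loss."""
    rng = random.Random(0)  # fixed seed keeps the fitness of a mask deterministic
    w, b = [0.0] * len(X[0]), 0.0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            margin = y[i] * (sum(wj * xj for wj, xj in zip(w, X[i])) + b)
            w = [wj * (1 - lr * lam) for wj in w]  # shrink step from the L2 penalty
            if margin < 1:  # hinge loss is active for this sample
                w = [wj + lr * y[i] * xj for wj, xj in zip(w, X[i])]
                b += lr * y[i]
    return w, b


def cv_accuracy(X, y, mask, k=10):
    """k-fold cross-validated accuracy using only the features switched on in mask."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 0.0  # an empty feature subset cannot classify anything
    Xs = [[row[j] for j in cols] for row in X]
    correct = 0
    for fold in range(k):
        train = [i for i in range(len(Xs)) if i % k != fold]
        test = [i for i in range(len(Xs)) if i % k == fold]
        w, b = train_linear_svm([Xs[i] for i in train], [y[i] for i in train])
        for i in test:
            score = sum(wj * xj for wj, xj in zip(w, Xs[i])) + b
            correct += (1 if score >= 0 else -1) == y[i]
    return correct / len(Xs)


def ga_select(X, y, pop_size=8, generations=5, p_mut=0.1, seed=0):
    """Genetic search over binary feature masks; fitness is cross-validated accuracy."""
    rng = random.Random(seed)
    n = len(X[0])
    cache = {}

    def fitness(mask):
        key = tuple(mask)
        if key not in cache:
            cache[key] = cv_accuracy(X, y, mask)
        return cache[key]

    pop = [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        history.append(fitness(pop[0]))
        nxt = [pop[0][:]]  # elitism: the best mask always survives
        while len(nxt) < pop_size:
            a = max(rng.sample(pop, 2), key=fitness)  # tournament selection
            b = max(rng.sample(pop, 2), key=fitness)
            cut = rng.randint(1, n - 1)  # one-point crossover
            child = a[:cut] + b[cut:]
            child = [bit ^ 1 if rng.random() < p_mut else bit for bit in child]
            nxt.append(child)
        pop = nxt
    best = max(pop, key=fitness)
    return best, fitness(best), history


if __name__ == "__main__":
    X, y = make_toy_data()
    best_mask, best_score, history = ga_select(X, y)
    print("selected mask:", best_mask)
    print("cross-validated accuracy:", best_score)
```

The sketch uses 10 folds to mirror the validation scheme the abstract mentions. The other metrics listed there (mean squared error, logarithmic loss, F1-score, ROC and precision-recall curves) could be substituted for accuracy in `cv_accuracy` without changing the search loop.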
Pages: 68-83
Number of pages: 16