Breast Cancer Prediction Using Fine Needle Aspiration Features and Upsampling with Supervised Machine Learning

Cited by: 19
Authors
Shafique, Rahman [1]
Rustam, Furqan [2]
Choi, Gyu Sang [1]
Diez, Isabel de la Torre [3]
Mahmood, Arif [4]
Lipari, Vivian [5, 6, 7]
Velasco, Carmen Lili Rodriguez [5, 8, 9]
Ashraf, Imran [1]
Affiliations
[1] Yeungnam Univ, Dept Informat & Commun Engn, Gyongsan 38541, South Korea
[2] Univ Coll Dublin, Sch Comp Sci, Dublin D04V1W8, Ireland
[3] Univ Valladolid, Dept Signal Theory & Commun & Telematic Engn, Paseo Belen 15, Valladolid 47011, Spain
[4] Islamia Univ Bahawalpur, Dept Comp Sci & Informat Technol, Bahawalpur 63100, Punjab, Pakistan
[5] Univ Europea Atlant, Res Grp Foods, Nutr Biochem & Hlth, Isabel Torres 21, Santander 39011, Spain
[6] Univ Int Iberoamericana, Dept Project Management, Campeche 24560, Mexico
[7] Fdn Universitaria Int Colombia Bogota, Bogota, Colombia
[8] Univ Int Iberoamericana Arecibo, Dept Project Management, Arecibo, PR 00613 USA
[9] Univ Int Cuanza, Project Management, Cuito EN250, Kuito, Bie, Angola
Keywords
breast cancer prediction; feature selection; fine-needle aspiration features; principal component analysis; singular value decomposition; deep learning; SYSTEM; DIAGNOSIS; IMAGES
DOI
10.3390/cancers15030681
Chinese Library Classification number
R73 [Oncology]
Discipline classification code
100214
Abstract
Simple Summary: Breast cancer is prevalent in women and is the second leading cause of cancer death among them. Conventional breast cancer detection methods require several laboratory tests and medical experts. Automated breast cancer detection is thus very important for timely treatment. This study explores the influence of various feature selection techniques on the performance of machine learning methods for breast cancer detection. Experimental results show that using appropriate features tends to yield highly accurate predictions.

Breast cancer is one of the most common invasive cancers in women, and it continues to be a worldwide medical problem since the number of cases has significantly increased over the past decade. Breast cancer is the second leading cause of death from cancer in women. Early detection of breast cancer can save lives, but the traditional approach to detecting it requires various laboratory tests involving medical experts. To reduce human error and speed up breast cancer detection, an automatic system is required that performs the diagnosis accurately and in a timely manner. Despite research efforts on automated systems for cancer detection, a wide gap exists between the desired accuracy and the accuracy that current approaches provide. To narrow this gap, this research proposes an approach for breast cancer prediction that selects the best fine needle aspiration features. To enhance the prediction accuracy, several feature selection techniques are applied and their efficacy is analyzed, including principal component analysis (PCA), singular value decomposition (SVD), and chi-square (Chi2). Extensive experiments are performed with different features and different feature set sizes to identify the optimal feature set. Additionally, the influence of imbalanced data versus data balanced with the synthetic minority oversampling technique (SMOTE) is investigated. Six classifiers, namely random forest, support vector machine, gradient boosting machine, logistic regression, multilayer perceptron, and K-nearest neighbors (KNN), are tuned to achieve increased classification accuracy. Results indicate that KNN outperforms all other classifiers on the dataset used, achieving a 100% accuracy score both with 20 features using SVD and with the 15 most important features using PCA.
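The abstract names two of the building blocks of the approach, SMOTE-based upsampling and KNN classification, without showing how they work. The following is a minimal sketch of both ideas in plain Python, not the authors' implementation. The function names `smote_like_oversample` and `knn_predict`, the values of `k`, and the two-feature toy points are all assumptions made for illustration; the feature selection steps (PCA, SVD, Chi2) and classifier tuning are omitted.

```python
import math
import random
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def smote_like_oversample(minority, n_new, k=3, seed=0):
    """Create n_new synthetic minority samples (hypothetical helper).

    Each synthetic point lies on the segment between a randomly chosen
    minority sample and one of its k nearest minority neighbours.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Nearest minority neighbours of the base sample, excluding itself.
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: euclidean(base, p),
        )[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


def knn_predict(train_X, train_y, query, k=3):
    """Majority vote among the k training points closest to the query."""
    ranked = sorted(
        zip(train_X, train_y), key=lambda pair: euclidean(pair[0], query)
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Made-up two-feature data (NOT from the paper): 8 benign, 3 malignant.
benign = [(1.0, 1.2), (1.1, 0.9), (0.8, 1.0), (1.3, 1.1),
          (0.9, 0.8), (1.2, 1.3), (1.0, 0.7), (0.7, 1.1)]
malignant = [(3.0, 3.2), (3.3, 2.9), (2.8, 3.1)]

# Balance the classes by synthesising the missing minority samples.
extra = smote_like_oversample(
    malignant, n_new=len(benign) - len(malignant), k=2
)
X = benign + malignant + extra
y = ["benign"] * len(benign) + ["malignant"] * (len(malignant) + len(extra))

print(Counter(y))                     # class counts after balancing
print(knn_predict(X, y, (2.9, 3.0)))  # query near the malignant cluster
print(knn_predict(X, y, (1.0, 1.0)))  # query near the benign cluster
```

In a real evaluation the oversampling would normally be applied to the training split only, so that synthetic points derived from test samples cannot inflate the accuracy score.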
Pages: 21
Related papers
50 in total
  • [1] Breast cancer prediction using supervised machine learning techniques
    Dadheech, Pankaj
    Kalmani, Vijay
    Dogiwal, Sanwta Ram
    Sharma, Vijay Kumar
    Kumar, Ankit
    Pandey, Saroj Kumar
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2023, 44 (03): 383 - 392
  • [2] Fine needle aspiration of synchronous bilateral breast cancer
    Konstantinou, E.
    Divani, S.
    Tzikopoulou, M.
    Kalodimos, G.
    Fericean, A.
    Rousogiannis, S.
    Vardouli, A.
    VIRCHOWS ARCHIV, 2017, 471 : S244 - S244
  • [3] Breast cancer chemoprevention trials using the fine-needle aspiration model
    Kimler, BF
    Fabian, CJ
    Wallace, DD
    JOURNAL OF CELLULAR BIOCHEMISTRY, 2000, : 7 - 12
  • [4] PREDICTION OF BREAST CANCER USING K-NEAREST NEIGHBOUR: A SUPERVISED MACHINE LEARNING ALGORITHM
    Pandey, S.
    Sharma, A.
    Siddiqui, M. K.
    Singla, D.
    Vanderpuye-Orgle, J.
    VALUE IN HEALTH, 2020, 23 : S1 - S1
  • [5] SECRETORY CARCINOMA OF THE BREAST: SPECIFIC FEATURES OF FINE NEEDLE ASPIRATION CYTOLOGY
    Abe, M.
    Ikenaga, M.
    Morizono, H.
    Iwase, T.
    Horii, R.
    Akiyama, F.
    Arai, Y.
    Furuta, N.
    Hirai, Y.
    ACTA CYTOLOGICA, 2010, 54 (03) : 440 - 440
  • [6] INFLUENCE OF CANCER HISTOLOGY ON THE SUCCESS OF FINE NEEDLE ASPIRATION OF THE BREAST
    LAMB, J
    ANDERSON, TJ
    JOURNAL OF CLINICAL PATHOLOGY, 1989, 42 (07) : 733 - 735
  • [7] FINE NEEDLE ASPIRATION BIOPSY IN THE DIAGNOSIS OF BREAST-CANCER
    MCDONALD, AH
    HANCHARD, B
    SHAH, D
    PATH, DM
    FLETCHER, PR
    DUQUESNAY, R
    WEST INDIAN MEDICAL JOURNAL, 1990, 39 (02): 71 - 73
  • [8] FINE-NEEDLE ASPIRATION BIOPSY AND BREAST-CANCER
    FOX, AL
    AMERICAN JOURNAL OF CLINICAL PATHOLOGY, 1993, 99 (05) : 645 - 645
  • [9] FINE NEEDLE ASPIRATION CYTOLOGY IN THE DIAGNOSIS OF BREAST-CANCER
    NICASTRI, G
    REED, WP
    DZIURA, B
    BREAST CANCER RESEARCH AND TREATMENT, 1988, 12 (01) : 115 - 115
  • [10] Automated breast cancer diagnosis based on fine needle aspiration
    deGuzman, MC
    Prabhu, N
    Cramer, H
    ANALYTICAL AND QUANTITATIVE CYTOLOGY AND HISTOLOGY, 2002, 24 (06): 305 - 313