LightGBM: A Leading Force in Breast Cancer Diagnosis Through Machine Learning and Image Processing

被引:4
作者
Kanber, Bassam M. [1 ]
Al Smadi, Ahmad [2 ]
Noaman, Naglaa F. [1 ]
Liu, Bo [1 ]
Gou, Shuiping [1 ]
Alsmadi, Mutasem K. [3 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
[2] Zarqa Univ, Dept Data Sci & Artificial Intelligence, Zarqa 13100, Jordan
[3] Imam Abdulrahman Bin Faisal Univ, Coll Appl Studies & Community Serv, Dept Management Informat Syst, Dammam 34212, Saudi Arabia
关键词
Breast cancer; Histopathology; Image processing; Biomedical imaging; Machine learning; Feature extraction; Image classification; Medical diagnostic imaging; Performance evaluation; histopathological images; image classification; machine learning; feature extraction;
D O I
10.1109/ACCESS.2024.3375755
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The early diagnosis of breast cancer (BC), a prominent global cause of mortality, necessitates the development of innovative diagnostic strategies. This study leverages machine learning (ML) and advanced image processing techniques to analyze histopathology images, thereby augmenting the capabilities for BC diagnosis. A robust feature extraction (FE) pipeline is developed, integrating techniques such as color histogram analysis, contour FE, hu moments, and haralick texture features. Ten ML algorithms, including LightGBM (LGBM), CatBoost, and XGBoost, are systematically evaluated across varying magnifications of the BreakHis dataset to assess their diagnostic performance. The research introduces a novel approach by combining distinct FE techniques, enhancing the model's ability to distinguish between benign and malignant tissues with exceptional accuracy. These integrated techniques significantly elevate BC diagnostic accuracy and reliability, holding the potential to positively impact patient outcomes and healthcare systems. Notably, the combination of the FE pipeline and LGBM achieves the highest accuracy, reported in two forms: before augmentation accuracies (0.9598 for 40x, 0.9516 for 100 x , 0.9652 for 200 x , 0.9535 for 400 x , and 0.9570 for all magnifications combined) and after augmentation accuracies (0.9949 for 40x , 0.9870 for 100 x , 0.9987 for 200 x , and 0.9918 for 400 x ) for the classification of magnification histopathological images. Moreover, the study highlights the crucial role of augmentation in further refining classification accuracy. Extending its applicability, the proposed method is also successfully applied to the classification of lung colon cancer images (LC25000 dataset), achieving an impressive accuracy of 0.9983. The model demonstrates its effectiveness and adaptability as a compelling method for histopathological image classification. This research contributes to the evolving field of BC diagnostics, offering a framework for robust and accurate ML-based diagnostic tools that may revolutionize cancer diagnosis and enhance patient care.
引用
收藏
页码:39811 / 39832
页数:22
相关论文
共 60 条
  • [1] Classification of Breast Tumors Based on Histopathology Images Using Deep Features and Ensemble of Gradient Boosting Methods
    Abbasniya, Mohammad Reza
    Sheikholeslamzadeh, Sayed Ali
    Nasiri, Hamid
    Emami, Samaneh
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2022, 103
  • [2] The Evolution and Reliability of Machine Learning Techniques for Oncology
    Abu Owida, Hamza
    Moh'd, Bashar Al-haj
    Turab, Nidal
    Al-Nabulsi, Jamal
    Abuowaida, Suhaila
    [J]. INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (08) : 110 - 129
  • [3] Classification of Breast Cancer Histology Images Using Transfer Learning
    Ahmad, Hafiz Mughees
    Ghuffar, Sajid
    Khurshid, Khurram
    [J]. PROCEEDINGS OF 2019 16TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2019, : 328 - 332
  • [4] RoboGuard: Enhancing Robotic System Security with Ensemble Learning
    Al Maqousi, Ali
    Alauthman, Mohammad
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (06) : 965 - 976
  • [5] Alazaidah R., 2024, J. Statist. Appl. Probab., V13, P119
  • [6] A novel Siamese deep hashing model for histopathology image retrieval
    Alizadeh, Seyed Mohammad
    Helfroush, Mohammad Sadegh
    Mueller, Henning
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 225
  • [7] Sliding Window Based Support Vector Machine System for Classification of Breast Cancer Using Histopathological Microscopic Images
    Alqudah, Amin
    Alqudah, Ali Mohammad
    [J]. IETE JOURNAL OF RESEARCH, 2022, 68 (01) : 59 - 67
  • [8] Alzyoud M., 2024, International Journal of Data and Network Science, V8, P179, DOI [DOI 10.5267/J.IJDNS.2023.10.006, 10.5267/j.ijdns.2023.10.006]
  • [9] [Anonymous], 2001, ELEMENTS STAT LEARNI, DOI [DOI 10.1007/978-0-387-21606-5, 10.1007/978-0-387-21606-5]
  • [10] [Anonymous], 2023, Welcome to LightGBM's Documentation!-LightGBM 4.1.0.99Documentation