Optimized U-Net convolutional neural network based breast cancer prediction for accuracy increment in big data

被引:1
作者
Kirola, Madhu [1 ]
Memoria, Minakshi [1 ]
Dumka, Ankur [2 ,3 ]
机构
[1] Uttaranchal Univ, Uttaranchal Inst Technol, Dept Comp Sci & Engn, Dehra Dun, India
[2] Women Inst Technol, Dept Comp Sci & Engn, Dehra Dun, India
[3] Graph era Univ, Dept Comp Sci, Dehra Dun, Uttarakhand, India
关键词
big data; breast cancer; classification; deep learning; Hadoop; segmentation; MACHINE-LEARNING ALGORITHMS; RISK;
D O I
10.1002/cpe.7652
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Big data is data collected with huge dimensions and continuous exponential growth over time. In recent years, big data in health care has been commonly used to predict diseases. Breast cancer is one of the most common diseases and the secondary cause of death among women. Early diagnosis of breast cancer can prevent the risk of death. Few types of research have been done on breast cancer prediction on big data. However, the traditional prediction models have less efficient in terms of accuracy and error rate. The Optimized U-Net Convolutional neural network (OU-NetCNN) model is proposed in this paper to overcome these challenges. Hadoop is the storage system generated to store the datasets samples for big data. The data samples from two datasets, namely BreakHis and Kaggle (Breast Histopathology Images), are preserved in this storage system. The BreakHis data are considered for further processes like pre-processing, segmentation, feature extraction, feature selection, and classification from the stored data samples. The noise is removed from the histopathological breast images in pre-processing using the adaptive fast peer-group filtering (AFPGF) approach. Then the morphological operations such as erosion and dilation are used to eradicate unwanted portions and the quality of the image is enhanced using the improved balance contrast enhancement method (IBCE). Next, edges of breast images are detected using an adaptive artificial ecosystem optimization (AAEO) algorithm-based edge detection approach in the segmentation process. The features are extracted using the Spatial Gray Level Dependence Matrix (SGLDM) and the optimal features are selected by the modified selfish herd optimization (MSHO) algorithm. Finally, the selected features are fed into the proposed OU-NetCNN model to classify the histopathology images as benign and malignant images. This hybridization minimizes the error rate, computational complexity and over fitting issues. The simulation analysis is performed in the PYTHON tool. Two datasets, namely BreakHis and Kaggle (Breast Histopathology Images), are considered. Some of the measures such as precision, sensitivity, accuracy and F-measure are considered to evaluate the performance of the proposed model and compared with existing approaches.
引用
收藏
页数:19
相关论文
共 24 条
[1]   CWV-BANN-SVM ensemble learning classifier for an accurate diagnosis of breast cancer [J].
Abdar, Moloud ;
Makarenkov, Vladimir .
MEASUREMENT, 2019, 146 :557-570
[2]   On the Scalability of Machine-Learning Algorithms for Breast Cancer Prediction in Big Data Context [J].
Alghunaim, Sara ;
Al-Baity, Heyam H. .
IEEE ACCESS, 2019, 7 :91535-91546
[3]  
Amrane M., 2018, P EL EL COMP SCI BIO, P1
[4]  
[Anonymous], 2015, International Journal of Scientific Engineering and Applied Science (IJSEAS)
[5]  
Asri H, 2015, 2015 INTERNATIONAL CONFERENCE ON CLOUD TECHNOLOGIES AND APPLICATIONS (CLOUDTECH 15), P56
[6]   Using Machine Learning Algorithms for Breast Cancer Risk Prediction and Diagnosis [J].
Asri, Hiba ;
Mousannif, Hajar ;
Al Moatassime, Hassan ;
Noel, Thomas .
7TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2016) / THE 6TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT-2016) / AFFILIATED WORKSHOPS, 2016, 83 :1064-1069
[7]   Automated Breast Cancer Diagnosis Based on Machine Learning Algorithms [J].
Dhahri, Habib ;
Al Maghayreh, Eslam ;
Mahmood, Awais ;
Elkilani, Wail ;
Nagi, Mohammed Faisal .
JOURNAL OF HEALTHCARE ENGINEERING, 2019, 2019
[8]   SVM and SVM Ensembles in Breast Cancer Prediction [J].
Huang, Min-Wei ;
Chen, Chih-Wen ;
Lin, Wei-Chao ;
Ke, Shih-Wen ;
Tsai, Chih-Fong .
PLOS ONE, 2017, 12 (01)
[9]  
Islam MM, 2017, IEEE REG 10 HUMANIT, P226, DOI 10.1109/R10-HTC.2017.8288944
[10]   Prediction of Malignant and Benign Breast Cancer: A Data Mining Approach in Healthcare Applications [J].
Kumar, Vivek ;
Mishra, Brojo Kishore ;
Mazzara, Manuel ;
Thanh, Dang N. H. ;
Verma, Abhishek .
ADVANCES IN DATA SCIENCE AND MANAGEMENT, 2020, 37 :435-442