Analytics of Heterogeneous Breast Cancer Data Using Neuroevolution

被引:17
作者
Abdikenov, Beibit [1 ]
Iklassov, Zangir [1 ]
Sharipov, Askhat [1 ]
Hussain, Shahid [2 ]
Jamwal, Prashant K. [1 ]
机构
[1] Nazarbayev Univ, Elect & Comp Engn Dept, Astana 010000, Kazakhstan
[2] Univ Canberra, Fac Sci & Technol, Human Centred Technol Res Ctr, Canberra, ACT 2617, Australia
关键词
Breast cancer prognostic modelling; entity embedding; deep learning networks; evolutionary algorithms; fuzzy inferencing; NONDOMINATED SORTING APPROACH; GENETIC ALGORITHM; EVOLUTIONARY; CLASSIFICATION; NETWORKS; DESIGN;
D O I
10.1109/ACCESS.2019.2897078
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Breast cancer prognostic modeling is difficult since it is governed by many diverse factors. Given the low median survival and large scale breast cancer data, which comes from high throughput technology, the accurate and reliable prognosis of breast cancer is becoming increasingly difficult. While accurate and timely prognosis may save many patients from going through painful and expensive treatments, it may also help oncologists in managing the disease more efficiently and effectively. Data analytics augmented by machine-learning algorithms have been proposed in past for breast cancer prognosis; and however, most of these could not perform well owing to the heterogeneous nature of available data and model interpretability related issues. A robust prognostic modeling approach is proposed here whereby a Pareto optimal set of deep neural networks (DNNs) exhibiting equally good performance metrics is obtained. The set of DNNs is initialized and their hyperparameters are optimized using the evolutionary algorithm, NSGAIII. The final DNN model is selected from the Pareto optimal set of many DNNs using a fuzzy inferencing approach. Contrary to using DNNs as the black box, the proposed scheme allows understanding how various performance metrics (such as accuracy, sensitivity, F1, and so on) change with changes in hyperparameters. This enhanced interpretability can be further used to improve or modify the behavior of DNNs. The heterogeneous breast cancer database requires preprocessing for better interpretation of categorical variables in order to improve prognosis from classifiers. Furthermore, we propose to use a neural network-based entity-embedding method for categorical features with high cardinality. This approach can provide a vector representation of categorical features in multidimensional space with enhanced interpretability. It is shown with evidence that DNNs optimized using evolutionary algorithms exhibit improved performance over other classifiers mentioned in this paper.
引用
收藏
页码:18050 / 18060
页数:11
相关论文
共 58 条
[1]   Improving multiclass classification by deep networks using DAGSVM and Triplet Loss [J].
Agarwal, Nakul ;
Balasubramanian, Vineeth N. ;
Jawahar, C., V .
PATTERN RECOGNITION LETTERS, 2018, 112 :184-190
[2]   Efficient Conversion of Deep Features to Compact Binary Codes Using Fourier Decomposition for Multimedia Big Data [J].
Ahmad, Jamil ;
Muhammad, Khan ;
Lloret, Jaime ;
Baik, Sung Wook .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (07) :3205-3215
[3]  
[Anonymous], 2017, P 22 AUSTR DOC COMP
[4]  
Auerbach JE, 2011, GECCO-2011: PROCEEDINGS OF THE 13TH ANNUAL GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, P1475
[5]  
Bader J., 2008, EVOLUTIONARY COMPUTA, V19, P45
[6]   Influence of complications following immediate breast reconstruction on breast cancer recurrence rates [J].
Beecher, S. M. ;
O'Leary, D. P. ;
McLaughlin, R. ;
Sweeney, K. J. ;
Kerin, M. J. .
BRITISH JOURNAL OF SURGERY, 2016, 103 (04) :391-398
[7]  
Bojanowski P., 2017, T ASSOC COMPUT LING, V5, P135, DOI [10.1162/tacl_a_00051, DOI 10.1162/TACL_A_00051]
[8]   Identifying risk factors for adverse diseases using dynamic rare association rule mining [J].
Borah, Anindita ;
Nath, Bhabesh .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 113 :233-263
[9]   Research on Rotor Position Model for Switched Reluctance Motor Using Neural Network [J].
Cai, Yan ;
Wang, Yu ;
Xu, Hainan ;
Sun, Siyuan ;
Wang, Chenhui ;
Sun, Liubin .
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2018, 23 (06) :2762-2773
[10]   An Algorithm for Clustering Categorical Data With Set-Valued Features [J].
Cao, Fuyuan ;
Huang, Joshua Zhexue ;
Liang, Jiye ;
Zhao, Xingwang ;
Meng, Yinfeng ;
Feng, Kai ;
Qian, Yuhua .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (10) :4593-4606