Training artificial neural networks directly on the concordance index for censored data using genetic algorithms

被引:18
作者
Kalderstam, Jonas [1 ]
Eden, Patrik [1 ]
Bendahl, Par-Ola [2 ]
Strand, Carina [2 ]
Ferno, Marten [2 ]
Ohlsson, Mattias [1 ]
机构
[1] Lund Univ, Dept Astron & Theoret Phys, Computat Biol & Biol Phys Grp, SE-22362 Lund, Sweden
[2] Lund Univ, Skane Univ Hosp, Clin Sci Lund, Dept Oncol, SE-22185 Lund, Sweden
基金
瑞典研究理事会;
关键词
Survival analysis; Genetic algorithms; Artificial neural networks; Concordance index; Breast cancer recurrence; BREAST-CANCER; MODELS;
D O I
10.1016/j.artmed.2013.03.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objective: The concordance index (c-index) is the standard way of evaluating the performance of prognostic models in the presence of censored data. Constructing prognostic models using artificial neural networks (ANNs) is commonly done by training on error functions which are modified versions of the c-index. Our objective was to demonstrate the capability of training directly on the c-index and to evaluate our approach compared to the Cox proportional hazards model. Method: We constructed a prognostic model using an ensemble of ANNs which were trained using a genetic algorithm. The individual networks were trained on a non-linear artificial data set divided into a training and test set both of size 2000, where 50% of the data was censored. The ANNs were also trained on a data set consisting of 4042 patients treated for breast cancer spread over five different medical studies, 2/3 used for training and 1/3 used as a test set. A Cox model was also constructed on the same data in both cases. The two models' c-indices on the test sets were then compared. The ranking performance of the models is additionally presented visually using modified scatter plots. Results: Cross validation on the cancer training set did not indicate any non-linear effects between the covariates. An ensemble of 30 ANNs with one hidden neuron was therefore used. The ANN model had almost the same c-index score as the Cox model (c-index = 0.70 and 0.71, respectively) on the cancer test set. Both models identified similarly sized low risk groups with at most 10% false positives, 49 for the ANN model and 60 for the Cox model, but repeated bootstrap runs indicate that the difference was not significant. A significant difference could however be seen when applied on the non-linear synthetic data set. In that case the ANN ensemble managed to achieve a c-index score of 0.90 whereas the Cox model failed to distinguish itself from the random case (c-index = 0.49). Conclusions: We have found empirical evidence that ensembles of ANN models can be optimized directly on the c-index. Comparison with a Cox model indicates that near identical performance is achieved on a real cancer data set while on a non-linear data set the ANN model is clearly superior. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:125 / 132
页数:8
相关论文
共 50 条
  • [31] Modeling the Ase 20 Greek Index Using Artificial Neural Nerworks Combined with Genetic Algorithms
    Karathanasopoulos, Andreas S.
    Theofilatos, Konstantinos A.
    Leloudas, Panagiotis M.
    Likothanassis, Spiridon D.
    ARTIFICIAL NEURAL NETWORKS-ICANN 2010, PT I, 2010, 6352 : 428 - +
  • [32] Hybrid training approach for artificial neural networks using genetic algorithms for rate of reaction estimation: Application to industrial methanol oxidation to formaldehyde on silver catalyst
    Papes Filho, Antonio Carlos
    Maciel Filho, Rubens
    CHEMICAL ENGINEERING JOURNAL, 2010, 157 (2-3) : 501 - 508
  • [33] Artificial neural networks design using evolutionary algorithms
    Castillo, PA
    Arenas, MG
    Castillo-Valdivieso, JJ
    Merelo, JJ
    Prieto, A
    Romero, G
    ADVANCES IN SOFT COMPUTING: ENGINEERING DESIGN AND MANUFACTURING, 2003, : 43 - 52
  • [34] Analysis of Cancer Microarray Data using Constructive Neural Networks and Genetic Algorithms
    Luque-Baena, R. M.
    Urda, D.
    Subirats, J. L.
    Franco, L.
    Jerez, J. M.
    PROCEEDINGS IWBBIO 2013: INTERNATIONAL WORK-CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, 2013, : 55 - 63
  • [35] Privacy-Preserving Distributed Learning Based on Genetic Algorithms and Artificial Neural Networks
    Guijarro-Berdinas, Bertha
    Martinez-Rego, David
    Fernandez-Lorenzo, Santiago
    DISTRIBUTED COMPUTING, ARTIFICIAL INTELLIGENCE, BIOINFORMATICS, SOFT COMPUTING, AND AMBIENT ASSISTED LIVING, PT II, PROCEEDINGS, 2009, 5518 : 195 - 202
  • [36] Data-driven system for intelligent monitoring and optimization of froth flotation circuits using Artificial Neural Networks and Genetic Algorithms
    Hasidi, Oussama
    Abdelwahed, El Hassan
    Alaoui-Chrifi, Moulay Abdellah El
    Chahid, Rachida
    Qazdar, Aimad
    Qassimi, Sara
    Zaizi, Fatima Zahra
    Bourzeix, Francois
    Benzakour, Intissar
    Bendaouia, Ahmed
    JOURNAL OF PROCESS CONTROL, 2024, 137
  • [37] An integrated approach for optimum design of bridge decks using genetic algorithms and artificial neural networks
    Srinivas, V.
    Ramanjaneyulu, K.
    ADVANCES IN ENGINEERING SOFTWARE, 2007, 38 (07) : 475 - 487
  • [38] Evolving artificial neural networks using simulated annealing-based hybrid genetic algorithms
    Shi H.
    Li W.
    Journal of Software, 2010, 5 (04) : 353 - 360
  • [39] Real-Time Dynamic Transit Signal Priority Optimization for Coordinated Traffic Networks Using Genetic Algorithms and Artificial Neural Networks
    Ghanim, Mohammad S.
    Abu-Lebdeh, Ghassan
    JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 19 (04) : 327 - 338
  • [40] Selection of predictor variables for pneumonia using neural networks and genetic algorithms
    Heckerling, PS
    Gerber, BS
    Tape, TG
    Wigton, RS
    METHODS OF INFORMATION IN MEDICINE, 2005, 44 (01) : 89 - 97