Optimizing Outcome Prediction in Diffuse Large B-Cell Lymphoma by Use of Machine Learning and Nationwide Lymphoma Registries: A Nordic Lymphoma Group Study

被引:31
作者
Biccler, Jorne L. [1 ,2 ]
Eloranta, Sandra [6 ]
Brown, Peter de Nully [3 ]
Frederikseri, Henrik [4 ]
Jerkeman, Mats [7 ]
Jorgensen, Judit [5 ]
Jakobsen, Lasso Hjort [1 ,2 ]
Smedby, Karin E. [6 ,8 ]
Bogsted, Martin [1 ,2 ]
El-Galaly, Tarec C. [1 ,2 ]
机构
[1] Aalborg Univ Hosp, Aalborg, Denmark
[2] Aalborg Univ, Aalborg, Denmark
[3] Copenhagen Univ Hosp, Copenhagen, Denmark
[4] Odense Univ Hosp, Odense, Denmark
[5] Aarhus Univ Hosp, Aarhus, Denmark
[6] Karolinska Inst, Stockholm, Sweden
[7] Lund Univ, Lund, Sweden
[8] Karolinska Univ Hosp, Solna, Sweden
来源
JCO CLINICAL CANCER INFORMATICS | 2018年 / 2卷
关键词
D O I
10.1200/CCI.18.00025
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Purpose Prognostic models for diffuse large B-cell lymphoma (DLBCL), such as the International Prognostic Index (IPI) are widely used in clinical practice. The models are typically developed with simplicity in mind and thus do not exploit the full potential of detailed clinical data. This study investigated whether nationwide lymphoma registries containing clinical data and machine learning techniques could prove to be useful for building modern prognostic tools. Patients and Methods This study was based on nationwide lymphoma registries from Denmark and Sweden, which include large amounts of clinicopathologic data. Using the Danish DLBCL cohort, a stacking approach was used to build a new prognostic model that leverages the strengths of different survival models. To compare the performance of the stacking approach with established prognostic models, cross-validation was used to estimate the concordance index (C-index), time-varying area under the curve, and integrated Brier score. Finally, the generalizability was tested by applying the new model to the Swedish cohort. Results In total, 2,759 and 2,414 patients were included from the Danish and Swedish cohorts, respectively. In the Danish cohort, the stacking approach led to the lowest integrated Brier score, indicating that the survival curves obtained from the stacking model fitted the observed survival the best. The C-index and time-varying area under the curve indicated that the stacked model (C-index: Denmark [DK], 0.756; Sweden [SE], 0.744) had good discriminative capabilities compared with the other considered prognostic models (IPI: DK, 0.662; SE, 0.661; and National Comprehensive Cancer Network-IPI: DK, 0.681; SE, 0.681). Furthermore, these results were reproducible in the independent Swedish cohort. Conclusion A new prognostic model based on machine learning techniques was developed and was shown to significantly outperform established prognostic indices for DLBCL. (C) 2018 by American Society of Clinical Oncology
引用
收藏
页码:1 / 13
页数:13
相关论文
共 39 条
  • [1] Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling
    Alizadeh, AA
    Eisen, MB
    Davis, RE
    Ma, C
    Lossos, IS
    Rosenwald, A
    Boldrick, JG
    Sabet, H
    Tran, T
    Yu, X
    Powell, JI
    Yang, LM
    Marti, GE
    Moore, T
    Hudson, J
    Lu, LS
    Lewis, DB
    Tibshirani, R
    Sherlock, G
    Chan, WC
    Greiner, TC
    Weisenburger, DD
    Armitage, JO
    Warnke, R
    Levy, R
    Wilson, W
    Grever, MR
    Byrd, JC
    Botstein, D
    Brown, PO
    Staudt, LM
    [J]. NATURE, 2000, 403 (6769) : 503 - 511
  • [2] Allison P., 2002, MISSING DATA
  • [3] A comparison of imputation techniques for handling missing predictor values in a risk model with a binary outcome
    Ambler, Gareth
    Omar, Rumana Z.
    Royston, Patrick
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2007, 16 (03) : 277 - 298
  • [4] The Danish National Lymphoma Registry: Coverage and Data Quality
    Arboe, Bente
    El-Galaly, Tarec Christoffer
    Clausen, Michael Roost
    Munksgaard, Peter Svenssen
    Stoltenberg, Danny
    Nygaard, Mette Kathrine
    Klausen, Tobias Wirenfeldt
    Christensen, Jacob Haaber
    Gorlov, Jette Sonderskov
    Brown, Peter de Nully
    [J]. PLOS ONE, 2016, 11 (06):
  • [5] New approach to classifying non-hodgkin's lymphomas: Clinical features of the major histologic subtypes
    Armitage, JO
    Weisenburger, DD
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 1998, 16 (08) : 2780 - 2795
  • [6] Bembom O, 2007, STAT APPL GENET MOL, V6
  • [7] Biccler J, 2017, CANC MED, V7, P114
  • [8] Estimating and comparing time-dependent areas under receiver operating characteristic curves for censored event times with competing risks
    Blanche, Paul
    Dartigues, Jean-Francois
    Jacqmin-Gadda, Helene
    [J]. STATISTICS IN MEDICINE, 2013, 32 (30) : 5381 - 5397
  • [9] Real-world data on prognostic factors and treatment in peripheral T-cell lymphomas: a study from the Swedish Lymphoma Registry
    Ellin, Fredrik
    Landstrom, Jenny
    Jerkeman, Mats
    Relander, Thomas
    [J]. BLOOD, 2014, 124 (10) : 1570 - 1577
  • [10] A clinically based prognostic index for diffuse large B-cell lymphoma with a cut-off at 70 years of age significantly improves prognostic stratification: population-based analysis from the Danish Lymphoma Registry
    Gang, Anne O.
    Pedersen, Michael
    d'Amore, Francesco
    Pedersen, Lars M.
    Jensen, Bo A.
    Jensen, Paw
    Moller, Michael B.
    Mourits-Andersen, Hans T.
    Pedersen, Robert S.
    Klausen, Tobias W.
    Brown, Peter de N.
    [J]. LEUKEMIA & LYMPHOMA, 2015, 56 (09) : 2556 - 2562