Performance comparison of modified ComBat for harmonization of radiomic features for multicenter studies

被引:140
作者
Da-ano, R. [1 ]
Masson, I [1 ,2 ]
Lucia, F. [1 ,3 ]
Dore, M. [2 ]
Robin, P. [4 ]
Alfieri, J. [5 ]
Rousseau, C. [6 ,11 ]
Mervoyer, A. [2 ]
Reinhold, C. [7 ]
Castelli, J. [8 ,9 ]
De Crevoisier, R. [8 ,9 ]
Ramee, J. F. [10 ]
Pradier, O. [1 ,3 ]
Schick, U. [1 ,3 ]
Visvikis, D. [1 ]
Hatt, M. [1 ]
机构
[1] Univ Brest, INSERM, UMR 1101, LaTIM, Brest, France
[2] Inst Cancerol Ouest Rene Gauducheau, Dept Radiat Oncol, St Herblain, France
[3] Univ Hosp, Radiat Oncol Dept, Brest, France
[4] Univ Brest, Dept Nucl Med, Brest, France
[5] McGill Univ, Dept Radiat Oncol, Hlth Ctr, Montreal, PQ, Canada
[6] Inst Cancerol Ouest Rene Gauducheau, Dept Nucl Med, St Herblain, France
[7] McGill Univ, Dept Radiol, Hlth Ctr, Montreal, PQ, Canada
[8] Inst Eugene Marquis, Radiotherapy Dept Canc, Rennes, France
[9] Univ Rennes 1, LTSI, Rennes, France
[10] Ctr Hosp Vendee, Dept Med Oncol, La Roche Suryon, France
[11] Univ Nantes, CRCINA, INSERM, UMR1232,CNRS,ERL6001, Nantes, France
基金
欧盟地平线“2020”;
关键词
IMAGES;
D O I
10.1038/s41598-020-66110-w
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Multicenter studies are needed to demonstrate the clinical potential value of radiomics as a prognostic tool. However, variability in scanner models, acquisition protocols and reconstruction settings are unavoidable and radiomic features are notoriously sensitive to these factors, which hinders pooling them in a statistical analysis. A statistical harmonization method called ComBat was developed to deal with the "batch effect" in gene expression microarray data and was used in radiomics studies to deal with the "center-effect". Our goal was to evaluate modifications in ComBat allowing for more flexibility in choosing a reference and improving robustness of the estimation. Two modified ComBat versions were evaluated: M-ComBat allows to transform all features distributions to a chosen reference, instead of the overall mean, providing more flexibility. B-ComBat adds bootstrap and Monte Carlo for improved robustness in the estimation. BM-ComBat combines both modifications. The four versions were compared regarding their ability to harmonize features in a multicenter context in two different clinical datasets. The first contains 119 locally advanced cervical cancer patients from 3 centers, with magnetic resonance imaging and positron emission tomography imaging. In that case ComBat was applied with 3 labels corresponding to each center. The second one contains 98 locally advanced laryngeal cancer patients from 5 centers with contrast-enhanced computed tomography. In that specific case, because imaging settings were highly heterogeneous even within each of the five centers, unsupervised clustering was used to determine two labels for applying ComBat. The impact of each harmonization was evaluated through three different machine learning pipelines for the modelling step in predicting the clinical outcomes, across two performance metrics (balanced accuracy and Matthews correlation coefficient). Before harmonization, almost all radiomic features had significantly different distributions between labels. These differences were successfully removed with all ComBat versions. The predictive ability of the radiomic models was always improved with harmonization and the improved ComBat provided the best results. This was observed consistently in both datasets, through all machine learning pipelines and performance metrics. The proposed modifications allow for more flexibility and robustness in the estimation. They also slightly but consistently improve the predictive power of resulting radiomic models.
引用
收藏
页数:12
相关论文
共 48 条
[1]  
[Anonymous], 2004, Proc. IEEE Int Biomedical Imaging: Nano to Macro Symp, DOI DOI 10.1109/ISBI.2004.1398617
[2]   FDG PET/CT: EANM procedure guidelines for tumour imaging: version 2.0 [J].
Boellaard, Ronald ;
Delgado-Bolton, Roberto ;
Oyen, Wim J. G. ;
Giammarile, Francesco ;
Tatsch, Klaus ;
Eschner, Wolfgang ;
Verzijlbergen, Fred J. ;
Barrington, Sally F. ;
Pike, Lucy C. ;
Weber, Wolfgang A. ;
Stroobants, Sigrid ;
Delbeke, Dominique ;
Donohoe, Kevin J. ;
Holbrook, Scott ;
Graham, Michael M. ;
Testanera, Giorgio ;
Hoekstra, Otto S. ;
Zijlstra, Josee ;
Visser, Eric ;
Hoekstra, Corneline J. ;
Pruim, Jan ;
Willemsen, Antoon ;
Arends, Bertjan ;
Kotzerke, Joerg ;
Bockisch, Andreas ;
Beyer, Thomas ;
Chiti, Arturo ;
Krause, Bernd J. .
EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2015, 42 (02) :328-354
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]   Creating Robust Predictive Radiomic Models for Data From Independent Institutions Using Normalization [J].
Chatterjee, Avishek ;
Vallieres, Martin ;
Dohan, Anthony ;
Levesque, Ives R. ;
Ueno, Yoshiko ;
Saif, Sameh ;
Reinhold, Caroline ;
Seuntjens, Jan .
IEEE TRANSACTIONS ON RADIATION AND PLASMA MEDICAL SCIENCES, 2019, 3 (02) :210-215
[5]   Removing Batch Effects in Analysis of Expression Microarray Data: An Evaluation of Six Batch Adjustment Methods [J].
Chen, Chao ;
Grennan, Kay ;
Badner, Judith ;
Zhang, Dandan ;
Gershon, Elliot ;
Jin, Li ;
Liu, Chunyu .
PLOS ONE, 2011, 6 (02)
[6]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[7]   Ten quick tips for machine learning in computational biology [J].
Chicco, Davide .
BIODATA MINING, 2017, 10
[8]   Deep Learning-based Image Conversion of CT Reconstruction Kernels Improves Radiomics Reproducibility for Pulmonary Nodules or Masses [J].
Choe, Jooae ;
Lee, Sang Min ;
Do, Kyung-Hymn ;
Lee, Gaeun ;
Lee, June-Goo ;
Seo, Joon Beom .
RADIOLOGY, 2019, 292 (02) :365-373
[9]   Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers [J].
Deist, Timo M. ;
Dankers, Frank J. W. M. ;
Valdes, Gilmer ;
Wijsman, Robin ;
Hsu, I-Chow ;
Oberije, Cary ;
Lustberg, Tim ;
van Soest, Johan ;
Hoebers, Frank ;
Jochems, Arthur ;
El Naqa, Issam ;
Wee, Leonard ;
Morin, Olivier ;
Raleigh, David R. ;
Bots, Wouter ;
Kaanders, Johannes H. ;
Belderbos, Jose ;
Kwint, Margriet ;
Solberg, Timothy ;
Monshouwer, Rene ;
Bussink, Johan ;
Dekker, Andre ;
Lambin, Philippe .
MEDICAL PHYSICS, 2018, 45 (07) :3449-3459
[10]   Pretreatment 18F-FDG PET/CT Radiomics Predict Local Recurrence in Patients Treated with Stereotactic Body Radiotherapy for Early-Stage Non-Small Cell Lung Cancer: A Multicentric Study [J].
Dissaux, Gurvan ;
Visvikis, Dimitris ;
Da-ano, Ronrick ;
Pradier, Olivier ;
Chajon, Enrique ;
Barillot, Isabelle ;
Duverge, Loig ;
Masson, Ingrid ;
Abgral, Ronan ;
Ribeiro, Maria-Joao Santiago ;
Devillers, Anne ;
Pallardy, Amandine ;
Fleury, Vincent ;
Mahe, Marc-Andre ;
De Crevoisier, Renaud ;
Hatt, Mathieu ;
Schick, Ulrike .
JOURNAL OF NUCLEAR MEDICINE, 2020, 61 (06) :814-820