REliability of consensus-based segMentatIoN in raDiomic feature reproducibility (REMIND): A word of caution

被引:7
作者
Kocak, Burak [1 ]
Yardimci, Aytul Hande [1 ]
Nazli, Mehmet Ali [1 ]
Yuzkan, Sabahattin [1 ]
Mutlu, Samet [1 ]
Guzelbey, Tevfik [1 ]
Ozdemir, Merve Sam [1 ]
Akin, Meliha [1 ]
Yucel, Serap [2 ]
Bulut, Elif [1 ]
Bayrak, Osman Nuri [1 ]
Okumus, Ahmet Arda [1 ]
机构
[1] Univ Hlth Sci, Basaksehir Cam & Sakura City Hosp, Dept Radiol, TR-34480 Istanbul, Turkiye
[2] Baskent Univ, Istanbul Hosp, Dept Radiol, Istanbul, Turkiye
关键词
Radiomics; Reproducibility; Segmentation; Magnetic resonance imaging; Computed tomography; CT TEXTURE ANALYSIS; VALIDATION; IMAGES;
D O I
10.1016/j.ejrad.2023.110893
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Objective: To evaluate the reliability of consensus-based segmentation in terms of reproducibility of radiomic features.Methods: In this retrospective study, three tumor data sets were investigated: breast cancer (n = 30), renal cell carcinoma (n = 30), and pituitary macroadenoma (n = 30). MRI was utilized for breast and pituitary data sets, while CT was used for renal data set. 12 readers participated in the segmentation process. Consensus segmen-tation was created by making corrections on a previous region or volume of interest. Four experiments were designed to evaluate the reproducibility of radiomic features. Reliability was assessed with intraclass correlation coefficient (ICC) with two cut-off values: 0.75 and 0.9.Results: Considering the lower bound of the 95% confidence interval and the ICC threshold of 0.90, at least 61% of the radiomic features were not reproducible in the inter-consensus analysis. In the susceptibility experiment, at least half (54%) became non-reproducible when the first reader is replaced with a different reader. In the intra-consensus analysis, at least about one-third (32%) were non-reproducible when the same second reader segmented the image over the same first reader two weeks later. Compared to inter-reader analysis based on independent single readers, the inter-consensus analysis did not statistically significantly improve the rates of reproducible features in all data sets and analyses.Conclusions: Despite the positive connotation of the word "consensus", it is essential to REMIND that consensus-based segmentation has significant reproducibility issues. Therefore, the usage of consensus-based segmentation alone should be avoided unless a reliability analysis is performed, even if it is not practical in clinical settings.
引用
收藏
页数:11
相关论文
共 30 条
  • [1] Development and validation of a MRI-based radiomics signature for prediction of KRAS mutation in rectal cancer
    Cui, Yanfen
    Liu, Huanhuan
    Ren, Jialiang
    Du, Xiaosong
    Xin, Lei
    Li, Dandan
    Yang, Xiaotang
    Wang, Dengbin
    [J]. EUROPEAN RADIOLOGY, 2020, 30 (04) : 1948 - 1958
  • [2] Standardised lesion segmentation for imaging biomarker quantitation: a consensus recommendation from ESR and EORTC
    DeSouza, Nandita M.
    van Der Lugt, Aad
    Deroose, Christophe M.
    Alberich-Bayarri, Angel
    Bidaut, Luc
    Fournier, Laure
    Costaridou, Lena
    Oprea-Lager, Daniela E.
    Kotter, Elmar
    Smits, Marion
    Mayerhoefer, Marius E.
    Boellaard, Ronald
    Caroli, Anna
    De Geus-Oei, Lioe-Fee
    Kunz, Wolfgang G.
    Oei, Edwin H.
    Lecouvet, Frederic
    Franca, Manuela
    Loewe, Christian
    Lopci, Egesta
    Caramella, Caroline
    Persson, Anders
    Golay, Xavier
    Dewey, Marc
    O'Connor, James P. B.
    DeGraaf, Pim
    Gatidis, Sergios
    Zahlmann, Gudrun
    [J]. INSIGHTS INTO IMAGING, 2022, 13 (01)
  • [3] Radiomics: Images Are More than Pictures, They Are Data
    Gillies, Robert J.
    Kinahan, Paul E.
    Hricak, Hedvig
    [J]. RADIOLOGY, 2016, 278 (02) : 563 - 577
  • [4] Repeatability and reproducibility study of radiomic features on a phantom and human cohort
    Jha, A. K.
    Mithun, S.
    Jaiswar, V.
    Sherkhane, U. B.
    Purandare, N. C.
    Prabhash, K.
    Rangarajan, V.
    Dekker, A.
    Wee, L.
    Traverso, A.
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [5] Machine Learning-Based Multiparametric Magnetic Resonance Imaging Radiomics for Prediction of H3K27M Mutation in Midline Gliomas
    Kandemirli, Sedat Giray
    Kocak, Burak
    Naganawa, Shotaro
    Ozturk, Kerem
    Yip, Stephen S. F.
    Chopra, Saurav
    Rivetti, Luciano
    Aldine, Amro Saad
    Jones, Karra
    Cayci, Zuzan
    Moritani, Toshio
    Sato, Takashi Shawn
    [J]. WORLD NEUROSURGERY, 2021, 151 : E78 - E85
  • [6] Reliability of Single-Slice-Based 2D CT Texture Analysis of Renal Masses: Influence of Intra- and Interobserver Manual Segmentation Variability on Radiomic Feature Reproducibility
    Kocak, Burak
    Durmaz, Emine Sebnem
    Kaya, Ozlem Korkmaz
    Ates, Ece
    Kilickesmez, Ozgur
    [J]. AMERICAN JOURNAL OF ROENTGENOLOGY, 2019, 213 (02) : 377 - 383
  • [7] Influence of segmentation margin on machine learning-based high-dimensional quantitative CT texture analysis: a reproducibility study on renal clear cell carcinomas
    Kocak, Burak
    Ates, Ece
    Durmaz, Emine Sebnem
    Ulusan, Melis Baykara
    Kilickesmez, Ozgur
    [J]. EUROPEAN RADIOLOGY, 2019, 29 (09) : 4765 - 4775
  • [8] A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research
    Koo, Terry K.
    Li, Mae Y.
    [J]. JOURNAL OF CHIROPRACTIC MEDICINE, 2016, 15 (02) : 155 - 163
  • [9] Kumar V., Magnetic Resonance Imaging
  • [10] Radiomics: the bridge between medical imaging and personalized medicine
    Lambin, Philippe
    Leijenaar, Ralph T. H.
    Deist, Timo M.
    Peerlings, Jurgen
    de Jong, Evelyn E. C.
    van Timmeren, Janita
    Sanduleanu, Sebastian
    Larue, Ruben T. H. M.
    Even, Aniek J. G.
    Jochems, Arthur
    van Wijk, Yvonka
    Woodruff, Henry
    van Soest, Johan
    Lustberg, Tim
    Roelofs, Erik
    van Elmpt, Wouter
    Dekker, Andre
    Mottaghy, Felix M.
    Wildberger, Joachim E.
    Walsh, Sean
    [J]. NATURE REVIEWS CLINICAL ONCOLOGY, 2017, 14 (12) : 749 - 762