Evaluation of Deep Learning to Augment Image-Guided Radiotherapy for Head and Neck and Prostate Cancers

被引:48
作者
Oktay, Ozan [1 ]
Nanavati, Jay [1 ]
Schwaighofer, Anton [1 ]
Carter, David [1 ]
Bristow, Melissa [1 ]
Tanno, Ryutaro [1 ]
Jena, Rajesh [1 ]
Barnett, Gill [1 ]
Noble, David [2 ,3 ]
Rimmer, Yvonne [2 ]
Glocker, Ben [1 ]
O'Hara, Kenton [1 ]
Bishop, Christopher [1 ]
Alvarez-Valle, Javier [1 ]
Nori, Aditya [1 ]
机构
[1] Microsoft Res, Hlth Intelligence, 21 Stn Rd, Cambridge CB1 2FB, England
[2] Cambridge Univ Hosp NHS Fdn Trust, Dept Oncol, Cambridge, England
[3] Western Gen Hosp, Edinburgh Canc Ctr, Edinburgh, Midlothian, Scotland
关键词
INTEROBSERVER VARIABILITY; RADIATION ONCOLOGY; TARGET VOLUME; CT; DEFINITION; THERAPY;
D O I
10.1001/jamanetworkopen.2020.27426
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Question Can machine learning models achieve clinically acceptable accuracy in image segmentation tasks in radiotherapy planning and reduce overall contouring time? Findings This quality improvement study was conducted on a set of 242 head and neck and 519 pelvic computed tomography scans acquired for radiotherapy planning at 8 distinct clinical sites with heterogeneous population groups and image acquisition settings. The proposed technology achieved levels of accuracy within interexpert variability; statistical agreement was observed for 13 of 15 structures while reducing the annotation time by a mean of 93% per scan. Meaning The study findings highlight the opportunity for widespread adoption of autosegmentation models in radiotherapy workflows to reduce overall contouring and planning time. This quality improvement study explores clinically acceptable autocontouring solutions that can be integrated into existing workflows and used in different domains of radiotherapy. Importance Personalized radiotherapy planning depends on high-quality delineation of target tumors and surrounding organs at risk (OARs). This process puts additional time burdens on oncologists and introduces variability among both experts and institutions. Objective To explore clinically acceptable autocontouring solutions that can be integrated into existing workflows and used in different domains of radiotherapy. Design, Setting, and Participants This quality improvement study used a multicenter imaging data set comprising 519 pelvic and 242 head and neck computed tomography (CT) scans from 8 distinct clinical sites and patients diagnosed either with prostate or head and neck cancer. The scans were acquired as part of treatment dose planning from patients who received intensity-modulated radiation therapy between October 2013 and February 2020. Fifteen different OARs were manually annotated by expert readers and radiation oncologists. The models were trained on a subset of the data set to automatically delineate OARs and evaluated on both internal and external data sets. Data analysis was conducted October 2019 to September 2020. Main Outcomes and Measures The autocontouring solution was evaluated on external data sets, and its accuracy was quantified with volumetric agreement and surface distance measures. Models were benchmarked against expert annotations in an interobserver variability (IOV) study. Clinical utility was evaluated by measuring time spent on manual corrections and annotations from scratch. Results A total of 519 participants' (519 [100%] men; 390 [75%] aged 62-75 years) pelvic CT images and 242 participants' (184 [76%] men; 194 [80%] aged 50-73 years) head and neck CT images were included. The models achieved levels of clinical accuracy within the bounds of expert IOV for 13 of 15 structures (eg, left femur, kappa = 0.982; brainstem, kappa = 0.806) and performed consistently well across both external and internal data sets (eg, mean [SD] Dice score for left femur, internal vs external data sets: 98.52% [0.50] vs 98.04% [1.02]; P = .04). The correction time of autogenerated contours on 10 head and neck and 10 prostate scans was measured as a mean of 4.98 (95% CI, 4.44-5.52) min/scan and 3.40 (95% CI, 1.60-5.20) min/scan, respectively, to ensure clinically accepted accuracy. Manual segmentation of the head and neck took a mean 86.75 (95% CI, 75.21-92.29) min/scan for an expert reader and 73.25 (95% CI, 68.68-77.82) min/scan for a radiation oncologist. The autogenerated contours represented a 93% reduction in time. Conclusions and Relevance In this study, the models achieved levels of clinical accuracy within expert IOV while reducing manual contouring time and performing consistently well across previously unseen heterogeneous data sets. With the availability of open-source libraries and reliable performance, this creates significant opportunities for the transformation of radiation treatment planning.
引用
收藏
页数:11
相关论文
共 33 条
[1]  
Acosta O., 2014, Abdomen and Thoracic Imaging: An Engineering Clinical Perspective, P623, DOI [10.1007/978, DOI 10.1007/978-1-4614-8498-1_24]
[2]   Fully automated organ segmentation in male pelvic CT images [J].
Balagopal, Anjali ;
Kazemifar, Samaneh ;
Dan Nguyen ;
Lin, Mu-Han ;
Hannan, Raquibul ;
Owrangi, Amir ;
Jiang, Steve .
PHYSICS IN MEDICINE AND BIOLOGY, 2018, 63 (24)
[3]   Normal tissue reactions to radiotherapy: towards tailoring treatment dose by genotype [J].
Barnett, Gillian C. ;
West, Catherine M. L. ;
Dunning, Alison M. ;
Elliott, Rebecca M. ;
Coles, Charlotte E. ;
Pharoah, Paul D. P. ;
Burnet, Neil G. .
NATURE REVIEWS CANCER, 2009, 9 (02) :134-142
[4]   Artifacts in CT: Recognition and avoidance [J].
Barrett, JF ;
Keat, N .
RADIOGRAPHICS, 2004, 24 (06) :1679-1691
[5]   STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT [J].
BLAND, JM ;
ALTMAN, DG .
LANCET, 1986, 1 (8476) :307-310
[6]   CT-based delineation of organs at risk in the head and neck region: DAHANCA, EORTC, GORTEC, HKNPCSG, NCIC CTG, NCRI, NRG Oncology and TROG consensus guidelines [J].
Brouwer, Charlotte L. ;
Steenbakkers, Roel J. H. M. ;
Bourhis, Jean ;
Budach, Wilfried ;
Grau, Cai ;
Gregoire, Vincent ;
van Herk, Marcel ;
Lee, Anne ;
Maingon, Philippe ;
Nutting, Chris ;
O'Sullivan, Brian ;
Porceddu, Sandro V. ;
Rosenthal, David I. ;
Sijtsema, Nanna M. ;
Langendijk, Johannes A. .
RADIOTHERAPY AND ONCOLOGY, 2015, 117 (01) :83-90
[7]  
Cancer Imaging Archive, 2020, HEAD NECK CET
[8]   Interphysician variability in defining the planning target volume in the irradiation of prostate and seminal vesicles [J].
Cazzaniga, LF ;
Marinoni, MA ;
Bossi, A ;
Bianchi, E ;
Cagna, E ;
Cosentino, D ;
Scandolaro, L ;
Valli, M ;
Frigerio, M .
RADIOTHERAPY AND ONCOLOGY, 1998, 47 (03) :293-296
[9]   An evaluation of the variability of tumor-shape definition derived by experienced observers from CT images of supraglottic carcinomas (ACRIN protocol 6658) [J].
Cooper, Jay S. ;
Mukherji, Suresh K. ;
Toledano, Alicia Y. ;
Beldon, Clifford ;
Schmalfuss, Ilona M. ;
Amdur, Robert ;
Sailer, Scott ;
Loevner, Laurie A. ;
Kousouboris, Phil ;
Ang, K. Kian ;
Cormack, Jean ;
Sicks, JoRean .
INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2007, 67 (04) :972-975
[10]   MEASURES OF THE AMOUNT OF ECOLOGIC ASSOCIATION BETWEEN SPECIES [J].
DICE, LR .
ECOLOGY, 1945, 26 (03) :297-302