Investigating the potential of deep learning for patient-specific quality assurance of salivary gland contours using EORTC-1219-DAHANCA-29 clinical trial data

被引：9

作者：

Nijhuis, Hanne ^{[1
]}

van Rooij, Ward ^{[1
]}

Gregoire, Vincent ^{[2
]}

Overgaard, Jens ^{[3
]}

Slotman, Berend J. ^{[1
]}

Verbakel, Wilko F. ^{[1
]}

Dahele, Max ^{[1
]}

机构：

[1] Vrije Univ Amsterdam, Amsterdam UMC, Dept Radiat Oncol, Amsterdam, Netherlands

[2] Ctr Leon Berard, Dept Radiat Oncol, Lyon, France

[3] Aarhus Univ, Dept Expt Clin Oncol, Dept Clin Med, Aarhus N, Denmark

来源：

ACTA ONCOLOGICA | 2021年 / 60卷 / 05期

关键词：

Deep learning; Radiotherapy; Clinical trial; Quality assurance; Segmentation; Salivary glands; RADIATION-THERAPY; RADIOTHERAPY;

D O I：

10.1080/0284186X.2020.1863463

中图分类号：

R73 [肿瘤学];

学科分类号：

100214 ;

摘要：

Introduction: Manual quality assurance (QA) of radiotherapy contours for clinical trials is time and labor intensive and subject to inter-observer variability. Therefore, we investigated whether deep-learning (DL) can provide an automated solution to salivary gland contour QA. Material and methods: DL-models were trained to generate contours for parotid (PG) and submandibular glands (SMG). Sorensen-Dice coefficient (SDC) and Hausdorff distance (HD) were used to assess agreement between DL and clinical contours and thresholds were defined to highlight cases as potentially sub-optimal. 3 types of deliberate errors (expansion, contraction and displacement) were gradually applied to a test set, to confirm that SDC and HD were suitable QA metrics. DL-based QA was performed on 62 patients from the EORTC-1219-DAHANCA-29 trial. All highlighted contours were visually inspected. Results: Increasing the magnitude of all 3 types of errors resulted in progressively severe deterioration/increase in average SDC/HD. 19/124 clinical PG contours were highlighted as potentially sub-optimal, of which 5 (26%) were actually deemed clinically sub-optimal. 2/19 non-highlighted contours were false negatives (11%). 15/69 clinical SMG contours were highlighted, with 7 (47%) deemed clinically sub-optimal and 2/15 non-highlighted contours were false negatives (13%). For most incorrectly highlighted contours causes for low agreement could be identified. Conclusion: Automated DL-based contour QA is feasible but some visual inspection remains essential. The substantial number of false positives were caused by sub-optimal performance of the DL-model. Improvements to the model will increase the extent of automation and reliability, facilitating the adoption of DL-based contour QA in clinical trials and routine practice.

引用

页码：575 / 581

页数：7

共 3 条

[1] Analysis of EORTC-1219-DAHANCA-29 trial plans demonstrates the potential of knowledge-based planning to provide patient-specific treatment plan quality assurance
Tol, Jim P.
Dahele, Max
Gregoire, Vincent
Overgaard, Jens
Slotman, Ben J.
Verbakel, Wilko F. A. R.
RADIOTHERAPY AND ONCOLOGY, 2019, 130 : 75 - 81
[2] Deep Hybrid Learning Prediction of Patient-Specific Quality Assurance in Radiotherapy: Implementation in Clinical Routine
Moreau, Noemie
Bonnor, Laurine
Jaudet, Cyril
Lechippey, Laetitia
Falzone, Nadia
Batalla, Alain
Bertaut, Cindy
Corroyer-Dulmont, Aurelien
DIAGNOSTICS, 2023, 13 (05)
[3] Virtual pretreatment patient-specific quality assurance of volumetric modulated arc therapy using deep learning
Yoganathan, S. A.
Ahmed, Sharib
Paloor, Satheesh
Torfeh, Tarraf
Aouadi, Souha
Al-Hammadi, Noora
Hammoud, Rabih
MEDICAL PHYSICS, 2023, 50 (12) : 7891 - 7903

← 1 →