Deep Learning in Large and Multi-Site Structural Brain MR Imaging Datasets

被引:34
作者
Bento, Mariana [1 ,2 ,3 ]
Fantini, Irene [4 ]
Park, Justin [2 ,3 ,5 ]
Rittner, Leticia [4 ]
Frayne, Richard [2 ,3 ,5 ,6 ]
机构
[1] Univ Calgary, Schulich Sch Engn, Elect & Software Engn, Calgary, AB, Canada
[2] Univ Calgary, Hotchkiss Brain Inst, Calgary, AB, Canada
[3] Foothills Med Ctr, Calgary Image Proc & Anal Ctr, Calgary, AB, Canada
[4] Univ Estadual Campinas, Sch Elect & Comp Engn, Campinas, Brazil
[5] Univ Calgary, Cumming Sch Med, Radiol & Clin Neurosci, Calgary, AB, Canada
[6] Foothills Med Ctr, Seaman Family MR Res Ctr, Calgary, AB, Canada
基金
巴西圣保罗研究基金会;
关键词
multi-site datasets; deep learning; domain adaptation; data aggregation; batch effects; machine learning; MR brain imaging; WHITE-MATTER HYPERINTENSITIES; BIAS FIELD CORRECTION; ARTIFICIAL-INTELLIGENCE; DOMAIN ADAPTATION; SEGMENTATION; NETWORKS; MACHINE; FUTURE; IMAGES; DISEASE;
D O I
10.3389/fninf.2021.805669
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Large, multi-site, heterogeneous brain imaging datasets are increasingly required for the training, validation, and testing of advanced deep learning (DL)-based automated tools, including structural magnetic resonance (MR) image-based diagnostic and treatment monitoring approaches. When assembling a number of smaller datasets to form a larger dataset, understanding the underlying variability between different acquisition and processing protocols across the aggregated dataset (termed "batch effects") is critical. The presence of variation in the training dataset is important as it more closely reflects the true underlying data distribution and, thus, may enhance the overall generalizability of the tool. However, the impact of batch effects must be carefully evaluated in order to avoid undesirable effects that, for example, may reduce performance measures. Batch effects can result from many sources, including differences in acquisition equipment, imaging technique and parameters, as well as applied processing methodologies. Their impact, both beneficial and adversarial, must be considered when developing tools to ensure that their outputs are related to the proposed clinical or research question (i.e., actual disease-related or pathological changes) and are not simply due to the peculiarities of underlying batch effects in the aggregated dataset. We reviewed applications of DL in structural brain MR imaging that aggregated images from neuroimaging datasets, typically acquired at multiple sites. We examined datasets containing both healthy control participants and patients that were acquired using varying acquisition protocols. First, we discussed issues around Data Access and enumerated the key characteristics of some commonly used publicly available brain datasets. Then we reviewed methods for correcting batch effects by exploring the two main classes of approaches: Data Harmonization that uses data standardization, quality control protocols or other similar algorithms and procedures to explicitly understand and minimize unwanted batch effects; and Domain Adaptation that develops DL tools that implicitly handle the batch effects by using approaches to achieve reliable and robust results. In this narrative review, we highlighted the advantages and disadvantages of both classes of DL approaches, and described key challenges to be addressed in future studies.
引用
收藏
页数:17
相关论文
共 108 条
[1]   Unsupervised Domain Adaptation With Optimal Transport in Multi-Site Segmentation of Multiple Sclerosis Lesions From MRI Data [J].
Ackaouy, Antoine ;
Courty, Nicolas ;
Vallee, Emmanuel ;
Commowick, Olivier ;
Barillot, Christian ;
Galassi, Francesca .
FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2020, 14
[2]  
Ahmed M. N., 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149), P250, DOI 10.1109/CVPR.1999.786947
[3]  
Ajakan H., 2014, ARXIV14124446
[4]   Deep Learning for Brain MRI Segmentation: State of the Art and Future Directions [J].
Akkus, Zeynettin ;
Galimzianova, Alfiia ;
Hoogi, Assaf ;
Rubin, Daniel L. ;
Erickson, Bradley J. .
JOURNAL OF DIGITAL IMAGING, 2017, 30 (04) :449-459
[5]  
[Anonymous], 2015, PROC INT C LEARNING
[6]  
[Anonymous], 2013, WEB AGE INFORM MANAG
[7]  
Aslani S, 2020, I S BIOMED IMAGING, P781, DOI [10.1109/ISBI45749.2020.9098721, 10.1109/isbi45749.2020.9098721]
[8]   Accounting for data variability in multi-institutional distributed deep learning for medical imaging [J].
Balachandar, Niranjan ;
Chang, Ken ;
Kalpathy-Cramer, Jayashree ;
Rubin, Daniel L. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (05) :700-708
[9]   Protecting Your Patients' Interests in the Era of Big Data, Artificial Intelligence, and Predictive Analytics [J].
Balthazar, Patricia ;
Harri, Peter ;
Prater, Adam ;
Safdar, Nabile M. .
JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2018, 15 (03) :580-586
[10]  
Bento M., 2021, ANN M INT SOC MAGN R