OpenBHB: a Large-Scale Multi-Site Brain MRI Data-set for Age Prediction and Debiasing

被引:22
作者
Dufumier, Benoit [1 ]
Grigis, Antoine [1 ]
Victor, Julie [1 ]
Ambroise, Corentin [1 ]
Frouin, Vincent [1 ]
Duchesnay, Edouard [1 ]
机构
[1] Univ Paris Saclay, NeuroSpin, CEA Saclay, Saclay, France
关键词
SURFACE-BASED ANALYSIS; SCHIZOPHRENIA; DISEASE;
D O I
10.1016/j.neuroimage.2022.119637
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Prediction of chronological age from neuroimaging in the healthy population is an important issue because the deviations from normal brain age may highlight abnormal trajectories towards brain disorders. As a first step, ML models have emerged to predict chronological age from brain MRI, as a proxy measure of biological age. However, there is currently no consensus w.r.t which Machine Learning (ML) model is best suited for this task, largely because of a lack of public benchmark. Furthermore, new large emerging population neuroimaging datasets are often biased by the acquisition center images are coming from. This bias heavily deteriorates models generalization capacities, especially for Deep Learning (DL) algorithms that are known to overfit rapidly on the simplest features (known as simplicity bias). Here we propose a new public benchmarking resource, namely Open Big Healthy Brains (OpenBHB), along with a challenge for both brain age prediction and site-effect removal through a representation learning framework. OpenBHB is large-scale, gathering > 5K 3D T1 brain MRI from Healthy Controls (HC) and highly multi-sites, aggregating > 60 centers worldwide and 10 studies. OpenBHB is expected to grow both in terms of available modalities and number of subjects. All OpenBHB datasets are uniformly preprocessed, including quality check, with container technologies that consist in: 3D Voxel-Based Morphometry maps (VBM from CAT12), quasi-raw (simple linear alignment of images), and Surface-Based Morphometry indices (SBM, from FreeSurfer). The OpenBHB challenge is permanent and we provide all tools, materials and tutorials for participants to easily submit and benchmark their model against each other on a public leaderboard.
引用
收藏
页数:14
相关论文
共 74 条
[1]   Deep learning encodes robust discriminative neuroimaging representations to outperform standard machine learning [J].
Abrol, Anees ;
Fu, Zening ;
Salman, Mustafa ;
Silva, Rogers ;
Du, Yuhui ;
Plis, Sergey ;
Calhoun, Vince .
NATURE COMMUNICATIONS, 2021, 12 (01)
[2]  
Alain G, 2018, Arxiv, DOI arXiv:1610.01644
[3]  
Avants BB., 2009, Insight j, V2, P1
[4]   A mind-brain-body dataset of MRI, EEG, cognition, emotion, and peripheral physiology in young and old adults [J].
Babayan, Anahit ;
Erbey, Miray ;
Kumral, Deniz ;
Reinelt, Janis D. ;
Reiter, Andrea M. F. ;
Roebbig, Josefin ;
Schaare, H. Lina ;
Uhlig, Marie ;
Anwander, Alfred ;
Bazin, Pierre-Louis ;
Horstmann, Annette ;
Lampe, Leonie ;
Nikulin, Vadim V. ;
Okon-Singer, Hadas ;
Preusser, Sven ;
Pampel, Andre ;
Rohr, Christiane S. ;
Sacher, Julia ;
Thoene-Otto, Angelika ;
Trapp, Sabrina ;
Nierhaus, Till ;
Altmann, Denise ;
Arelin, Katrin ;
Bloechl, Maria ;
Bongartz, Edith ;
Breig, Patric ;
Cesnaite, Elena ;
Chen, Sufang ;
Cozatl, Roberto ;
Czerwonatis, Saskia ;
Dambrauskaite, Gabriele ;
Dreyer, Maria ;
Enders, Jessica ;
Engelhardt, Melina ;
Fischer, Marie Michele ;
Forschack, Norman ;
Golchert, Johannes ;
Golz, Laura ;
Guran, C. Alexandrina ;
Hedrich, Susanna ;
Hentschel, Nicole ;
Hoffmann, Daria I. ;
Huntenburg, Julia M. ;
Jost, Rebecca ;
Kosatschek, Anna ;
Kunzendorf, Stella ;
Lammers, Hannah ;
Lauckner, Mark E. ;
Mahjoory, Keyvan ;
Kanaan, Ahmad S. .
SCIENTIFIC DATA, 2019, 6 (1)
[5]  
Bahng H, 2020, PR MACH LEARN RES, V119
[6]   Disruption of Cortical Association Networks in Schizophrenia and Psychotic Bipolar Disorder [J].
Baker, Justin T. ;
Holmes, Avram J. ;
Masters, Grace A. ;
Yeo, B. T. Thomas ;
Krienen, Fenna ;
Buckner, Randy L. ;
Oenguer, Dost .
JAMA PSYCHIATRY, 2014, 71 (02) :109-118
[7]  
Barbano C.A., 2021, P IEEECVF INT C COMP, P3806
[8]  
Bashyam V. M., 2020, MED IMAGE HARMONIZAT
[9]   MRI signatures of brain age and disease over the lifespan based on a deep brain network and 14 468 individuals worldwide [J].
Bashyam, Vishnu M. ;
Erus, Guray ;
Doshi, Jimit ;
Habes, Mohamad ;
Nasralah, Ilya ;
Truelove-Hill, Monica ;
Srinivasan, Dhivya ;
Mamourian, Liz ;
Pomponio, Raymond ;
Fan, Yong ;
Launer, Lenore J. ;
Masters, Colin L. ;
Maruff, Paul ;
Zhuo, Chuanjun ;
Volzke, Henry ;
Johnson, Sterling C. ;
Fripp, Jurgen ;
Koutsouleris, Nikolaos ;
Satterthwaite, Theodore D. ;
Wolf, Daniel ;
Gur, Raquel E. ;
Gur, Ruben C. ;
Morris, John ;
Albert, Marilyn S. ;
Grabe, Hans J. ;
Resnick, Susan ;
Bryan, R. Nick ;
Wolk, David A. ;
Shou, Haochang ;
Davatzikos, Christos .
BRAIN, 2020, 143 :2312-2324
[10]  
Buckner RL., 2014, NEUROINFORMATICS RES