Benchmarking the generalizability of brain age models: Challenges posed by scanner variance and prediction bias

被引:16
作者
Jirsaraie, Robert J. [1 ]
Kaufmann, Tobias [2 ,3 ,4 ]
Bashyam, Vishnu [5 ]
Erus, Guray [5 ]
Luby, Joan L. [6 ]
Westlye, Lars T. [3 ,4 ,7 ,8 ]
Davatzikos, Christos [5 ]
Barch, Deanna M. [9 ]
Sotiras, Aristeidis [10 ]
机构
[1] Washington Univ, Div Computat & Data Sci, One Brookings Dr,CB 1125, St Louis, MO 63130 USA
[2] Univ Tubingen, Dept Psychiat & Psychotherapy, Tubingen Ctr Mental Hlth, Tubingen, Germany
[3] Univ Oslo, Oslo Univ Hosp, Norwegian Ctr Mental Disorders Res, Div Mental Hlth & Addict, Oslo, Norway
[4] Univ Oslo, Inst Clin Med, Oslo, Norway
[5] Univ Penn, Ctr Biomed Image Comp & Analyt, Dept Radiol, Philadelphia, PA 19104 USA
[6] Washington Univ, Dept Psychiat, St Louis, MO 63130 USA
[7] Univ Oslo, Dept Psychol, Oslo, Norway
[8] Univ Oslo, KG Jebsen Ctr Neurodev Disorders, Oslo, Norway
[9] Washington Univ, Dept Psychol & Brain Sci, St Louis, MO 63130 USA
[10] Washington Univ, Dept Radiol, Sch Med St Louis, St Louis, MO 63130 USA
基金
美国国家科学基金会;
关键词
brain age; brain development; computational neuroscience; generalizability; machine learning; PATTERNS;
D O I
10.1002/hbm.26144
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Machine learning has been increasingly applied to neuroimaging data to predict age, deriving a personalized biomarker with potential clinical applications. The scientific and clinical value of these models depends on their applicability to independently acquired scans from diverse sources. Accordingly, we evaluated the generalizability of two brain age models that were trained across the lifespan by applying them to three distinct early-life samples with participants aged 8-22 years. These models were chosen based on the size and diversity of their training data, but they also differed greatly in their processing methods and predictive algorithms. Specifically, one brain age model was built by applying gradient tree boosting (GTB) to extracted features of cortical thickness, surface area, and brain volume. The other model applied a 2D convolutional neural network (DBN) to minimally preprocessed slices of T1-weighted scans. Additional model variants were created to understand how generalizability changed when each model was trained with data that became more similar to the test samples in terms of age and acquisition protocols. Our results illustrated numerous trade-offs. The GTB predictions were relatively more accurate overall and yielded more reliable predictions when applied to lower quality scans. In contrast, the DBN displayed the most utility in detecting associations between brain age gaps and cognitive functioning. Broadly speaking, the largest limitations affecting generalizability were acquisition protocol differences and biased brain age estimates. If such confounds could eventually be removed without post-hoc corrections, brain age predictions may have greater utility as personalized biomarkers of healthy aging.
引用
收藏
页码:1118 / 1128
页数:11
相关论文
共 44 条
  • [1] Deep Learning and Multiplex Networks for Accurate Modeling of Brain Age
    Amoroso, Nicola
    La Rocca, Marianna
    Bellantuono, Loredana
    Diacono, Domenico
    Fanizzi, Annarita
    Lella, Eufemia
    Lombardi, Angela
    Maggipinto, Tommaso
    Monaco, Alfonso
    Tangaro, Sabina
    Bellotti, Roberto
    [J]. FRONTIERS IN AGING NEUROSCIENCE, 2019, 11
  • [2] MRI signatures of brain age and disease over the lifespan based on a deep brain network and 14 468 individuals worldwide
    Bashyam, Vishnu M.
    Erus, Guray
    Doshi, Jimit
    Habes, Mohamad
    Nasralah, Ilya
    Truelove-Hill, Monica
    Srinivasan, Dhivya
    Mamourian, Liz
    Pomponio, Raymond
    Fan, Yong
    Launer, Lenore J.
    Masters, Colin L.
    Maruff, Paul
    Zhuo, Chuanjun
    Volzke, Henry
    Johnson, Sterling C.
    Fripp, Jurgen
    Koutsouleris, Nikolaos
    Satterthwaite, Theodore D.
    Wolf, Daniel
    Gur, Raquel E.
    Gur, Ruben C.
    Morris, John
    Albert, Marilyn S.
    Grabe, Hans J.
    Resnick, Susan
    Bryan, R. Nick
    Wolk, David A.
    Shou, Haochang
    Davatzikos, Christos
    [J]. BRAIN, 2020, 143 : 2312 - 2324
  • [3] Neuroanatomical Assessment of Biological Maturity
    Brown, Timothy T.
    Kuperman, Joshua M.
    Chung, Yoonho
    Erhart, Matthew
    McCabe, Connor
    Hagler, Donald J., Jr.
    Venkatraman, Vijay K.
    Akshoomoff, Natacha
    Amaral, David G.
    Bloss, Cinnamon S.
    Casey, B. J.
    Chang, Linda
    Ernst, Thomas M.
    Frazier, Jean A.
    Gruen, Jeffrey R.
    Kaufmann, Walter E.
    Kenet, Tal
    Kennedy, David N.
    Murray, Sarah S.
    Sowell, Elizabeth R.
    Jernigan, Terry L.
    Dale, Anders M.
    [J]. CURRENT BIOLOGY, 2012, 22 (18) : 1693 - 1698
  • [4] Pitfalls in brain age analyses
    Butler, Ellyn R.
    Chen, Andrew
    Ramadan, Rabie
    Le, Trang T.
    Ruparel, Kosha
    Moore, Tyler M.
    Satterthwaite, Theodore D.
    Zhang, Fengqing
    Shou, Haochang
    Gur, Ruben C.
    Nichols, Thomas E.
    Shinohara, Russell T.
    [J]. HUMAN BRAIN MAPPING, 2021, 42 (13) : 4092 - 4101
  • [5] Demographically Corrected Normative Standards for the English Version of the NIH Toolbox Cognition Battery
    Casaletto, Kaitlin B.
    Umlauf, Anya
    Beaumont, Jennifer
    Gershon, Richard
    Slotkin, Jerry
    Akshoomoff, Natacha
    Heaton, Robert K.
    [J]. JOURNAL OF THE INTERNATIONAL NEUROPSYCHOLOGICAL SOCIETY, 2015, 21 (05) : 378 - 391
  • [6] Generalization of diffusion magnetic resonance imaging ?based brain age prediction model through transfer learning
    Chen, Chang-Le
    Hsu, Yung-Chin
    Yang, Li-Ying
    Tung, Yu-Hung
    Luo, Wen-Bin
    Liu, Chih-Min
    Hwang, Tzung-Jeng
    Hwu, Hai-Gwo
    Tseng, Wen-Yih Isaac
    [J]. NEUROIMAGE, 2020, 217
  • [7] Predicting Age Using Neuroimaging: Innovative Brain Ageing Biomarkers
    Cole, James H.
    Franke, Katja
    [J]. TRENDS IN NEUROSCIENCES, 2017, 40 (12) : 681 - 690
  • [8] Cortical surface-based analysis - I. Segmentation and surface reconstruction
    Dale, AM
    Fischl, B
    Sereno, MI
    [J]. NEUROIMAGE, 1999, 9 (02) : 179 - 194
  • [9] Mind the gap: Performance metric evaluation in brain-age prediction
    de Lange, Ann-Marie G.
    Anaturk, Melis
    Rokicki, Jaroslav
    Han, Laura K. M.
    Franke, Katja
    Alnaes, Dag
    Ebmeier, Klaus P.
    Draganski, Bogdan
    Kaufmann, Tobias
    Westlye, Lars T.
    Hahn, Tim
    Cole, James H.
    [J]. HUMAN BRAIN MAPPING, 2022, 43 (10) : 3113 - 3129
  • [10] Multi-Atlas Skull-Stripping
    Doshi, Jimit
    Erus, Guray
    Ou, Yangming
    Gaonkar, Bilwaj
    Davatzikos, Christos
    [J]. ACADEMIC RADIOLOGY, 2013, 20 (12) : 1566 - 1576