Using generative AI to investigate medical imagery models and datasets

Cited by: 9
Authors
Lang, Oran [1]
Yaya-Stupp, Doron [1]
Traynis, Ilana [2]
Cole-Lewis, Heather [1]
Bennett, Chloe R. [3]
Lyles, Courtney R. [1,4]
Lau, Charles [1]
Irani, Michal [5]
Semturs, Christopher [1]
Webster, Dale R. [1]
Corrado, Greg S. [1]
Hassidim, Avinatan [1]
Matias, Yossi [1]
Liu, Yun [1]
Hammel, Naama [1]
Babenko, Boris [1]
Affiliations
[1] Google, Mountain View, CA 94043 USA
[2] Google Via Adv Clin, Deerfield, IL USA
[3] Google Via Pro Unltd, Folsom, CA USA
[4] Univ Calif San Francisco, Dept Med, San Francisco, CA USA
[5] Weizmann Inst Sci, Rehovot, Israel
Source
EBIOMEDICINE | 2024 / Volume 102
Keywords
Artificial intelligence; Medical imagery; Explainability; Interpretability; Deep learning; Generative AI; RACIAL-DIFFERENCES; MEIBOMIAN GLANDS; RACE; HEALTH; PHOTOGRAPHS; DISPARITIES; BIOLOGY; GENDER; AGE
DOI
10.1016/j.ebiom.2024.105075
Chinese Library Classification number
R5 [Internal Medicine];
Discipline classification code
1002; 100201;
Abstract
Background: AI models have shown promise in performing many medical imaging tasks. However, our ability to explain what signals these models have learned is severely lacking. Explanations are needed in order to increase the trust of doctors in AI-based models, especially in domains where AI prediction capabilities surpass those of humans. Moreover, such explanations could enable novel scientific discovery by uncovering signals in the data that aren't yet known to experts.
Methods: In this paper, we present a workflow for generating hypotheses to understand which visual signals in images are correlated with a classification model's predictions for a given task. This approach leverages an automatic visual explanation algorithm followed by interdisciplinary expert review. We propose the following 4 steps: (i) Train a classifier to perform a given task to assess whether the imagery indeed contains signals relevant to the task; (ii) Train a StyleGAN-based image generator with an architecture that enables guidance by the classifier ("StylEx"); (iii) Automatically detect, extract, and visualize the top visual attributes that the classifier is sensitive towards. For visualization, we independently modify each of these attributes to generate counterfactual visualizations for a set of images (i.e., what the image would look like with the attribute increased or decreased); (iv) Formulate hypotheses for the underlying mechanisms, to stimulate future research. Specifically, present the discovered attributes and corresponding counterfactual visualizations to an interdisciplinary panel of experts so that hypotheses can account for social and structural determinants of health (e.g., whether the attributes correspond to known patho-physiological or socio-cultural phenomena, or could be novel discoveries).
Findings: To demonstrate the broad applicability of our approach, we present results on eight prediction tasks across three medical imaging modalities: retinal fundus photographs, external eye photographs, and chest radiographs. We showcase examples where many of the automatically learned attributes clearly capture clinically known features (e.g., types of cataract, enlarged heart), and demonstrate automatically learned confounders that arise from factors beyond physiological mechanisms (e.g., chest X-ray underexposure is correlated with the classifier predicting abnormality, and eye makeup is correlated with the classifier predicting low hemoglobin levels). We further show that our method reveals a number of physiologically plausible attributes that, based on the literature, were previously unknown (e.g., differences in the fundus associated with self-reported sex).
Interpretation: Our approach enables hypothesis generation via attribute visualizations and has the potential to enable researchers to better understand, better assess, and extract new knowledge from AI-based models, as well as debug and design better datasets. Importantly, although the framework is not designed to infer causality, we highlight that the attributes it generates can capture phenomena beyond physiology or pathophysiology, reflecting the real-world nature of healthcare delivery and socio-cultural factors, and hence interdisciplinary perspectives are critical in these investigations. Finally, we will release code to help researchers train their own StylEx models and analyze their predictive tasks of interest, and use the methodology presented in this paper for responsible interpretation of the revealed attributes.
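Illustrative sketch (not from the paper or its released code): one simple way to read step (iii) of the Methods is to perturb each generator attribute on its own, hold the others fixed, and rank attributes by the average change in the classifier's output. The Python below shows that idea on toy data. The names `rank_attributes`, `counterfactual_pair`, `toy_generate`, `toy_classify`, and `WEIGHTS` are hypothetical; the "generator" here simply returns its input and the "classifier" is a fixed logistic function, standing in for a trained StylEx generator and a trained image classifier.

```python
import numpy as np

def rank_attributes(generate, classify, styles, step=1.0, top_k=3):
    """Rank style coordinates by the mean classifier shift each one causes."""
    base = classify(generate(styles))
    effects = np.zeros(styles.shape[1])
    for j in range(styles.shape[1]):
        shifted = styles.copy()
        shifted[:, j] += step  # change one attribute, hold the others fixed
        effects[j] = np.mean(classify(generate(shifted)) - base)
    order = np.argsort(-np.abs(effects))[:top_k]  # largest absolute effect first
    return order, effects

def counterfactual_pair(generate, style, j, step=1.0):
    """Return images with attribute j decreased and increased for one sample."""
    low, high = style.copy(), style.copy()
    low[j] -= step
    high[j] += step
    return generate(low[None, :])[0], generate(high[None, :])[0]

# Toy stand-ins (assumptions for illustration, not the paper's models).
WEIGHTS = np.array([0.0, 2.0, 0.0, -1.0, 0.0, 0.5])

def toy_generate(styles):
    return styles.copy()  # the "image" is just the style vector itself

def toy_classify(images):
    return 1.0 / (1.0 + np.exp(-(images @ WEIGHTS)))  # logistic score in (0, 1)

rng = np.random.default_rng(0)
styles = rng.normal(size=(200, 6))
order, effects = rank_attributes(toy_generate, toy_classify, styles)
low_img, high_img = counterfactual_pair(toy_generate, styles[0], order[0])
print("top attributes:", order.tolist())
```

In a real setting, the pair returned by `counterfactual_pair` would be rendered side by side for the expert panel described in step (iv); the sign of each entry in `effects` indicates whether increasing the attribute pushes the prediction up or down.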
Pages: 14