Using generative AI to investigate medical imagery models and datasets

被引：9

作者：

Lang, Oran ^{[1
]}

Yaya-Stupp, Doron ^{[1
]}

Traynis, Ilana ^{[2
]}

Cole-Lewis, Heather ^{[1
]}

Bennett, Chloe R. ^{[3
]}

Lyles, Courtney R. ^{[1
,4
]}

Lau, Charles ^{[1
]}

Irani, Michal ^{[5
]}

Semturs, Christopher ^{[1
]}

Webster, Dale R. ^{[1
]}

Corrado, Greg S. ^{[1
]}

Hassidim, Avinatan ^{[1
]}

Matias, Yossi ^{[1
]}

Liu, Yun ^{[1
]}

Hammel, Naama ^{[1
]}

Babenko, Boris ^{[1
]}

机构：

[1] Google, Mountain View, CA 94043 USA

[2] Google Via Adv Clin, Deerfield, IL USA

[3] Google Via Pro Unltd, Folsom, CA USA

[4] Univ Calif San Francisco, Dept Med, San Francisco, CA USA

[5] Weizmann Inst Sci, Rehovot, Israel

来源：

EBIOMEDICINE | 2024年 / 102卷

关键词：

fi cial intelligence; Medical imagery; Explainability; Interpretability; Deep learning; Generative AI; RACIAL-DIFFERENCES; MEIBOMIAN GLANDS; RACE; HEALTH; PHOTOGRAPHS; DISPARITIES; BIOLOGY; GENDER; AGE;

D O I：

10.1016/j.ebiom.2024.105075

中图分类号：

R5 [内科学];

学科分类号：

1002 ; 100201 ;

摘要：

Background AI models have shown promise in performing many medical imaging tasks. However, our ability to explain what signals these models have learned is severely lacking. Explanations are needed in order to increase the trust of doctors in AI -based models, especially in domains where AI prediction capabilities surpass those of humans. Moreover, such explanations could enable novel scienti fi c discovery by uncovering signals in the data that aren ' t yet known to experts. Methods In this paper, we present a work fl ow for generating hypotheses to understand which visual signals in images are correlated with a classi fi cation model ' s predictions for a given task. This approach leverages an automatic visual explanation algorithm followed by interdisciplinary expert review. We propose the following 4 steps: (i) Train a classi fi er to perform a given task to assess whether the imagery indeed contains signals relevant to the task; (ii) Train a StyleGAN-based image generator with an architecture that enables guidance by the classi fi er ( " StylEx " ); (iii) Automatically detect, extract, and visualize the top visual attributes that the classi fi er is sensitive towards. For visualization, we independently modify each of these attributes to generate counterfactual visualizations for a set of images (i.e., what the image would look like with the attribute increased or decreased); (iv) Formulate hypotheses for the underlying mechanisms, to stimulate future research. Speci fi cally, present the discovered attributes and corresponding counterfactual visualizations to an interdisciplinary panel of experts so that hypotheses can account for social and structural determinants of health (e.g., whether the attributes correspond to known patho-physiological or socio-cultural phenomena, or could be novel discoveries). Findings To demonstrate the broad applicability of our approach, we present results on eight prediction tasks across three medical imaging modalities - retinal fundus photographs, external eye photographs, and chest radiographs. We showcase examples where many of the automatically-learned attributes clearly capture clinically known features (e.g., types of cataract, enlarged heart), and demonstrate automatically-learned confounders that arise from factors beyond physiological mechanisms (e.g., chest X-ray underexposure is correlated with the classi fi er predicting abnormality, and eye makeup is correlated with the classi fi er predicting low hemoglobin levels). We further show that our method reveals a number of physiologically plausible, previously-unknown attributes based on the literature (e.g., differences in the fundus associated with self-reported sex, which were previously unknown). Interpretation Our approach enables hypotheses generation via attribute visualizations and has the potential to enable researchers to better understand, improve their assessment, and extract new knowledge from AI -based models, as well as debug and design better datasets. Though not designed to infer causality, importantly, we highlight that attributes generated by our framework can capture phenomena beyond physiology or pathophysiology, re fl ecting the real world nature of healthcare delivery and socio-cultural factors, and hence interdisciplinary perspectives are critical in these investigations. Finally, we will release code to help researchers train their own StylEx models and analyze their predictive tasks of interest, and use the methodology presented in this paper for responsible interpretation of the revealed attributes.

引用

页数：14

共 89 条

[1] [Anonymous], 2010, A conceptual framework for action on the social determinants of health, P76
[2] [Anonymous], 2020, New AMA policies recognize race as a social, not biological, construct
[3] A deep learning model for novel systemic biomarkers in photographs of the external eye: a retrospective study
Babenko, Boris
Traynis, Ilana
Chen, Christina
Singh, Preeti
Uddin, Akib
Cuadros, Jorge
Daskivich, Lauren P.
Maa, April Y.
Kim, Ramasamy
Kang, Eugene Yu-Chuan
Matias, Yossi
Corrado, Greg S.
Peng, Lily
Webster, Dale R.
Semturs, Christopher
Krause, Jonathan
Varadarajan, Avinash V.
Hammel, Naama
Liu, Yun
[J]. LANCET DIGITAL HEALTH, 2023, 5 (05): : E257 - E264
[4] Detection of signs of disease in external photographs of the eyes via deep learning
Babenko, Boris
Mitani, Akinori
Traynis, Ilana
Kitade, Naho
Singh, Preeti
Maa, April Y.
Cuadros, Jorge
Corrado, Greg S.
Peng, Lily
Webster, Dale R.
Varadarajan, Avinash
Hammel, Naama
Liu, Yun
[J]. NATURE BIOMEDICAL ENGINEERING, 2022, 6 (12) : 1370 - +
[5] Structural racism and health inequities in the USA: evidence and interventions
Bailey, Zinzi D.
Krieger, Nancy
Agenor, Madina
Graves, Jasmine
Linos, Natalia
Bassett, Mary T.
[J]. LANCET, 2017, 389 (10077) : 1453 - 1463
[6] Evaluation of machine learning methodology for the prediction of healthcare resource utilization and healthcare costs in patients with critical limb ischemia-is preventive and personalized approach on the horizon?
Berger, Jeffrey S.
Haskell, Lloyd
Ting, Windsor
Lurie, Fedor
Chang, Shun-Chiao
Mueller, Luke A.
Elder, Kenneth
Rich, Kelly
Crivera, Concetta
Schein, Jeffrey R.
Alas, Veronica
[J]. EPMA JOURNAL, 2020, 11 (01) : 53 - 64
[7] Beutel A, 2017, Arxiv, DOI arXiv:1707.00075
[8] Boxt L, 2009, Cardiac imaging: the requisites
[9] Braun Lundy, 2015, Can J Respir Ther, V51, P99
[10] Abandon "Race." Focus on Racism
Braveman, Paula
Dominguez, Tyan Parker
[J]. FRONTIERS IN PUBLIC HEALTH, 2021, 9

← 1 2 3 4 5 6 7 8 9 →