Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge

被引:2
作者
Holste, Gregory [1 ]
Zhou, Yiliang [2 ]
Wang, Song [1 ]
Jaiswal, Ajay [1 ]
Lin, Mingquan [2 ]
Zhuge, Sherry [3 ]
Yang, Yuzhe [4 ]
Kim, Dongkyun [5 ]
Nguyen-Mau, Trong-Hieu [6 ]
Tran, Minh-Triet [6 ]
Jeong, Jaehyup [7 ]
Park, Wongi [8 ]
Ryu, Jongbin [8 ]
Hong, Feng [9 ]
Verma, Arsh [10 ]
Yamagishi, Yosuke [11 ]
Kim, Changhyun [12 ]
Seo, Hyeryeong [13 ]
Kang, Myungjoo [14 ]
Celi, Leo Anthony [15 ,16 ,17 ]
Lu, Zhiyong [18 ]
Summers, Ronald M. [19 ]
Shih, George [20 ]
Wang, Zhangyang [1 ]
Peng, Yifan [2 ]
机构
[1] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA
[2] Weill Cornell Med, Dept Populat Hlth Sci, New York, NY 10065 USA
[3] Carnegie Mellon Univ, Sch Informat Syst, Pittsburgh, PA 15213 USA
[4] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
[5] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[6] Univ Sci, VNU HCM, Ho Chi Minh City, Vietnam
[7] KT Corp, KT Res & Dev Ctr, Seoul 06763, South Korea
[8] Ajou Univ, Dept Software & Comp Engn, Suwon 16499, South Korea
[9] Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr, Shanghai 200240, Peoples R China
[10] Wadhwani Inst Artificial Intelligence, Mumbai 400079, India
[11] Univ Tokyo, Grad Sch Med, Div Radiol & Biomed Engn, Tokyo 1130033, Japan
[12] SK Telecom, AIX Future R&D Ctr, Biomed AI Team, Seoul 04539, South Korea
[13] Seoul Natl Univ, Interdisciplinary Program AI IPAI, Seoul 02504, South Korea
[14] Seoul Natl Univ, Dept Math Sci, Seoul 02504, South Korea
[15] MIT, Lab Computat Physiol, Cambridge, MA 02139 USA
[16] Beth Israel Deaconess Med Ctr, Div Pulm Crit Care & Sleep Med, Boston, MA 02215 USA
[17] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[18] Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA
[19] NIH, Ctr Clin, Bethesda, MD 20892 USA
[20] Weill Cornell Med, Dept Radiol, New York, NY 10065 USA
基金
美国国家科学基金会;
关键词
Chest X-ray; Long-tailed learning; Computer-aided diagnosis; DIAGNOSIS;
D O I
10.1016/j.media.2024.103224
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many real -world image recognition problems, such as diagnostic medical imaging exams, are "long-tailed"- there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi -label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of long-tailed learning in medical image recognition, few have studied the interaction of label imbalance and label co -occurrence posed by longtailed, multi -label disease classification. To engage with the research community on this emerging topic, we conducted an open challenge, CXR-LT , on long-tailed, multi -label thorax disease classification from chest X-rays (CXRs). We publicly release a large-scale benchmark dataset of over 350,000 CXRs, each labeled with at least one of 26 clinical findings following a long-tailed distribution. We synthesize common themes of top -performing solutions, providing practical recommendations for long-tailed, multi -label medical image classification. Finally, we use these insights to propose a path forward involving vision -language foundation models for few- and zero -shot disease classification.
引用
收藏
页数:12
相关论文
共 83 条
  • [31] Flexible Sampling for Long-Tailed Skin Lesion Classification
    Ju, Lie
    Wu, Yicheng
    Wang, Lin
    Yu, Zhen
    Zhao, Xin
    Wang, Xin
    Bonnington, Paul
    Ge, Zongyuan
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT III, 2022, 13433 : 462 - 471
  • [32] Relational Subsets Knowledge Distillation for Long-Tailed Retinal Diseases Recognition
    Ju, Lie
    Wang, Xin
    Wang, Lin
    Liu, Tongliang
    Zhao, Xin
    Drummond, Tom
    Mahapatra, Dwarikanath
    Ge, Zongyuan
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII, 2021, 12908 : 3 - 12
  • [33] Transformers in Vision: A Survey
    Khan, Salman
    Naseer, Muzammal
    Hayat, Munawar
    Zamir, Syed Waqas
    Khan, Fahad Shahbaz
    Shah, Mubarak
    [J]. ACM COMPUTING SURVEYS, 2022, 54 (10S)
  • [34] Chest X-Ray Feature Pyramid Sum Model with Diseased Area Data Augmentation Method
    Kim, Changhyun
    Kim, Giyeol
    Yang, Sooyoung
    Kim, Hyunsu
    Lee, Sangyool
    Cho, Hansu
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2749 - 2758
  • [35] CheXFusion: Effective Fusion of Multi-View Features using Transformers for Long-Tailed Chest X-Ray Classification
    Kim, Dongkyun
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2694 - 2702
  • [36] Lehman E., 2023, Clinical-t5: Large language models built using mimic clinical text
  • [37] Li C., 2024, Adv. Neural Inf. Process. Syst., V36
  • [38] A ConvNet for the 2020s
    Liu, Zhuang
    Mao, Hanzi
    Wu, Chao-Yuan
    Feichtenhofer, Christoph
    Darrell, Trevor
    Xie, Saining
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11966 - 11976
  • [39] Mishra A, 2023, Arxiv, DOI arXiv:2306.08000
  • [40] Multi-Modal Understanding and Generation for Medical Images and Text via Vision-Language Pre-Training
    Moon, Jong Hak
    Lee, Hyungyung
    Shin, Woncheol
    Kim, Young-Hak
    Choi, Edward
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (12) : 6070 - 6080