WsiCaption: Multiple Instance Generation of Pathology Reports for Gigapixel Whole-Slide Images

被引:2
|
作者
Chen, Pingyi [1 ,2 ,3 ]
Li, Honglin [1 ,2 ,3 ]
Zhu, Chenglu [2 ,3 ]
Zheng, Sunyi [2 ,3 ]
Shui, Zhongyi [1 ,2 ,3 ]
Yang, Lin [2 ,3 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Westlake Univ, Res Ctr Ind Future, Hangzhou, Peoples R China
[3] Westlake Univ, Sch Engn, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Whole Slide Images; Image Caption; Visual-language; Learning;
D O I
10.1007/978-3-031-72083-3_51
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Whole slide images are the foundation of digital pathology for the diagnosis and treatment of carcinomas. Writing pathology reports is laborious and error-prone for inexperienced pathologists. To reduce the workload and improve clinical automation, we investigate how to generate pathology reports given whole slide images. On the data end, we curated the largest WSI-text dataset (PathText). In specific, we collected nearly 10000 high-quality WSI-text pairs for visuallanguage models by recognizing and cleaning pathology reports which narrate diagnostic slides in TCGA. On the model end, we propose the multiple instance generative model (MI-Gen) which can produce pathology reports for gigapixel WSIs. We benchmark our model on the largest subset of PathText. Experimental results show our model can generate pathology reports which contain multiple clinical clues and achieve competitive performance on certain slide-level tasks. We observe that simple semantic extraction from the pathology reports can achieve the best performance (0.838 of F1 score) on BRCA subtyping surpassing previous state-of-the-art approaches. Our collected dataset and related code are available at https://github.com/cpystan/Wsi-Caption.
引用
收藏
页码:546 / 556
页数:11
相关论文
共 50 条
  • [31] Clinical Applications of Whole-slide Imaging in Anatomic Pathology
    Volynskaya, Zoya
    Evans, Andrew J.
    Asa, Sylvia L.
    ADVANCES IN ANATOMIC PATHOLOGY, 2017, 24 (04) : 215 - 221
  • [32] Practical quantification of necrosis in histological whole-slide images
    Homeyer, Andre
    Schenk, Andrea
    Arlt, Janine
    Dahmen, Uta
    Dirsch, Olaf
    Hahn, Horst K.
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2013, 37 (04) : 313 - 322
  • [33] Multiclass Classification of Breast Cancer in Whole-Slide Images
    Kwok, Scotty
    IMAGE ANALYSIS AND RECOGNITION (ICIAR 2018), 2018, 10882 : 931 - 940
  • [34] Deep learning-based histotype diagnosis of ovarian carcinoma whole-slide pathology images
    Farahani, Hossein
    Boschman, Jeffrey
    Farnell, David
    Darbandsari, Amirali
    Zhang, Allen
    Ahmadvand, Pouya
    Jones, Steven J. M.
    Huntsman, David
    Kobel, Martin
    Gilks, C. Blake
    Singh, Naveena
    Bashashati, Ali
    MODERN PATHOLOGY, 2022, 35 (12) : 1983 - 1990
  • [35] FAST MCT OPTIMIZATION FOR THE COMPRESSION OF WHOLE-SLIDE IMAGES
    Hernandez-Cabronero, Miguel
    Sanchez, Victor
    Auli-Llinas, Francesc
    Serra-Sagrista, Joan
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 2370 - 2374
  • [36] MHAttnSurv: Multi-head attention for survival prediction using whole-slide pathology images
    Jiang, Shuai
    Suriawinata, Arief A.
    Hassanpour, Saeed
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 158
  • [37] Stain Specific Standardization of Whole-Slide Histopathological Images
    Bejnordi, Babak Ehteshami
    Litjens, Geert
    Timofeeva, Nadya
    Otte-Holler, Irene
    Homeyer, Andre
    Karssemeijer, Nico
    van der Laak, Jeroen A. W. M.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (02) : 404 - 415
  • [38] Immune subtyping of melanoma whole slide images using multiple instance learning
    Godson, Lucy
    Alemi, Navid
    Nsengimana, Jeremie
    Cook, Graham P.
    Clarke, Emily L.
    Treanor, Darren
    Bishop, D. Timothy
    Newton-Bishop, Julia
    Gooya, Ali
    Magee, Derek
    MEDICAL IMAGE ANALYSIS, 2024, 93
  • [39] Validating Whole-Slide Imaging for Consultation Diagnoses in Surgical Pathology
    Bauer, Thomas W.
    Slaw, Renee J.
    ARCHIVES OF PATHOLOGY & LABORATORY MEDICINE, 2014, 138 (11) : 1459 - 1465
  • [40] CNN cascades for segmenting sparse objects in gigapixel whole slide images
    Gadermayr, Michael
    Dombrowski, Ann-Kathrin
    Klinkhammer, Barbara Mara
    Boor, Peter
    Merhof, Dorit
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2019, 71 : 40 - 48