CNN and Deep Sets for End-to-End Whole Slide Image Representation Learning

Cited by: 0
Authors
Hemati, Sobhan [1]
Kalra, Shivam [1]
Meaney, Cameron [2]
Babaie, Morteza [1]
Ghodsi, Ali [3, 4]
Tizhoosh, H. R. [1, 4]
Affiliations
[1] Univ Waterloo, Kimia Lab, Waterloo, ON, Canada
[2] Univ Waterloo, Dept Appl Math, Waterloo, ON, Canada
[3] Univ Waterloo, Data Analyt Lab, Waterloo, ON, Canada
[4] MaRS Ctr, Vector Inst, Toronto, ON, Canada
Source
MEDICAL IMAGING WITH DEEP LEARNING, VOL 143 | 2021 / Vol. 143
Keywords
Whole-Slide Image Representation Learning; Whole-Slide Image Search; Multi-Instance Learning; Multi-label Classification; Digital Pathology
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Digital pathology has enabled us to capture, store, and analyze scanned biopsy samples as digital images. Recent advances in deep learning are contributing to computational pathology to improve diagnosis and treatment. However, challenges inherent to whole slide images (WSIs) make it difficult to employ deep learning in digital pathology. In particular, the computational bottlenecks induced by gigapixel WSIs make it difficult to use deep learning for end-to-end image representation. To mitigate this challenge, many patch-based approaches have been proposed. Although patching WSIs enables us to use deep learning, we end up with a bag of patches, or set representation, which makes downstream tasks non-trivial. More importantly, given a set representation per WSI, it is not clear how one can obtain the similarity between two WSIs (sets) for tasks like image search matching. To address this challenge, we propose a neural network based on a Convolutional Neural Network (CNN) and Deep Sets to learn one permutation-invariant vector representation per WSI in an end-to-end manner. Considering the labels available at the WSI level, namely primary site and cancer subtype, we train the proposed network in a multi-label setting to encode both primary site and diagnosis. Because every primary site has its own specific cancer subtypes, we propose to use the predicted label for the primary site to recognize the cancer subtype. The proposed architecture is used for transfer learning of WSIs and validated on two different tasks, i.e., search and classification. The results show that the proposed architecture can be used to obtain WSI representations that outperform Yottixel, a recently developed search engine for pathology images, in terms of both retrieval performance and search time. Further, the model achieved competitive performance against the state of the art in lung cancer classification.
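The two ideas in the abstract that lend themselves to a small illustration are the permutation-invariant set embedding and the use of the predicted primary site to narrow the subtype decision. The sketch below is not the authors' network. It assumes a Deep Sets style form, rho(sum of phi(patch)), where a random linear layer with ReLU stands in for the CNN patch encoder, and it assumes a hypothetical site-to-subtype table. All function names, layer sizes, and label codes are illustrative assumptions.

```python
import math
import random


def make_layer(n_in, n_out, rng):
    """Random (n_out x n_in) weight matrix; a stand-in for trained weights."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]


def dense_relu(weights, x):
    """Linear map followed by ReLU, applied to a single vector."""
    return [max(0.0, sum(w * v for w, v in zip(row, x))) for row in weights]


def embed_slide(patches, phi, rho):
    """Deep Sets style embedding: rho(sum_i phi(patch_i)).

    Sum pooling ignores patch order, so the slide vector is
    permutation invariant (up to floating-point rounding).
    """
    pooled = [0.0] * len(phi)
    for patch in patches:
        for j, value in enumerate(dense_relu(phi, patch)):
            pooled[j] += value
    return dense_relu(rho, pooled)


def cosine(u, v):
    """Cosine similarity between two slide vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Hypothetical table: which subtypes are valid for each primary site.
SUBTYPES_BY_SITE = {"lung": ["LUAD", "LUSC"], "kidney": ["KIRC", "KIRP"]}


def predict_subtype(site_scores, subtype_scores):
    """Pick the top-scoring site, then the best subtype allowed for that site."""
    site = max(site_scores, key=site_scores.get)
    subtype = max(SUBTYPES_BY_SITE[site], key=lambda name: subtype_scores[name])
    return site, subtype


if __name__ == "__main__":
    rng = random.Random(0)
    phi = make_layer(4, 8, rng)   # per-patch encoder (CNN stand-in)
    rho = make_layer(8, 3, rng)   # post-pooling network
    patches = [[rng.random() for _ in range(4)] for _ in range(5)]
    shuffled = patches[:]
    rng.shuffle(shuffled)
    # The two embeddings agree up to rounding, regardless of patch order.
    print(embed_slide(patches, phi, rho))
    print(embed_slide(shuffled, phi, rho))
```

With one fixed-length vector per slide, comparing two WSIs reduces to a vector similarity such as `cosine`, which is one plausible way to support search. The similarity measure actually used in the paper is not stated in the abstract.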
Pages: 301-311
Page count: 11