Position-Aware Masked Autoencoder for Histopathology WSI Representation Learning

被引:5
|
作者
Wu, Kun [1 ]
Zheng, Yushan [2 ]
Shi, Jun [3 ]
Xie, Fengying [1 ]
Jiang, Zhiguo [1 ]
机构
[1] Beihang Univ, Sch Astronaut, Image Proc Ctr, Beijing 102206, Peoples R China
[2] Beihang Univ, Beijing Adv Innovat Ctr Biomed Engn, Sch Engn Med, Beijing 100191, Peoples R China
[3] Hefei Univ Technol, Sch Software, Hefei 230601, Peoples R China
来源
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI | 2023年 / 14225卷
基金
中国国家自然科学基金;
关键词
WSI representation learning; Self-supervised learning;
D O I
10.1007/978-3-031-43987-2_69
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transformer-based multiple instance learning (MIL) framework has been proven advanced for whole slide image (WSI) analysis. However, existing spatial embedding strategies in Transformer can only represent fixed structural information, which are hard to tackle the scale-varying and isotropic characteristics of WSIs. Moreover, the current MIL cannot take advantage of a large number of unlabeled WSIs for training. In this paper, we propose a novel self-supervised whole slide image representation learning framework named position-aware masked autoencoder (PAMA), which can make full use of abundant unlabeled WSIs to improve the discrimination of slide features. Moreover, we propose a position-aware cross-attention (PACA) module with a kernel reorientation (KRO) strategy, which makes PAMA able to maintain spatial integrity and semantic enrichment during the training. We evaluated the proposed method on a public TCGA-Lung dataset with 3,064 WSIs and an in-house Endometrial dataset with 3,654 WSIs, and compared it with 6 state-of-the-art methods. The results of experiments show our PAMA is superior to SOTA MIL methods and SSL methods. The code will be available at https://github.com/WkEEn/PAMA.
引用
收藏
页码:714 / 724
页数:11
相关论文
共 50 条
  • [1] Federated Learning with Position-Aware Neurons
    Li, Xin-Chun
    Xu, Yi-Chu
    Song, Shaoming
    Li, Bingshuai
    Li, Yinchuan
    Shao, Yunfeng
    Zhan, De-Chuan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10072 - 10081
  • [2] Position-aware representation learning with anatomical priors for enhanced pancreas tumor segmentation
    Dong, Kaiqi
    Hu, Peijun
    Tian, Yu
    Zhu, Yan
    Li, Xiang
    Zhou, Tianshu
    Bai, Xueli
    Liang, Tingbo
    Li, Jingsong
    NEUROCOMPUTING, 2025, 616
  • [3] Scale-MAE: A Scale-Aware Masked Autoencoder for Multiscale Geospatial Representation Learning
    Reed, Colorado J.
    Gupta, Ritwik
    Li, Shufan
    Brockman, Sarah
    Funk, Christopher
    Clipp, Brian
    Keutzer, Kurt
    Candido, Salvatore
    Uyttendaele, Matt
    Darrell, Trevor
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 4065 - 4076
  • [4] Position-aware multimedia mobile learning systems in museums
    Chou, LD
    Wu, CH
    Ho, SP
    Lee, CC
    Proceedings of the IASTED International Conference on Web-Based Education, 2004, : 148 - 150
  • [5] Position-Aware Network Representation Learning via K-Step Mutual Information Estimation
    Chu X.
    Fan X.
    Bi J.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2021, 58 (08): : 1612 - 1623
  • [6] Position-Aware ListMLE: A Sequential Learning Process for Ranking
    Lan, Yanyan
    Zhu, Yadong
    Guo, Jiafeng
    Niu, Shuzi
    Cheng, Xueqi
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2014, : 449 - 458
  • [7] MARLIN: Masked Autoencoder for facial video Representation LearnINg
    Cai, Zhixi
    Ghosh, Shreya
    Stefanov, Kalin
    Dhall, Abhinav
    Cai, Jianfei
    Rezatofighi, Hamid
    Haffari, Reza
    Hayat, Munawar
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1493 - 1504
  • [8] Exploring global context and position-aware representation for group activity recognition
    Du, Zexing
    Wang, Qing
    IMAGE AND VISION COMPUTING, 2024, 149
  • [9] Enhancing Sindhi Word Segmentation Using Subword Representation Learning and Position-Aware Self-Attention
    Ali, Wazir
    Kumar, Jay
    Tumani, Saifullah
    Nour, Redhwan
    Noor, Adeeb
    Xu, Zenglin
    IEEE ACCESS, 2024, 12 : 183133 - 183142
  • [10] Position-Aware Cuckoo Filters
    Kwon, Minseok
    Shankar, Vijay
    Reviriego, Pedro
    PROCEEDINGS OF THE 2018 SYMPOSIUM ON ARCHITECTURES FOR NETWORKING AND COMMUNICATIONS SYSTEMS (ANCS '18), 2018, : 151 - 153