Transformer-based multiple instance learning network with 2D positional encoding for histopathology image classification

Cited by: 0
Authors
Bin Yang [1]
Lei Ding [2]
Jianqiang Li [2]
Yong Li [2]
Guangzhi Qu [2]
Jingyi Wang [3]
Qiang Wang [2]
Bo Liu [2]
Affiliations
[1] Center for Strategic Assessment and Consulting, Academy of Military Science, Beijing
[2] Faculty of Information Technology, Beijing University of Technology, Beijing
[3] Computer Science and Engineering Department, Oakland University, Rochester
[4] School of Mathematical and Computational Sciences, Massey University, Auckland
Funding
National Natural Science Foundation of China
Keywords
Image classification; Multiple instance learning; Weakly supervised training
DOI
10.1007/s40747-025-01779-y
Abstract
Digital medical imaging, particularly pathology images, is essential for cancer diagnosis but faces challenges in direct model training due to its super-resolution nature. Although weakly supervised learning has reduced the need for manual annotations, many multiple instance learning (MIL) methods struggle to effectively capture crucial spatial relationships in histopathological images. Existing methods incorporating positional information often overlook nuanced spatial correlations or use positional encoding strategies that do not fully capture the unique spatial dynamics of pathology images. To address this issue, we propose a new framework named TMIL (Transformer-based Multiple Instance Learning Network with 2D positional encoding), which leverages multiple instance learning for weakly supervised classification of histopathological images. TMIL incorporates a 2D positional encoding module, based on the Transformer, to model positional information and explore correlations between instances. Furthermore, TMIL divides histopathological images into pseudo-bags and trains patch-level feature vectors with deep metric learning to enhance classification performance. Finally, the proposed approach is evaluated on a public colorectal adenoma dataset. The experimental results show that TMIL outperforms existing MIL methods, achieving an AUC of 97.28% and an ACC of 95.19%. These findings suggest that TMIL’s integration of deep metric learning and positional encoding offers a promising approach for improving the efficiency and accuracy of pathology image analysis in cancer diagnosis. © The Author(s) 2025.
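The abstract names two ingredients, a 2D positional encoding over patch positions and a split of each slide into pseudo-bags, without giving their exact form. The following is a minimal sketch of how such components are commonly built, not the TMIL implementation: the sinusoidal form of the encoding, the random partitioning rule, and the names `sincos_1d`, `positional_encoding_2d`, and `split_into_pseudo_bags` are all assumptions made for illustration.

```python
import math
import random


def sincos_1d(pos, dim):
    """Sinusoidal encoding of one scalar position into `dim` values (dim even).

    Assumption: the standard Transformer sine/cosine scheme; the paper's
    actual encoding may differ.
    """
    out = []
    for i in range(0, dim, 2):
        freq = 1.0 / (10000 ** (i / dim))
        out.append(math.sin(pos * freq))
        out.append(math.cos(pos * freq))
    return out


def positional_encoding_2d(row, col, dim):
    """Encode a patch's (row, col) grid position as a `dim`-length vector.

    Half of the vector encodes the row, the other half the column, so
    `dim` must be divisible by 4.
    """
    if dim % 4 != 0:
        raise ValueError("dim must be divisible by 4")
    half = dim // 2
    return sincos_1d(row, half) + sincos_1d(col, half)


def split_into_pseudo_bags(patches, num_bags, seed=0):
    """Randomly partition one slide's patches into `num_bags` pseudo-bags.

    Assumption: a uniform random partition; each pseudo-bag would inherit
    the slide-level label during weakly supervised training.
    """
    shuffled = list(patches)
    random.Random(seed).shuffle(shuffled)
    return [shuffled[i::num_bags] for i in range(num_bags)]


if __name__ == "__main__":
    # Encoding for the patch at grid row 3, column 5.
    print(len(positional_encoding_2d(3, 5, 16)))
    # Ten hypothetical patch indices split into three pseudo-bags.
    print(split_into_pseudo_bags(range(10), 3))
```

In a setup like this, the encoding vector would typically be added to each patch's feature vector before the Transformer layers, so that attention can take the patch's location on the slide into account.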
Related papers
45 in total
  • [1] Positional encoding-guided transformer-based multiple instance learning histopathology whole slide images classification
    Shi, Jun
    Sun, Dongdong
    Wu, Kun
    Jiang, Zhiguo
    Kong, Xue
    Wang, Wei
    Wu, Haibo
    Zheng, Yushan
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2025, 258
  • [2] SETMIL: Spatial Encoding Transformer-Based Multiple Instance Learning for Pathological Image Analysis
    Zhao, Yu
    Lin, Zhenyu
    Sun, Kai
    Zhang, Yidan
    Huang, Junzhou
    Wang, Liansheng
    Yao, Jianhua
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 66 - 76
  • [3] Transformer Based Multiple Instance Learning for Weakly Supervised Histopathology Image Segmentation
    Qian, Ziniu
    Li, Kailu
    Lai, Maode
    Chang, Eric I-Chao
    Wei, Bingzheng
    Fan, Yubo
    Xu, Yan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 160 - 170
  • [4] Multiple instance learning for medical image classification based on instance importance
    Struski, Lukasz
    Janusz, Szymon
    Tabor, Jacek
    Markiewicz, Michal
    Lewicki, Arkadiusz
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 91
  • [5] Breast Cancer Histopathology Image Classification and Localization using Multiple Instance Learning
    Patil, Abhijeet
    Tamboli, Dipesh
    Meena, Swati
    Anand, Deepak
    Sethi, Amit
    2019 5TH IEEE INTERNATIONAL WIE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (WIECON-ECE 2019), 2019,
  • [6] Multi-center Ovarian Tumor Classification Using Hierarchical Transformer-Based Multiple-Instance Learning
    Claessens, Cris H. B.
    Schultz, Eloy W. R.
    Koch, Anna
    Nies, Ingrid
    Hellstrom, Terese A. E.
    Nederend, Joost
    Niers-Stobbe, Ilse
    Bruining, Annemarie
    Piek, Jurgen M. J.
    De With, Peter H. N.
    van der Sommen, Fons
    CANCER PREVENTION, DETECTION, AND INTERVENTION, CAPTION 2024, 2025, 15199 : 3 - 13
  • [7] Neighborhood attention transformer multiple instance learning for whole slide image classification
    Aftab, Rukhma
    Yan, Qiang
    Zhao, Juanjuan
    Yong, Gao
    Huajie, Yue
    Urrehman, Zia
    Khalid, Faizi Mohammad
    FRONTIERS IN ONCOLOGY, 2024, 14
  • [8] A multi-resolution model for histopathology image classification and localization with multiple instance learning
    Li, Jiayun
    Li, Wenyuan
    Sisk, Anthony
    Ye, Huihui
    Wallace, W. Dean
    Speier, William
    Arnold, Corey W.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 131
  • [9] Transformer based multiple instance learning for WSI breast cancer classification
    Gao, Chengyang
    Sun, Qiule
    Zhu, Wen
    Zhang, Lizhi
    Zhang, Jianxin
    Liu, Bin
    Zhang, Junxing
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
  • [10] An EM based multiple instance learning method for image classification
    Pao, H. T.
    Chuang, S. C.
    Xu, Y. Y.
    Fu, Hsin-Chia
    EXPERT SYSTEMS WITH APPLICATIONS, 2008, 35 (03) : 1468 - 1472