Space-filling Curves for Modeling Spatial Context in Transformer-based Whole Slide Image Classification

被引:0
|
作者
Erkan, Cihan [1 ]
Aksoy, Selim [1 ]
机构
[1] Bilkent Univ, Dept Comp Engn, TR-06800 Ankara, Turkiye
来源
MEDICAL IMAGING 2023 | 2023年 / 12471卷
关键词
Digital pathology; space-filling curves; vision transformer; whole slide image classification;
D O I
10.1117/12.2654191
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The common method for histopathology image classification is to sample small patches from large whole slide images and make predictions based on aggregations of patch representations. Transformer models provide a promising alternative with their ability to capture long-range dependencies of patches and their potential to detect representative regions, thanks to their novel self-attention strategy. However, as a sequence-based architecture, transformers are unable to directly capture the two-dimensional nature of images. While it is possible to get around this problem by converting an image into a sequence of patches in raster scan order, the basic transformer architecture is still insensitive to the locations of the patches in the image. The aim of this work is to make the model be aware of the spatial context of the patches as neighboring patches are likely to be part of the same diagnostically relevant structure. We propose a transformer-based whole slide image classification framework that uses space-filling curves to generate patch sequences that are adaptive to the variations in the shapes of the tissue structures. The goal is to preserve the locality of the patches so that neighboring patches in the one-dimensional sequence are closer to each other in the two-dimensional slide. We use positional encodings to capture the spatial arrangements of the patches in these sequences. Experiments using a lung cancer dataset obtained from The Cancer Genome Atlas show that the proposed sequence generation approach that best preserves the locality of the patches achieves 87.6% accuracy, which is higher than baseline models that use raster scan ordering (86.7% accuracy), no ordering (86.3% accuracy), and a model that uses convolutions to relate the neighboring patches (81.7% accuracy).
引用
收藏
页数:8
相关论文
共 41 条
  • [21] An empirical study of p-norm based locality measures of space-filling curves
    Dai, HK
    Su, HC
    PDPTA'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-4, 2003, : 1434 - 1440
  • [22] Algorithm for NC Tool Paths Automatic Generation on Surfaces Based on Space-Filling Curves
    Chen, Zhanfang
    Zhang, Xiaoming
    Han, Dongsong
    Wu, Shufang
    Zhang, Wenbo
    ADVANCES IN COMPUTER SCIENCE, INTELLIGENT SYSTEM AND ENVIRONMENT, VOL 2, 2011, 105 : 441 - +
  • [23] HiViT: Hierarchical attention-based Transformer for multi-scale whole slide histopathological image classification
    Yu, Jinze
    Li, Shuo
    Tan, Luxin
    Zhou, Haoyi
    Li, Zhongwu
    Li, Jianxin
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 277
  • [24] Density-based data clustering algorithms for lower dimensions using space-filling curves
    Xu, Bin
    Chen, Danny Z.
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, 4426 : 997 - +
  • [25] A cache-aware algorithm for PDES on hierarchical data structures based on space-filling curves
    Guenther, Frank
    Mehl, Miriam
    Poegl, Markus
    Zenger, Christoph
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2006, 28 (05) : 1634 - 1650
  • [26] PYRAMID MASKED IMAGE MODELING FOR TRANSFORMER-BASED AERIAL OBJECT DETECTION
    Zhang, Cong
    Liu, Tianshan
    Ju, Yakun
    Lam, Kin-Man
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1675 - 1679
  • [27] Local-to-global spatial learning for whole-slide image representation and classification
    Yu, Jiahui
    Ma, Tianyu
    Fu, Yu
    Chen, Hang
    Lai, Maode
    Zhuo, Cheng
    Xu, Yingke
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2023, 107
  • [28] MG-Trans: Multi-Scale Graph Transformer With Information Bottleneck for Whole Slide Image Classification
    Shi, Jiangbo
    Tang, Lufei
    Gao, Zeyu
    Li, Yang
    Wang, Chunbao
    Gong, Tieliang
    Li, Chen
    Fu, Huazhu
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (12) : 3871 - 3883
  • [29] SparseConvMIL: Sparse Convolutional Context-Aware Multiple Instance Learning for Whole Slide Image Classification
    Lerousseau, Marvin
    Vakalopoulou, Maria
    Deutsch, Eric
    Paragios, Nikos
    MICCAI WORKSHOP ON COMPUTATIONAL PATHOLOGY, VOL 156, 2021, 156 : 129 - 139
  • [30] Use of space-filling curves for additive manufacturing of three dimensionally varying graded dielectric structures using fused deposition modeling
    Larimore, Zachary
    Jensen, Sarah
    Parsons, Paul
    Good, Brandon
    Smith, Kelsey
    Mirotznik, Mark
    ADDITIVE MANUFACTURING, 2017, 15 : 48 - 56