Space-filling Curves for Modeling Spatial Context in Transformer-based Whole Slide Image Classification

被引：0

作者：

Erkan, Cihan ^{[1
]}

Aksoy, Selim ^{[1
]}

机构：

[1] Bilkent Univ, Dept Comp Engn, TR-06800 Ankara, Turkiye

来源：

MEDICAL IMAGING 2023 | 2023年 / 12471卷

关键词：

Digital pathology; space-filling curves; vision transformer; whole slide image classification;

D O I：

10.1117/12.2654191

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The common method for histopathology image classification is to sample small patches from large whole slide images and make predictions based on aggregations of patch representations. Transformer models provide a promising alternative with their ability to capture long-range dependencies of patches and their potential to detect representative regions, thanks to their novel self-attention strategy. However, as a sequence-based architecture, transformers are unable to directly capture the two-dimensional nature of images. While it is possible to get around this problem by converting an image into a sequence of patches in raster scan order, the basic transformer architecture is still insensitive to the locations of the patches in the image. The aim of this work is to make the model be aware of the spatial context of the patches as neighboring patches are likely to be part of the same diagnostically relevant structure. We propose a transformer-based whole slide image classification framework that uses space-filling curves to generate patch sequences that are adaptive to the variations in the shapes of the tissue structures. The goal is to preserve the locality of the patches so that neighboring patches in the one-dimensional sequence are closer to each other in the two-dimensional slide. We use positional encodings to capture the spatial arrangements of the patches in these sequences. Experiments using a lung cancer dataset obtained from The Cancer Genome Atlas show that the proposed sequence generation approach that best preserves the locality of the patches achieves 87.6% accuracy, which is higher than baseline models that use raster scan ordering (86.7% accuracy), no ordering (86.3% accuracy), and a model that uses convolutions to relate the neighboring patches (81.7% accuracy).

引用

页数：8

共 41 条

[21] An empirical study of p-norm based locality measures of space-filling curves
Dai, HK
Su, HC
PDPTA'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-4, 2003, : 1434 - 1440
[22] Algorithm for NC Tool Paths Automatic Generation on Surfaces Based on Space-Filling Curves
Chen, Zhanfang
Zhang, Xiaoming
Han, Dongsong
Wu, Shufang
Zhang, Wenbo
ADVANCES IN COMPUTER SCIENCE, INTELLIGENT SYSTEM AND ENVIRONMENT, VOL 2, 2011, 105 : 441 - +
[23] HiViT: Hierarchical attention-based Transformer for multi-scale whole slide histopathological image classification
Yu, Jinze
Li, Shuo
Tan, Luxin
Zhou, Haoyi
Li, Zhongwu
Li, Jianxin
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 277
[24] Density-based data clustering algorithms for lower dimensions using space-filling curves
Xu, Bin
Chen, Danny Z.
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, 4426 : 997 - +
[25] A cache-aware algorithm for PDES on hierarchical data structures based on space-filling curves
Guenther, Frank
Mehl, Miriam
Poegl, Markus
Zenger, Christoph
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2006, 28 (05) : 1634 - 1650
[26] PYRAMID MASKED IMAGE MODELING FOR TRANSFORMER-BASED AERIAL OBJECT DETECTION
Zhang, Cong
Liu, Tianshan
Ju, Yakun
Lam, Kin-Man
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1675 - 1679
[27] Local-to-global spatial learning for whole-slide image representation and classification
Yu, Jiahui
Ma, Tianyu
Fu, Yu
Chen, Hang
Lai, Maode
Zhuo, Cheng
Xu, Yingke
COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2023, 107
[28] MG-Trans: Multi-Scale Graph Transformer With Information Bottleneck for Whole Slide Image Classification
Shi, Jiangbo
Tang, Lufei
Gao, Zeyu
Li, Yang
Wang, Chunbao
Gong, Tieliang
Li, Chen
Fu, Huazhu
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (12) : 3871 - 3883
[29] SparseConvMIL: Sparse Convolutional Context-Aware Multiple Instance Learning for Whole Slide Image Classification
Lerousseau, Marvin
Vakalopoulou, Maria
Deutsch, Eric
Paragios, Nikos
MICCAI WORKSHOP ON COMPUTATIONAL PATHOLOGY, VOL 156, 2021, 156 : 129 - 139
[30] Use of space-filling curves for additive manufacturing of three dimensionally varying graded dielectric structures using fused deposition modeling
Larimore, Zachary
Jensen, Sarah
Parsons, Paul
Good, Brandon
Smith, Kelsey
Mirotznik, Mark
ADDITIVE MANUFACTURING, 2017, 15 : 48 - 56

← 1 2 3 4 5 →