CFI-ViT: A coarse-to-fine inference based vision transformer for gastric cancer subtype detection using pathological images

被引:0
作者
Wang, Xinghang [1 ,2 ]
Tao, Haibo [3 ]
Wang, Bin [1 ,2 ]
Jin, Huaiping [1 ,2 ]
Li, Zhenhui [3 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Dept Automat, Kunming 650500, Peoples R China
[2] Kunming Univ Sci & Technol, Higher Educ Key Lab Ind Intelligence & Syst Yunnan, Kunming 650500, Peoples R China
[3] Kunming Med Univ, Peking Univ, Affiliated Hosp 3, Yunnan Canc Hosp,Canc Hosp Yunnan, Kunming 650118, Peoples R China
基金
中国国家自然科学基金;
关键词
Gastric cancer; Subtype classification; Histopathology images; Vision transformer; Attention mechanism; Two-stage inference; RING CELL-CARCINOMA; COMPUTATIONAL PATHOLOGY; CLASSIFICATION; DIFFERENCE;
D O I
10.1016/j.bspc.2024.107160
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Accurate detection of histopathological cancer subtypes is crucial for personalized treatment. Currently, deep learning methods based on histopathology images have become an effective solution to this problem. However, existing deep learning methods for histopathology image classification often suffer from high computational complexity, not considering the variability of different regions, and failing to synchronize the focus on local-global information effectively. To address these issues, we propose a coarse-to-fine inference based vision transformer (ViT) network (CFI-ViT) for pathological image detection of gastric cancer subtypes. CFI-ViT combines global attention and discriminative and differentiable modules to achieve two-stage inference. In the coarse inference stage, a ViT model with relative position embedding is employed to extract global information from the input images. If the critical information is not sufficiently identified, the differentiable module is adopted to extract local image regions with discrimination for fine-grained screening in the fine inference stage. The effectiveness and superiority of the proposed CFI-ViT method have been validated through three pathological image datasets of gastric cancer, including one private dataset clinically collected from Yunnan Cancer Hospital in China and two publicly available datasets, i.e., HE-GHI-DS and TCGA-STAD. The experimental results demonstrate that CFI-ViT achieves superior recognition accuracy and generalization performance compared to traditional methods, while using only 80 % of the computational resources required by the ViT model.
引用
收藏
页数:16
相关论文
共 51 条
  • [1] Abernethy J, 2016, NEURAL INF PROCESS S, P233
  • [2] A State-of-the-Art Review for Gastric Histopathology Image Analysis Approaches and Future Development
    Ai, Shiliang
    Li, Chen
    Li, Xiaoyan
    Jiang, Tao
    Grzegorzek, Marcin
    Sun, Changhao
    Rahaman, Md Mamunur
    Zhang, Jinghua
    Yao, Yudong
    Li, Hong
    [J]. BIOMED RESEARCH INTERNATIONAL, 2021, 2021
  • [3] The Eighth Edition AJCC Cancer Staging Manual: Continuing to build a bridge from a population-based to a more "personalized" approach to cancer staging
    Amin, Mahul B.
    Greene, Frederick L.
    Edge, Stephen B.
    Compton, Carolyn C.
    Gershenwald, Jeffrey E.
    Brookland, Robert K.
    Meyer, Laura
    Gress, Donna M.
    Byrd, David R.
    Winchester, David P.
    [J]. CA-A CANCER JOURNAL FOR CLINICIANS, 2017, 67 (02) : 93 - 99
  • [4] Attention Augmented Convolutional Networks
    Bello, Irwan
    Zoph, Barret
    Vaswani, Ashish
    Shlens, Jonathon
    Le, Quoc V.
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3285 - 3294
  • [5] Berthet Q., 2020, Adv Neural Inf Process Syst, V33, P9508
  • [6] Bosman F. T., 2010, WHO classification of tumours of the digestive system
  • [7] Clinical-grade computational pathology using weakly supervised deep learning on whole slide images
    Campanella, Gabriele
    Hanna, Matthew G.
    Geneslaw, Luke
    Miraflor, Allen
    Silva, Vitor Werneck Krauss
    Busam, Klaus J.
    Brogi, Edi
    Reuter, Victor E.
    Klimstra, David S.
    Fuchs, Thomas J.
    [J]. NATURE MEDICINE, 2019, 25 (08) : 1301 - +
  • [8] GasHis-Transformer: A multi-scale visual transformer approach for gastric histopathological image detection
    Chen, Haoyuan
    Li, Chen
    Wang, Ge
    Li, Xiaoyan
    Rahaman, Md Mamunur
    Sun, Hongzan
    Hu, Weiming
    Li, Yixin
    Liu, Wanli
    Sun, Changhao
    Ai, Shiliang
    Grzegorzek, Marcin
    [J]. PATTERN RECOGNITION, 2022, 130
  • [9] Gastric Cancer: Overview
    Correa, Pelayo
    [J]. GASTROENTEROLOGY CLINICS OF NORTH AMERICA, 2013, 42 (02) : 211 - +
  • [10] TransMed: Transformers Advance Multi-Modal Medical Image Classification
    Dai, Yin
    Gao, Yifan
    Liu, Fayu
    [J]. DIAGNOSTICS, 2021, 11 (08)