VerFormer: Vertebrae-Aware Transformer for Automatic Spine Segmentation from CT Images

被引:0
|
作者
Li, Xinchen [1 ]
Hong, Yuan [1 ]
Xu, Yang [1 ]
Hu, Mu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Med, Ruijin Hosp, Dept Orthoped, Shanghai 200025, Peoples R China
关键词
Vision Transformer; spine CT segmentation; attention mechanism; FRAMEWORK; NETWORKS;
D O I
10.3390/diagnostics14171859
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
The accurate and efficient segmentation of the spine is important in the diagnosis and treatment of spine malfunctions and fractures. However, it is still challenging because of large inter-vertebra variations in shape and cross-image localization of the spine. In previous methods, convolutional neural networks (CNNs) have been widely applied as a vision backbone to tackle this task. However, these methods are challenged in utilizing the global contextual information across the whole image for accurate spine segmentation because of the inherent locality of the convolution operation. Compared with CNNs, the Vision Transformer (ViT) has been proposed as another vision backbone with a high capacity to capture global contextual information. However, when the ViT is employed for spine segmentation, it treats all input tokens equally, including vertebrae-related tokens and non-vertebrae-related tokens. Additionally, it lacks the capability to locate regions of interest, thus lowering the accuracy of spine segmentation. To address this limitation, we propose a novel Vertebrae-aware Vision Transformer (VerFormer) for automatic spine segmentation from CT images. Our VerFormer is designed by incorporating a novel Vertebrae-aware Global (VG) block into the ViT backbone. In the VG block, the vertebrae-related global contextual information is extracted by a Vertebrae-aware Global Query (VGQ) module. Then, this information is incorporated into query tokens to highlight vertebrae-related tokens in the multi-head self-attention module. Thus, this VG block can leverage global contextual information to effectively and efficiently locate spines across the whole input, thus improving the segmentation accuracy of VerFormer. Driven by this design, the VerFormer demonstrates a solid capacity to capture more discriminative dependencies and vertebrae-related context in automatic spine segmentation. The experimental results on two spine CT segmentation tasks demonstrate the effectiveness of our VG block and the superiority of our VerFormer in spine segmentation. Compared with other popular CNN- or ViT-based segmentation models, our VerFormer shows superior segmentation accuracy and generalization.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Lung Parenchyma Segmentation from CT Images with a Fully Automatic Method
    Moghaddam, Reza Mousavi
    Aghazadeh, Nasser
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (05) : 14235 - 14257
  • [42] Lung Parenchyma Segmentation from CT Images with a Fully Automatic Method
    Reza Mousavi Moghaddam
    Nasser Aghazadeh
    Multimedia Tools and Applications, 2024, 83 : 14235 - 14257
  • [43] Automatic Deep Feature Learning via Patch-Based Deep Belief Network for Vertebrae Segmentation in CT Images
    Qadri, Syed Furqan
    Ai, Danni
    Hu, Guoyu
    Ahmad, Mubashir
    Huang, Yong
    Wang, Yongtian
    Yang, Jian
    APPLIED SCIENCES-BASEL, 2019, 9 (01):
  • [44] Fully automatic cervical vertebrae segmentation framework for X-ray images
    Al Arif, S. M. Masudur Rahman
    Knapp, Karen
    Slabaugh, Greg
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2018, 157 : 95 - 111
  • [45] SEGMENTATION OF TRABECULAR BONES FROM VERTEBRAL BODIES IN VOLUMETRIC CT SPINE IMAGES
    Asan, Melih S.
    Ali, Asem
    Arnold, Ben
    Fahmi, Rachid
    Farag, Aly A.
    Xiang, Ping
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 3385 - +
  • [46] Fully automatic detection of the vertebrae in 2D CT images
    Graf, Franz
    Kriegel, Hans-Peter
    Schubert, Matthias
    Strukelj, Michael
    Cavallaro, Alexander
    MEDICAL IMAGING 2011: IMAGE PROCESSING, 2011, 7962
  • [47] EG-TRANS3DUNET: A SINGLE-STAGED TRANSFORMER-BASED MODEL FOR ACCURATE VERTEBRAE SEGMENTATION FROM SPINAL CT IMAGES
    You, Xin
    Gu, Yun
    Liu, Yingying
    Lu, Steve
    Tang, Xin
    Yang, Jie
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [48] Automatic detection and segmentation of lumbar vertebrae from X-ray images for compression fracture evaluation
    Kim, Kang Cheol
    Cho, Hyun Cheol
    Jang, Tae Jun
    Choi, Jong Mun
    Seo, Jin Keun
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 200
  • [49] Automatic Pancreas Segmentation in CT Images With Distance-Based Saliency-Aware DenseASPP Network
    Hu, Peijun
    Li, Xiang
    Tian, Yu
    Tang, Tianyu
    Zhou, Tianshu
    Bai, Xueli
    Zhu, Shiqiang
    Liang, Tingbo
    Li, Jingsong
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (05) : 1601 - 1611
  • [50] Automatic Global Level Set Approach for Lumbar Vertebrae CT Image Segmentation
    Li, Yang
    Liang, Wei
    Zhang, Yinlong
    Tan, Jindong
    BIOMED RESEARCH INTERNATIONAL, 2018, 2018