3D Mitochondria Instance Segmentation with Spatio-Temporal Transformers

Cited by: 2
Authors
Thawakar, Omkar [1]
Anwer, Rao Muhammad [1, 2]
Laaksonen, Jorma [2]
Reiner, Orly [3]
Shah, Mubarak [4]
Khan, Fahad Shahbaz [1, 5]
Affiliations
[1] MBZUAI, Masdar City, United Arab Emirates
[2] Aalto Univ, Espoo, Finland
[3] Weizmann Inst Sci, Rehovot, Israel
[4] Univ Cent Florida, Orlando, FL 32816 USA
[5] Linkoping Univ, Linkoping, Sweden
Keywords
Electron Microscopy; Mitochondria instance segmentation; Spatio-Temporal Transformer; Hybrid CNN-Transformers
DOI
10.1007/978-3-031-43993-3_59
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Accurate 3D mitochondria instance segmentation in electron microscopy (EM) is a challenging problem and serves as a prerequisite to empirically analyze their distributions and morphology. Most existing approaches employ 3D convolutions to obtain representative features. However, these convolution-based approaches struggle to effectively capture long-range dependencies in the volume mitochondria data, due to their limited local receptive field. To address this, we propose a hybrid encoder-decoder framework based on a split spatio-temporal attention module that efficiently computes spatial and temporal self-attentions in parallel, which are later fused through a deformable convolution. Further, we introduce a semantic foreground-background adversarial loss during training that aids in delineating the region of mitochondria instances from the background clutter. Our extensive experiments on three benchmarks, Lucchi, MitoEM-R and MitoEM-H, reveal the benefits of the proposed contributions achieving state-of-the-art results on all three datasets. Our code and models are available at https://github.com/OmkarThawakar/STT-UNET.
Pages: 613-623
Number of pages: 11