IIAM: Intra and Inter Attention With Mutual Consistency Learning Network for Medical Image Segmentation

被引：0

作者：

Pang, Chen ^{[1
]}

Lu, Xuequan ^{[2
]}

Liu, Xiang ^{[3
]}

Zhang, Renfeng ^{[4
]}

Lyu, Lei ^{[1
]}

机构：

[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China

[2] La Trobe Univ, Dept Comp Sci & IT, Melbourne, Vic 3086, Australia

[3] IIT, Dept Elect & Comp Engn, Chicago, IL 60616 USA

[4] Shandong First Medial Univ, Shandong Prov Hosp, Dept Lab Med, Jinan 271000, Peoples R China

来源：

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS | 2024年 / 28卷 / 10期

基金：

中国国家自然科学基金;

关键词：

Image segmentation; Transformers; Decoding; Feature extraction; Uncertainty; Convolutional neural networks; Medical diagnostic imaging; Medical image segmentation; attention; transformer; mutual consistency learning; TRANSFORMER;

D O I：

10.1109/JBHI.2024.3426074

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Medical image segmentation provides a reliable basis for diagnosis analysis and disease treatment by capturing the global and local features of the target region. To learn global features, convolutional neural networks are replaced with pure transformers, or transformer layers are stacked at the deepest layers of convolutional neural networks. Nevertheless, they are deficient in exploring local-global cues at each scale and the interaction among consensual regions in multiple scales, hindering the learning about the changes in size, shape, and position of target objects. To cope with these defects, we propose a novel Intra and Inter Attention with Mutual Consistency Learning Network (IIAM). Concretely, we design an intra attention module to aggregate the CNN-based local features and transformer-based global information on each scale. In addition, to capture the interaction among consensual regions in multiple scales, we devise an inter attention module to explore the cross-scale dependency of the object and its surroundings. Moreover, to reduce the impact of blurred regions in medical images on the final segmentation results, we introduce multiple decoders to estimate the model uncertainty, where we adopt a mutual consistency learning strategy to minimize the output discrepancy during the end-to-end training and weight the outputs of the three decoders as the final segmentation result. Extensive experiments on three benchmark datasets verify the efficacy of our method and demonstrate superior performance of our model to state-of-the-art techniques.

引用

页码：5971 / 5983

页数：13

共 43 条

[1] Dataset of breast ultrasound images [J].

Al-Dhabyani, Walid ;

Gomaa, Mohammed ;

Khaled, Hussien ;

Fahmy, Aly .

DATA IN BRIEF, 2020, 28

[2] A Multi-Scale Context Aware Attention Model for Medical Image Segmentation [J].

Alam, Md. Shariful ;

Wang, Dadong ;

Liao, Qiyu ;

Sowmya, Arcot .

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (08) :3731-3739

[3] IEMask R-CNN: Information-Enhanced Mask R-CNN [J].

Bi, Xiuli ;

Hu, Jinwu ;

Xiao, Bin ;

Li, Weisheng ;

Gao, Xinbo .

IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (02) :688-700

[4]

Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9

[5] Edge and neighborhood guidance network for 2D medical image segmentation [J].

Cao, Weiwei ;

Zheng, Jian ;

Xiang, Dehui ;

Ding, Saisai ;

Sun, Haotian ;

Yang, Xiaodong ;

Liu, Zhaobang ;

Dai, Yakang .

BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 69 (69)

[6] TransAttUnet: Multi-Level Attention-Guided U-Net With Transformer for Medical Image Segmentation [J].

Chen, Bingzhi ;

Liu, Yishu ;

Zhang, Zheng ;

Lu, Guangming ;

Kong, Adams Wai Kin .

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01) :55-68

[7] DCAN: Deep contour-aware networks for object instance segmentation from histology images [J].

Chen, Hao ;

Qi, Xiaojuan ;

Yu, Lequan ;

Dou, Qi ;

Qin, Jing ;

Heng, Pheng-Ann .

MEDICAL IMAGE ANALYSIS, 2017, 36 :135-146

[8]

Chen J., 2021, arXiv, DOI DOI 10.48550/ARXIV.2102.04306

[9] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[10]

Dosovitskiy A., 2021, An image is worth 16 x16 words: transformers for image recognition at scale, P1

← 1 2 3 4 5 →