Deep Frequency Re-calibration U-Net for Medical Image Segmentation

Cited by: 34

Authors:

Azad, Reza [1]

Bozorgpour, Afshin [2]

Asadi-Aghbolaghi, Maryam [3]

Merhof, Dorit [1]

Escalera, Sergio [4,5]

Affiliations:

[1] Rhein Westfal TH Aachen, Inst Imaging & Comp Vis, Aachen, Germany

[2] Sharif Univ Technol, Tehran, Iran

[3] Inst Res Fundamental Sci IPM, Sch Comp Sci, Tehran, Iran

[4] Univ Barcelona, Barcelona, Spain

[5] Comp Vis Ctr, Barcelona, Spain

Source:

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021) | 2021

DOI:

10.1109/ICCVW54120.2021.00366

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The human visual cortex is biased towards shape components while CNNs produce texture biased features. This fact may explain why the performance of CNN significantly degrades with low-labeled input data scenarios. In this paper, we propose a frequency re-calibration U-Net (FRCU-Net) for medical image segmentation. Representing an object in terms of frequency may reduce the effect of texture bias, resulting in better generalization for a low data regime. To do so, we apply the Laplacian pyramid in the bottleneck layer of the U-shaped structure. The Laplacian pyramid represents the object proposal in different frequency domains, where the high frequencies are responsible for the texture information and lower frequencies might be related to the shape. Adaptively re-calibrating these frequency representations can produce a more discriminative representation for describing the object of interest. To this end, we first propose to use a channel-wise attention mechanism to capture the relationship between the channels of a set of feature maps in one layer of the frequency pyramid. Second, the extracted features of each level of the pyramid are then combined through a non-linear function based on their impact on the final segmentation output. The proposed FRCU-Net is evaluated on five datasets ISIC 2017, ISIC 2018, the PH2, lung segmentation, and SegPC 2021 challenge datasets and compared to existing alternatives, achieving state-of-the-art results.
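The re-calibration idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the pyramid here is built from differences of Gaussian-blurred copies at a fixed resolution, the channel attention is a squeeze-and-excitation style gate, and the level fusion uses softmax weights as a stand-in for the paper's non-linear combination. All function names, the `sigmas` values, the reduction ratio, and the random weights are assumptions made for illustration only.

```python
import numpy as np

def gaussian_kernel1d(sigma, radius=None):
    """Normalized 1-D Gaussian kernel (radius defaults to about 3 sigma)."""
    radius = radius or int(3 * sigma + 0.5)
    x = np.arange(-radius, radius + 1)
    k = np.exp(-(x ** 2) / (2 * sigma ** 2))
    return k / k.sum()

def gaussian_blur(feat, sigma):
    """Blur a (C, H, W) feature map with a separable Gaussian and reflect padding."""
    k = gaussian_kernel1d(sigma)
    r = len(k) // 2
    padded = np.pad(feat, ((0, 0), (r, r), (r, r)), mode="reflect")
    out = np.apply_along_axis(lambda v: np.convolve(v, k, mode="valid"), 1, padded)
    out = np.apply_along_axis(lambda v: np.convolve(v, k, mode="valid"), 2, out)
    return out

def laplacian_pyramid(feat, sigmas=(1.0, 2.0)):
    """Split features into band-pass levels plus a low-frequency residual.

    Assumption: levels stay at the input resolution (difference of Gaussians),
    so the levels sum back to the original feature map.
    """
    levels, prev = [], feat
    for s in sigmas:
        blurred = gaussian_blur(feat, s)
        levels.append(prev - blurred)  # higher frequencies first
        prev = blurred
    levels.append(prev)                # lowest-frequency residual
    return levels

def channel_attention(level, w1, w2):
    """Squeeze-and-excitation style gate: one weight in (0, 1) per channel."""
    squeeze = level.mean(axis=(1, 2))            # (C,)
    hidden = np.maximum(w1 @ squeeze, 0.0)       # ReLU, (C // ratio,)
    gate = 1.0 / (1.0 + np.exp(-(w2 @ hidden)))  # sigmoid, (C,)
    return level * gate[:, None, None]

def combine(levels, scores):
    """Fuse levels with softmax weights (hypothetical stand-in for the paper's fusion)."""
    w = np.exp(scores - scores.max())
    w /= w.sum()
    return sum(wi * li for wi, li in zip(w, levels))

# Toy bottleneck feature map: 8 channels, 16 x 16 spatial size.
rng = np.random.default_rng(0)
C, ratio = 8, 2
feat = rng.standard_normal((C, 16, 16))
w1 = rng.standard_normal((C // ratio, C))   # untrained placeholder weights
w2 = rng.standard_normal((C, C // ratio))
scores = np.zeros(3)                        # would be learned in practice

levels = laplacian_pyramid(feat)
recalibrated = [channel_attention(lv, w1, w2) for lv in levels]
out = combine(recalibrated, scores)
print(out.shape)
```

In a trained network the weights `w1`, `w2`, and `scores` would be learned end to end with the segmentation loss; here they are random or zero placeholders, so the output is only meaningful as a shape and data-flow check.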

Pages: 3267-3276

Page count: 10

