Searching a Hierarchically Aggregated Fusion Architecture for Fast Multi-Modality Image Fusion

被引：44

作者：

Liu, Risheng ^{[1
]}

Liu, Zhu ^{[1
]}

Liu, Jinyuan ^{[1
]}

Fan, Xin ^{[1
]}

机构：

[1] Dalian Univ Technol, Dalian, Peoples R China

来源：

PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

Hierachically aggregated fusion architecture; fusion-oriented search; space; collaborative architecture search; multi-modality fusion;

D O I：

10.1145/3474085.3475299

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multi-modality image fusion refers to generating a complementary image that integrates typical characteristics from source images. In recent years, we have witnessed the remarkable progress of deep learning models for multi-modality fusion. Existing CNN-based approaches strain every nerve to design various architectures for realizing these tasks in an end-to-end manner. However, these handcrafted designs are unable to cope with the high demanding fusion tasks, resulting in blurred targets and lost textural details. To alleviate these issues, in this paper, we propose a novel approach, aiming at searching effective architectures according to various modality principles and fusion mechanisms. Specifically, we construct a hierarchically aggregated fusion architecture to extract and refine fused features from feature-level and object-level fusion perspectives, which is responsible for obtaining complementary target/detail representations. Then by investigating diverse effective practices, we composite a more flexible fusion-specific search space. Motivated by the collaborative principle, we employ a new search strategy with different principled losses and hardware constraints for sufficient discovery of components. As a result, we can obtain a task-specific architecture with fast inference time. Extensive quantitative and qualitative results demonstrate the superiority and versatility of our method against state-of-the-art methods.

引用

页码：1600 / 1608

页数：9

共 51 条

[31] DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network [J].

Liu, Rui ;

Ge, Yixiao ;

Choi, Ching Lam ;

Wang, Xiaogang ;

Li, Hongsheng .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :16372-16381

[32]

Liu Y, 2017, BIOMED RES INT, V2017, P1, DOI DOI 10.1155/2017/9374026

[33] Simultaneous image fusion and denoising with adaptive sparse representation [J].

Liu, Yu ;

Wang, Zengfu .

IET IMAGE PROCESSING, 2015, 9 (05) :347-357

[34] Medical Image Fusion via Convolutional Sparsity Based Morphological Component Analysis [J].

Liu, Yu ;

Chen, Xun ;

Ward, Rabab K. ;

Wang, Z. Jane .

IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (03) :485-489

[35] GANMcC: A Generative Adversarial Network With Multiclassification Constraints for Infrared and Visible Image Fusion [J].

Ma, Jiayi ;

Zhang, Hao ;

Shao, Zhenfeng ;

Liang, Pengwei ;

Xu, Han .

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70

[36] FusionGAN: A generative adversarial network for infrared and visible image fusion [J].

Ma, Jiayi ;

Yu, Wei ;

Liang, Pengwei ;

Li, Chang ;

Jiang, Junjun .

INFORMATION FUSION, 2019, 48 :11-26

[37] Infrared and visible image fusion via gradient transfer and total variation minimization [J].

Ma, Jiayi ;

Chen, Chen ;

Li, Chang ;

Huang, Jun .

INFORMATION FUSION, 2016, 31 :100-109

[38]

Ma Jiayi, 2020, IEEE T IMAGE PROCESS, V29

[39] Infrared and visible image fusion based on visual saliency map and weighted least square optimization [J].

Ma, Jinlei ;

Zhou, Zhiqiang ;

Wang, Bo ;

Zong, Hua .

INFRARED PHYSICS & TECHNOLOGY, 2017, 82 :8-17

[40] MFAS: Multimodal Fusion Architecture Search [J].

Perez-Rua, Juan-Manuel ;

Vielzeuf, Valentin ;

Pateux, Stephane ;

Baccouche, Moez ;

Jurie, Frederic .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6959-6968

← 1 2 3 4 5 6 →