Frequency-Aware Feature Fusion for Dense Image Prediction

Cited by: 6
Authors
Chen, Linwei [1, 2]
Fu, Ying [1, 2]
Gu, Lin [3, 4]
Yan, Chenggang [5]
Harada, Tatsuya [3, 4]
Huang, Gao [6]
Affiliations
[1] Beijing Inst Technol, MIIT Key Lab Complex Field Intelligent Sensing, Beijing 100811, Peoples R China
[2] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100811, Peoples R China
[3] RIKEN AIP, Tokyo 1030027, Japan
[4] Univ Tokyo, Res Ctr Adv Sci & Technol RCAST, Tokyo 1538904, Japan
[5] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310005, Peoples R China
[6] Tsinghua Univ, Dept Automat, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Feature fusion; feature upsampling; dense prediction; semantic segmentation; object detection; instance segmentation; panoptic segmentation;
DOI
10.1109/TPAMI.2024.3449959
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Dense image prediction tasks demand features with strong category information and precise spatial boundary details at high resolution. To achieve this, modern hierarchical models often utilize feature fusion, directly adding upsampled coarse features from deep layers and high-resolution features from lower levels. In this paper, we observe rapid variations in fused feature values within objects, resulting in intra-category inconsistency due to disturbed high-frequency features. Additionally, blurred boundaries in fused features lack accurate high frequency, leading to boundary displacement. Building upon these observations, we propose Frequency-Aware Feature Fusion (FreqFusion), integrating an Adaptive Low-Pass Filter (ALPF) generator, an offset generator, and an Adaptive High-Pass Filter (AHPF) generator. The ALPF generator predicts spatially-variant low-pass filters to attenuate high-frequency components within objects, reducing intra-class inconsistency during upsampling. The offset generator refines large inconsistent features and thin boundaries by replacing inconsistent features with more consistent ones through resampling, while the AHPF generator enhances high-frequency detailed boundary information lost during downsampling. Comprehensive visualization and quantitative analysis demonstrate that FreqFusion effectively improves feature consistency and sharpens object boundaries. Extensive experiments across various dense prediction tasks confirm its effectiveness.
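The fusion idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: fixed 3x3 box filters stand in for the spatially-variant filters that the ALPF and AHPF generators are said to predict, the offset generator's resampling step is left out, and all function names and the `detail_gain` parameter are hypothetical. It only shows the general pattern of smoothing the upsampled coarse feature and boosting the high-frequency part of the high-resolution feature before adding the two.

```python
# Illustrative sketch only. Fixed filters replace the learned, spatially-variant
# filters described in the abstract; the offset generator is omitted entirely.

def upsample_nearest(x, scale=2):
    """Nearest-neighbour upsampling of a 2D list of floats."""
    return [[x[i // scale][j // scale] for j in range(len(x[0]) * scale)]
            for i in range(len(x) * scale)]

def box_blur(x):
    """3x3 mean filter with edge replication (a simple fixed low-pass filter)."""
    h, w = len(x), len(x[0])
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            total = 0.0
            for di in (-1, 0, 1):
                for dj in (-1, 0, 1):
                    ii = min(max(i + di, 0), h - 1)  # clamp row index at the border
                    jj = min(max(j + dj, 0), w - 1)  # clamp column index at the border
                    total += x[ii][jj]
            out[i][j] = total / 9.0
    return out

def high_pass(x):
    """High-frequency residual: the input minus its low-pass version."""
    low = box_blur(x)
    return [[x[i][j] - low[i][j] for j in range(len(x[0]))] for i in range(len(x))]

def naive_fusion(coarse, fine):
    """Baseline from the abstract: add the upsampled coarse feature to the fine one."""
    up = upsample_nearest(coarse)
    return [[up[i][j] + fine[i][j] for j in range(len(fine[0]))] for i in range(len(fine))]

def frequency_aware_fusion_sketch(coarse, fine, detail_gain=1.0):
    """Smooth the upsampled coarse feature, boost fine-feature detail, then add."""
    smooth = box_blur(upsample_nearest(coarse))  # stand-in for adaptive low-pass filtering
    detail = high_pass(fine)                     # stand-in for adaptive high-pass filtering
    return [[fine[i][j] + detail_gain * detail[i][j] + smooth[i][j]
             for j in range(len(fine[0]))] for i in range(len(fine))]
```

On constant inputs both functions give the same result, since there is no high-frequency content to attenuate or enhance; they differ only where the features vary spatially.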
Pages: 10763-10780
Number of pages: 18