Frequency-Aware Feature Fusion for Dense Image Prediction

被引:35
作者
Chen, Linwei [1 ,2 ]
Fu, Ying [1 ,2 ]
Gu, Lin [3 ,4 ]
Yan, Chenggang [5 ]
Harada, Tatsuya [3 ,4 ]
Huang, Gao [6 ]
机构
[1] Beijing Inst Technol, MIIT Key Lab Complex Field Intelligent Sensing, Beijing 100811, Peoples R China
[2] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100811, Peoples R China
[3] RIKEN AIP, Tokyo 1030027, Japan
[4] Univ Tokyo, Res Ctr Adv Sci & Technol RCAST, Tokyo 1538904, Japan
[5] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310005, Peoples R China
[6] Tsinghua Univ, Dept Automat, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature fusion; feature upsampling; dense prediction; semantic segmentation; object detection; instance segmentation; panoptic segmentation;
D O I
10.1109/TPAMI.2024.3449959
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dense image prediction tasks demand features with strong category information and precise spatial boundary details at high resolution. To achieve this, modern hierarchical models often utilize feature fusion, directly adding upsampled coarse features from deep layers and high-resolution features from lower levels. In this paper, we observe rapid variations in fused feature values within objects, resulting in intra-category inconsistency due to disturbed high-frequency features. Additionally, blurred boundaries in fused features lack accurate high frequency, leading to boundary displacement. Building upon these observations, we propose Frequency-Aware Feature Fusion (FreqFusion), integrating an Adaptive Low-Pass Filter (ALPF) generator, an offset generator, and an Adaptive High-Pass Filter (AHPF) generator. The ALPF generator predicts spatially-variant low-pass filters to attenuate high-frequency components within objects, reducing intra-class inconsistency during upsampling. The offset generator refines large inconsistent features and thin boundaries by replacing inconsistent features with more consistent ones through resampling, while the AHPF generator enhances high-frequency detailed boundary information lost during downsampling. Comprehensive visualization and quantitative analysis demonstrate that FreqFusion effectively improves feature consistency and sharpens object boundaries. Extensive experiments across various dense prediction tasks confirm its effectiveness.
引用
收藏
页码:10763 / 10780
页数:18
相关论文
共 50 条
[41]   Face image deblurring with feature correction and fusion [J].
Ma Long ;
Xu Yu ;
Shu Cong ;
Wei Zoujian ;
Du Jiangbin ;
Zhao Jiayao .
The Visual Computer, 2024, 40 :3693-3707
[42]   Augmented Feature Fusion for Image Retrieval System [J].
Zhou, Yang ;
Zeng, Dan ;
Zhang, Shiliang ;
Tian, Qi .
ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, :447-450
[43]   An Effective Feature Fusion Solution for Image Retrieval [J].
Li, Yuanyuan ;
Li, Zhiyang ;
Qu, Wenyu ;
Wang, Jun ;
Wu, Junfeng ;
Ji, Changqing .
JOURNAL OF MULTIPLE-VALUED LOGIC AND SOFT COMPUTING, 2018, 31 (1-2) :51-65
[44]   Rethinking Image Inpainting with Attention Feature Fusion [J].
Qu, Shuyi ;
Huang, Kaizhu ;
Wang, Qiufeng ;
Dong, Bin .
NEURAL INFORMATION PROCESSING, ICONIP 2022, PT III, 2023, 13625 :693-704
[45]   An Improved Feature Fusion Algorithm for Image Retrieval [J].
Che, Shilin ;
Chen, Dongfang ;
Wang, Xiaofeng .
2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, :384-389
[46]   iTransformer-FFC: A Frequency-Aware Transformer Framework for Multi-Scale Time Series Forecasting [J].
Tang, Yongli ;
Cai, Zhongqi .
ELECTRONICS, 2025, 14 (12)
[47]   Link prediction based on feature representation and fusion [J].
Xiao, Yunpeng ;
Li, Rui ;
Lu, Xingyu ;
Liu, Yanbing .
INFORMATION SCIENCES, 2021, 548 :1-17
[48]   Hardware-Aware and Efficient Feature Fusion Network Search [J].
Guo J.-M. ;
Zhang R. ;
Zhi T. ;
He D.-Y. ;
Huang D. ;
Chang M. ;
Zhang X.-S. ;
Guo Q. .
Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (11) :2420-2432
[49]   Semantic-Aware Dehazing Network With Adaptive Feature Fusion [J].
Zhang, Shengdong ;
Ren, Wenqi ;
Tan, Xin ;
Wang, Zhi-Jie ;
Liu, Yong ;
Zhang, Jingang ;
Zhang, Xiaoqin ;
Cao, Xiaochun .
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (01) :454-467
[50]   Content Feature and Style Feature Fusion Network for Single Image Dehazing [J].
Yang A.-P. ;
Liu J. ;
Xing J.-N. ;
Li X.-X. ;
He Y.-Q. .
Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (04) :769-777