Channel Attention for No-Reference Image Quality Assessment in DCT Domain

被引:1
作者
Wang, Zesheng [1 ]
Yuan, Liang [2 ,3 ]
Zhai, Guangtao [2 ,3 ]
机构
[1] Beijing Univ Chem Technol, Sch Informat Sci & Technol, Beijing 100029, Peoples R China
[2] Shanghai Jiao Tong Univ, ICCI, Shanghai 200240, Peoples R China
[3] Shanghai Jiao Tong Univ, Inst Image Commun & Informat Proc, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Channel attention; discrete cosine transform; image quality assessment; transformer;
D O I
10.1109/LSP.2024.3392671
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Attention mechanism, especially self-attention, has gained great success in image quality assessment. The advent of Transformer has led to a substantial enhancement in no-reference image quality assessment (NR-IQA). Existing works focus on leveraging the global perceptual capability of Transformer encoders to perceive image quality. In this work, we start from a different view and propose a novel multi-frequency channel attention framework for Transformer encoder. Through frequency analysis, we demonstrate mathematically that traditional global average pooling (GAP) is a specific instance of feature decomposition in the frequency domain. With the proof, we use the discrete cosine transform to compress channels, which optimally compresses channels by efficiently utilizing frequency components overlooked by GAP. The experimental results show that the proposed method leads to improvements of performance over the state-of-the-art methods.
引用
收藏
页码:1274 / 1278
页数:5
相关论文
共 36 条
[1]   DISCRETE COSINE TRANSFORM [J].
AHMED, N ;
NATARAJAN, T ;
RAO, KR .
IEEE TRANSACTIONS ON COMPUTERS, 1974, C 23 (01) :90-93
[2]   Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment [J].
Bosse, Sebastian ;
Maniry, Dominique ;
Mueller, Klaus-Robert ;
Wiegand, Thomas ;
Samek, Wojciech .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (01) :206-219
[3]   SPIQ: A Self-Supervised Pre-Trained Model for Image Quality Assessment [J].
Chen, Pengfei ;
Li, Leida ;
Wu, Qingbo ;
Wu, Jinjian .
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 :513-517
[4]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[5]   No-Reference Quality Assessment of Contrast-Distorted Images Based on Natural Scene Statistics [J].
Fang, Yuming ;
Ma, Kede ;
Wang, Zhou ;
Lin, Weisi ;
Fang, Zhijun ;
Zhai, Guangtao .
IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (07) :838-842
[6]   No-Reference Image Quality Assessment via Transformers, Relative Ranking, and Self-Consistency [J].
Golestaneh, S. Alireza ;
Dadsetan, Saba ;
Kitani, Kris M. .
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, :3989-3999
[7]   MetaIQA: Deep Meta-learning for No-Reference Image Quality Assessment [J].
Zhu, Hancheng ;
Li, Leida ;
Wu, Jinjian ;
Dong, Weisheng ;
Shi, Guangming .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :14131-14140
[8]   KonIQ-10k: An Ecologically Valid Database for Deep Learning of Blind Image Quality Assessment [J].
Hosu, Vlad ;
Lin, Hanhe ;
Sziranyi, Tamas ;
Saupe, Dietmar .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) :4041-4056
[9]   Convolutional Neural Networks for No-Reference Image Quality Assessment [J].
Kang, Le ;
Ye, Peng ;
Li, Yi ;
Doermann, David .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1733-1740
[10]   MUSIQ: Multi-scale Image Quality Transformer [J].
Ke, Junjie ;
Wang, Qifei ;
Wang, Yilin ;
Milanfar, Peyman ;
Yang, Feng .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :5128-5137