Dual-attention pyramid transformer network for No-Reference Image Quality Assessment

被引:0
|
作者
Ma, Jiliang [1 ]
Chen, Yihua [1 ]
Chen, Lv [1 ]
Tang, Zhenjun [1 ]
机构
[1] Guangxi Normal Univ, Key Lab Educ Blockchain & Intelligent Technol, Minist Educ, Guilin 541004, Peoples R China
关键词
Image quality assessment; Pyramid structure; Transformer; Dual-attention; Multi-scale features; JOINT STATISTICS; METRICS;
D O I
10.1016/j.eswa.2024.125008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
No-Reference Image Quality Assessment (NR-IQA) is a fundamental and important task in the field of computer vision. Most NR-IQA methods have limitation in making desirable NR-IQA performance due to the lack of sufficiently rich features. To address this problem, we propose a dual-attention pyramid Transformer network for NR-IQA. In the proposed method, a feature extraction module is firstly used to extract multi-scale features which contain rich distortion and semantic information. Then, a pyramid Transformer network with channel and spatial attentions is designed to learn multi-scale global features from spatial and channel aspects. The combination of pyramid structure and dual attentions enables our network to focus on features in different regions of the image and learn richer and more comprehensive global features. This in turn improves the quality score prediction performance. Finally, the score prediction module predicts the quality scores in different stages of the pyramid Transformer network by channel adaptive prediction branches and determines the final quality score by aggregating these quality scores. Extensive experiments performed on four widely used public databases show that our proposed method is superior to some state-of-the-art NR-IQA methods in perceiving image quality.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Unifying Dual-Attention and Siamese Transformer Network for Full-Reference Image Quality Assessment
    Tang, Zhenjun
    Chen, Zhiyuan
    Li, Zhixin
    Zhong, Bineng
    Zhang, Xianquan
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
  • [2] A Dual-Attention Transformer Network for Pansharpening
    Wu, Kun
    Yang, Xiaomin
    Nie, Zihao
    Li, Haoran
    Jeon, Gwanggil
    IEEE SENSORS JOURNAL, 2024, 24 (05) : 5500 - 5511
  • [3] Dual-Feature Aggregation Network for No-Reference Image Quality Assessment
    Chen, Yihua
    Chen, Zhiyuan
    Yu, Mengzhu
    Tang, Zhenjun
    MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 149 - 161
  • [4] MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment
    Yang, Sidi
    Wu, Tianhe
    Shi, Shuwei
    Lao, Shanshan
    Gong, Yuan
    Cao, Mingdeng
    Wang, Jiahao
    Yang, Yujiu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 1190 - 1199
  • [5] No-Reference Image Quality Assessment: An Attention Driven Approach
    Chen, Diqi
    Wang, Yizhou
    Gao, Wen
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 6496 - 6506
  • [6] No-Reference Image Quality Assessment: An Attention Driven Approach
    Chen, Diqi
    Wang, Yizhou
    Ren, Hongyu
    Gao, Wen
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 376 - 385
  • [7] Lightweight transformer and multi-head prediction network for no-reference image quality assessment
    Tang, Zhenjun
    Chen, Yihua
    Chen, Zhiyuan
    Liang, Xiaoping
    Zhang, Xianquan
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (04): : 1947 - 1957
  • [8] Lightweight transformer and multi-head prediction network for no-reference image quality assessment
    Zhenjun Tang
    Yihua Chen
    Zhiyuan Chen
    Xiaoping Liang
    Xianquan Zhang
    Neural Computing and Applications, 2024, 36 : 1931 - 1946
  • [9] HIERARCHICAL FEATURE FUSION TRANSFORMER FOR NO-REFERENCE IMAGE QUALITY ASSESSMENT
    Wang, Zesheng
    Wu, Wei
    Yuan, Liang
    Sun, Wei
    Chen, Ying
    Li, Kai
    Zhai, Guangtao
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2205 - 2209
  • [10] No-reference image quality assessment based on feature tokenizer and Transformer
    Song, Wei
    Li, Jia-jin
    Liu, Xiao-chen
    Liu, Zhi-xiang
    Shi, Shao-hua
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (03) : 356 - 367