Bilinear Pooling of Transformer Embeddings for Blind Image Quality Assessment

被引:0
|
作者
Feng, Yeli [1 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
关键词
Vision transformer; Bilinear pooling; Blind image quality assessment; Authentic distortions;
D O I
10.1007/978-981-97-3559-4_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Blind image quality assessment finds its practical usage in real-world applications where image distortions are more complex than computer generated synthetic distortions, but high-quality images are not available for reference. In the past decade, research in blind quality prediction has advanced tremendously thanks to the success of convolutional neural networks. However, it is far from human-like performance and remains a challenging research problem. For the first time, this paper investigates the potential of imagenet pre-trained Vision Transformer, a new generation architecture for image understanding, in providing better quality aware features. This paper proposed BPTIQ, a method that leverages multi-level transformer embeddings with bilinear feature pooling and non-monotonic error regularization for blind quality assessment of authentic distortions. The effectiveness of the proposed method was evaluated with four IQA databases with authentic distortions. Experimental outcomes and ablation studies show that the performance of BPTIQ is competitive with nine state-of-the-art IQA methods in comparison that mainly utilized pre-trained convolutional neural networks for feature extraction. BPTIQ performed the best over two of the four single databases and demonstrated a more robust cross-database generalization capability.
引用
收藏
页码:137 / 150
页数:14
相关论文
共 50 条
  • [41] The context effect for blind image quality assessment
    Liang, Zehong
    Lu, Wen
    Zheng, Yong
    He, Weiquan
    Yang, Jiachen
    NEUROCOMPUTING, 2023, 521 : 172 - 180
  • [42] Continual Learning for Blind Image Quality Assessment
    Zhang, Weixia
    Li, Dingquan
    Ma, Chao
    Zhai, Guangtao
    Yang, Xiaokang
    Ma, Kede
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 2864 - 2878
  • [43] Swin Transformer Fusion Network for Image Quality Assessment
    Kim, Hyeongmyeon
    Yim, Changhoon
    IEEE ACCESS, 2024, 12 : 57741 - 57754
  • [44] Blind Image Quality Assessment by Pairwise Ranking Image Series
    Li Xu
    Xiuhua Jiang
    ChinaCommunications, 2023, 20 (09) : 127 - 143
  • [45] Blind Image Quality Assessment by Pairwise Ranking Image Series
    Xu, Li
    Jiang, Xiuhua
    CHINA COMMUNICATIONS, 2023, 20 (09) : 127 - 143
  • [46] Augmenting Blind Image Quality Assessment using Image Semantics
    Siahaan, Ernestasia
    Hanjalic, Alan
    Redi, Judith A.
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 307 - 312
  • [47] Statistical hypothesis testing as a novel perspective of pooling for image quality assessment
    Zhu, Rui
    Zhou, Fei
    Yang, Wenming
    Xue, Jing-Hao
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 114
  • [48] Frequency and spatial pooling of visual differences for still image quality assessment
    Le Callet, P
    Saadane, A
    Barba, D
    HUMAN VISION AND ELECTRONIC IMAGING V, 2000, 3959 : 595 - 603
  • [49] Quality-Aware CLIP for Blind Image Quality Assessment
    Pan, Wensheng
    Yang, Zhifu
    Liu, DingMing
    Fang, Chenxin
    Zhang, Yan
    Dai, Pingyang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 396 - 408
  • [50] Blind Predicting Similar Quality Map for Image Quality Assessment
    Pan, Da
    Shi, Ping
    Hou, Ming
    Ying, Zefeng
    Fu, Sizhe
    Zhang, Yuan
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6373 - 6382