Bilinear Pooling of Transformer Embeddings for Blind Image Quality Assessment

被引:0
|
作者
Feng, Yeli [1 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
关键词
Vision transformer; Bilinear pooling; Blind image quality assessment; Authentic distortions;
D O I
10.1007/978-981-97-3559-4_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Blind image quality assessment finds its practical usage in real-world applications where image distortions are more complex than computer generated synthetic distortions, but high-quality images are not available for reference. In the past decade, research in blind quality prediction has advanced tremendously thanks to the success of convolutional neural networks. However, it is far from human-like performance and remains a challenging research problem. For the first time, this paper investigates the potential of imagenet pre-trained Vision Transformer, a new generation architecture for image understanding, in providing better quality aware features. This paper proposed BPTIQ, a method that leverages multi-level transformer embeddings with bilinear feature pooling and non-monotonic error regularization for blind quality assessment of authentic distortions. The effectiveness of the proposed method was evaluated with four IQA databases with authentic distortions. Experimental outcomes and ablation studies show that the performance of BPTIQ is competitive with nine state-of-the-art IQA methods in comparison that mainly utilized pre-trained convolutional neural networks for feature extraction. BPTIQ performed the best over two of the four single databases and demonstrated a more robust cross-database generalization capability.
引用
收藏
页码:137 / 150
页数:14
相关论文
共 50 条
  • [31] Sparse Representation for Blind Image Quality Assessment
    He, Lihuo
    Tao, Dacheng
    Li, Xuelong
    Gao, Xinbo
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 1146 - 1153
  • [32] Blind Image Quality Assessment With Active Inference
    Ma, Jupo
    Wu, Jinjian
    Li, Leida
    Dong, Weisheng
    Xie, Xuemei
    Shi, Guangming
    Lin, Weisi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3650 - 3663
  • [33] Learning to Rank for Blind Image Quality Assessment
    Gao, Fei
    Tao, Dacheng
    Gao, Xinbo
    Li, Xuelong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (10) : 2275 - 2290
  • [34] Blind image quality assessment through anisotropy
    Gabarda, Salvador
    Cristobal, Gabriel
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2007, 24 (12) : B42 - B51
  • [35] Blind Quality Assessment for Slice of Microtomographic Image
    Kornilov, Anton
    Safonov, Ilia
    Yakimchuk, Ivan
    PROCEEDINGS OF THE 24TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), 2019, : 170 - 178
  • [36] Totally Blind Image Quality Assessment Evaluator
    Abdalmajeed, Saifeldeen
    Jiao Shuhong
    Wei, Liu
    FIFTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2013), 2014, 9069
  • [37] Blind Image Quality Assessment in Shearlet Domain
    Ren, Yuling
    Lu, Wen
    He, Lihuo
    Gao, Xinbo
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: IMAGE AND VIDEO DATA ENGINEERING, ISCIDE 2015, PT I, 2015, 9242 : 472 - 481
  • [38] LIQA: Lifelong Blind Image Quality Assessment
    Liu, Jianzhao
    Zhou, Wei
    Li, Xin
    Xu, Jiahua
    Chen, Zhibo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5358 - 5373
  • [39] Blind image quality assessment in the contourlet domain
    Li, Chaofeng
    Guan, Tuxin
    Zheng, Yuhui
    Zhong, Xiaochun
    Wu, Xiaojun
    Bovik, Alan
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 91
  • [40] Blind image quality assessment with semantic information
    Ji, Weiping
    Wu, Jinjian
    Shi, Guangming
    Wan, Wenfei
    Xie, Xuemei
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 58 : 195 - 204