GraphIQA: Learning Distortion Graph Representations for Blind Image Quality Assessment

被引:74
作者
Sun, Simeng [1 ]
Yu, Tao [1 ]
Xu, Jiahua [1 ]
Zhou, Wei [1 ]
Chen, Zhibo [1 ]
机构
[1] Univ Sci & Technol China, Dept Elect Engineer & Informat Sci, Hefei 230026, Anhui, Peoples R China
关键词
Blind image quality assessment; graph representation learning; pre-training; STATISTICS;
D O I
10.1109/TMM.2022.3152942
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A good distortion representation is crucial for the success of deep blind image quality assessment (BIQA). However, most previous methods do not effectively model the relationship between distortions or the distribution of samples with the same distortion type but different distortion levels. In this work, we start from the analysis of the relationship between perceptual image quality and distortion-related factors, such as distortion types and levels. Then, we propose a Distortion Graph Representation (DGR) learning framework for IQA, named GraphIQA, in which each distortion is represented as a graph, i.e., DGR. One can distinguish distortion types by learning the contrast relationship between these different DGRs, and can infer the ranking distribution of samples from different levels in a DGR. Specifically, we develop two sub-networks to learn the DGRs: a) Type Discrimination Network (TDN) that aims to embed DGR into a compact code for better discriminating distortion types and learning the relationship between types; b) Fuzzy Prediction Network (FPN) that aims to extract the distributional characteristics of the samples in a DGR and predicts fuzzy degrees based on a Gaussian prior. Experiments show that our GraphIQA achieves state-of-the-art performance on many benchmark datasets of both synthetic and authentic distortions.
引用
收藏
页码:2912 / 2925
页数:14
相关论文
共 78 条
[1]  
Golestaneh SA, 2020, Arxiv, DOI [arXiv:2006.03783, DOI 10.48550/ARXIV.2006.03783]
[2]  
[Anonymous], 2015, ACS SYM SER
[3]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[4]  
Beyer L., 2017, In defense of the triplet loss for person re-identification, P1
[5]   Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment [J].
Bosse, Sebastian ;
Maniry, Dominique ;
Mueller, Klaus-Robert ;
Wiegand, Thomas ;
Samek, Wojciech .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (01) :206-219
[6]   Emerging Properties in Self-Supervised Vision Transformers [J].
Caron, Mathilde ;
Touvron, Hugo ;
Misra, Ishan ;
Jegou, Herve ;
Mairal, Julien ;
Bojanowski, Piotr ;
Joulin, Armand .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9630-9640
[7]  
Casanova Arantxa, 2017, P INT C LEARN REPR I
[8]   Stereoscopic Omnidirectional Image Quality Assessment Based on Predictive Coding Theory [J].
Chen, Zhibo ;
Xu, Jiahua ;
Lin, Chaoyi ;
Zhou, Wei .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (01) :103-117
[9]   Blind Stereoscopic Video Quality Assessment: From Depth Perception to Overall Experience [J].
Chen, Zhibo ;
Zhou, Wei ;
Li, Weiping .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (02) :721-734
[10]   Full Reference Quality Assessment for Image Retargeting Based on Natural Scene Statistics Modeling and Bi-Directional Saliency Similarity [J].
Chen, Zhibo ;
Lin, Jianxin ;
Liao, Ning ;
Chen, Chang Wen .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (11) :5138-5148