SPIQ: A Self-Supervised Pre-Trained Model for Image Quality Assessment

被引:21
作者
Chen, Pengfei [1 ]
Li, Leida [2 ]
Wu, Qingbo [3 ]
Wu, Jinjian [2 ]
机构
[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Jiangsu, Peoples R China
[2] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
关键词
Distortion; Feature extraction; Task analysis; Transformers; Training; Predictive models; Image quality; Blind image quality assessment; self-supervised pre-training; contrastive learning; INDEX;
D O I
10.1109/LSP.2022.3145326
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Blind image quality assessment (BIQA) has witnessed a flourishing progress due to the rapid advances in deep learning technique. The vast majority of prior BIQA methods try to leverage models pre-trained on ImageNet to mitigate the data shortage problem. These well-trained models, however, can be sub-optimal when applied to BIQA task that varies considerably from the image classification domain. To address this issue, we make the first attempt to leverage the plentiful unlabeled data to conduct self-supervised pre-training for BIQA task. Based on the distorted images generated from the high-quality samples using the designed distortion augmentation strategy, the proposed pre-training is implemented by a feature representation prediction task. Specifically, patch-wise feature representations corresponding to a certain grid are integrated to make prediction for the representation of the patch below it. The prediction quality is then evaluated using a contrastive loss to capture quality-aware information for BIQA task. Experimental results conducted on KADID-10 k and KonIQ-10 k databases demonstrate that the learned pre-trained model can significantly benefit the existing learning based IQA models.
引用
收藏
页码:513 / 517
页数:5
相关论文
共 42 条
[1]   SpEED-QA: Spatial Efficient Entropic Differencing for Image and Video Quality [J].
Bampis, Christos G. ;
Gupta, Praful ;
Soundararajan, Rajiv ;
Bovik, Alan C. .
IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (09) :1333-1337
[2]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[3]  
Caron M, 2020, ADV NEUR IN, V33
[4]   Unsupervised Curriculum Domain Adaptation for No-Reference Video Quality Assessment [J].
Chen, Pengfei ;
Li, Leida ;
Wu, Jinjian ;
Dong, Weisheng ;
Shi, Guangming .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :5158-5167
[5]   Contrastive Self-Supervised Pre-Training for Video Quality Assessment [J].
Chen, Pengfei ;
Li, Leida ;
Wu, Jinjian ;
Dong, Weisheng ;
Shi, Guangming .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :458-471
[6]   RIRNet: Recurrent-In-Recurrent Network for Video Quality Assessment [J].
Chen, Pengfei ;
Li, Leida ;
Ma, Lei ;
Wu, Jinjian ;
Shi, Guangming .
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, :834-842
[7]   Blind quality index for tone-mapped images based on luminance partition [J].
Chen, Pengfei ;
Li, Leida ;
Zhang, Xinfeng ;
Wang, Shanshe ;
Tan, Allen .
PATTERN RECOGNITION, 2019, 89 :108-118
[8]  
Chen T, 2020, PR MACH LEARN RES, V119
[9]   Knowledge-guided Deep Reinforcement Learning for Interactive Recommendation [J].
Chen, Xiaocong ;
Huang, Chaoran ;
Yao, Lina ;
Wang, Xianzhi ;
Liu, Wei ;
Zhang, Wenjie .
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[10]   Perceptual Image Quality Assessment with Transformers [J].
Cheon, Manri ;
Yoon, Sung-Jun ;
Kang, Byungyeon ;
Lee, Junwoo .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, :433-442