KonIQ-10k: An Ecologically Valid Database for Deep Learning of Blind Image Quality Assessment

被引:440
作者
Hosu, Vlad [1 ]
Lin, Hanhe [1 ]
Sziranyi, Tamas [2 ]
Saupe, Dietmar [1 ]
机构
[1] Univ Konstanz, Dept Comp & Informat Sci, D-78464 Constance, Germany
[2] Inst Comp Sci & Control SZTAKI, Machine Percept Res Lab, H-1111 Budapest, Hungary
基金
匈牙利科学研究基金会;
关键词
Image database; diversity sampling; crowdsourcing; blind image quality assessment; subjective image quality assessment; convolutional neural networks; deep learning;
D O I
10.1109/TIP.2020.2967829
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning methods for image quality assessment (IQA) are limited due to the small size of existing datasets. Extensive datasets require substantial resources both for generating publishable content and annotating it accurately. We present a systematic and scalable approach to creating KonIQ-10k, the largest IQA dataset to date, consisting of 10,073 quality scored images. It is the first in-the-wild database aiming for ecological validity, concerning the authenticity of distortions, the diversity of content, and quality-related indicators. Through the use of crowdsourcing, we obtained 1.2 million reliable quality ratings from 1,459 crowd workers, paving the way for more general IQA models. We propose a novel, deep learning model (KonCept512), to show an excellent generalization beyond the test set (0.921 SROCC), to the current state-of-the-art database LIVE-in-the-Wild (0.825 SROCC). The model derives its core performance from the InceptionResNet architecture, being trained at a higher resolution than previous models (512x384). Correlation analysis shows that KonCept512 performs similar to having 9 subjective scores for each test image.
引用
收藏
页码:4041 / 4056
页数:16
相关论文
共 64 条
[1]  
[Anonymous], 2016, Electronic Imaging
[2]  
[Anonymous], 2015, Tiny ImageNet Visual Recognition Challenge., DOI DOI 10.1109/ICCV.2015.123
[3]   On the use of deep learning for blind image quality assessment [J].
Bianco, Simone ;
Celona, Luigi ;
Napoletano, Paolo ;
Schettini, Raimondo .
SIGNAL IMAGE AND VIDEO PROCESSING, 2018, 12 (02) :355-362
[4]  
Bosse S, 2016, IEEE IMAGE PROC, P3773, DOI 10.1109/ICIP.2016.7533065
[5]  
Chollet F., 2015, KERAS
[6]   Generating Image Distortion Maps Using Convolutional Autoencoders With Application to No Reference Image Quality Assessment [J].
Dendi, Sathya Veera Reddy ;
Dev, Chander ;
Kothari, Narayan ;
Channappayya, Sumohana S. .
IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (01) :89-93
[7]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8]  
Dodge S, 2016, 2016 EIGHTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX)
[9]   Blind image quality prediction by exploiting multi-level deep representations [J].
Gao, Fei ;
Yu, Jun ;
Zhu, Suguo ;
Huang, Qingming ;
Han, Qi .
PATTERN RECOGNITION, 2018, 81 :432-442
[10]   Massive Online Crowdsourced Study of Subjective and Objective Picture Quality [J].
Ghadiyaram, Deepti ;
Bovik, Alan C. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (01) :372-387