Active Learning-Based Sample Selection for Label-Efficient Blind Image Quality Assessment

被引：2

作者：

Song, Tianshu ^{[1
]}

Li, Leida ^{[2
]}

Cheng, Deqiang ^{[1
]}

Chen, Pengfei ^{[2
]}

Wu, Jinjian ^{[2
]}

机构：

[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Jiangsu, Peoples R China

[2] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 07期

基金：

中国国家自然科学基金;

关键词：

Distortion; Training; Predictive models; Image quality; Uncertainty; Tuning; Task analysis; Image quality assessment; active learning; generalization; data-efficient; FRAMEWORK;

D O I：

10.1109/TCSVT.2023.3341611

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Despite the considerable effort devoted to high-generalizable blind image quality assessment (BIQA), the generalization performance of the state-of-the-art metrics remains limited when facing new visual scenes. A straightforward way to address the dilemma is labeling a great number of images from the new scene and subsequently training a new model, which is quite labor-intensive and cost-expensive. Hence, there is an urgent need to mitigate the dependency on labeled samples by designing a data-efficient BIQA algorithm. Motivated by the above facts, this paper presents an Active Learning-based IQA (AL-IQA) framework, which reduces the requirement for training samples by selecting representative images from two perspectives, including distortion and content. Specifically, in terms of distortion, we design distortion prompts and adopt Contrastive Language-Image Pre-Training (CLIP) to predict image distortion in a zero-shot manner. Then, we employ curriculum learning-inspired strategy to select samples with gradually increasing difficulty (measured by prediction uncertainty of CLIP), in order to facilitate model training. Meantime, in terms of content, we adopt distribution matching-based dataset distillation to distill unlabeled images into several high-density informative synthetic images. Then, feature distances between unlabeled images and distilled images are compared to identify images with the most representative content. Finally, Borda count is adopted to capture a consensus of both distortion and content through weighted counting, and prompt tuning is utilized for adapting the model to the IQA task. Extensive experiments are conducted on five IQA datasets, and the results demonstrate that the proposed AL-IQA not only effectively reduces the number of training samples but also achieves state-of-the-art prediction accuracy and generalization performance. The source code is available at https://github.com/esnthere/AL-IQA.

引用

页码：5884 / 5896

页数：13

共 67 条

[1] On the use of deep learning for blind image quality assessment [J].

Bianco, Simone ;

Celona, Luigi ;

Napoletano, Paolo ;

Schettini, Raimondo .

SIGNAL IMAGE AND VIDEO PROCESSING, 2018, 12 (02) :355-362

[2] Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment [J].

Bosse, Sebastian ;

Maniry, Dominique ;

Mueller, Klaus-Robert ;

Wiegand, Thomas ;

Samek, Wojciech .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (01) :206-219

[3] Dataset Distillation by Matching Training Trajectories [J].

Cazenavette, George ;

Wang, Tongzhou ;

Torralba, Antonio ;

Efros, Alexei A. ;

Zhu, Jun-Yan .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :10708-10717

[4] No-Reference Blur Assessment of Digital Pictures Based on Multifeature Classifiers [J].

Ciancio, Alexandre ;

Targino da Costa, Andre Luiz N. ;

da Silva, Eduardo A. B. ;

Said, Amir ;

Samadani, Ramin ;

Obrador, Pere .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (01) :64-75

[5]

Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929

[6] The original Borda count and partial voting [J].

Emerson, Peter .

SOCIAL CHOICE AND WELFARE, 2013, 40 (02) :353-358

[7] Active Sampling Exploiting Reliable Informativeness for Subjective Image Quality Assessment Based on Pairwise Comparison [J].

Fan, Zhiwei ;

Jiang, Tingting ;

Huang, Tiejun .

IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (12) :2720-2735

[8] Perceptual Quality Assessment of Smartphone Photography [J].

Fang, Yuming ;

Zhu, Hanwei ;

Zeng, Yan ;

Ma, Kede ;

Wang, Zhou .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :3674-3683

[9] Massive Online Crowdsourced Study of Subjective and Objective Picture Quality [J].

Ghadiyaram, Deepti ;

Bovik, Alan C. .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (01) :372-387

[10] Using Free Energy Principle For Blind Image Quality Assessment [J].

Gu, Ke ;

Zhai, Guangtao ;

Yang, Xiaokang ;

Zhang, Wenjun .

IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (01) :50-63

← 1 2 3 4 5 6 7 →