Active Fine-Tuning From gMAD Examples Improves Blind Image Quality Assessment

被引:19
|
作者
Wang, Zhihua [1 ]
Ma, Kede [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Computational modeling; Databases; Adaptation models; Training; Predictive models; Task analysis; Image quality; Blind image quality assessment; deep neural networks; gMAD competition; active learning; STATISTICS; INDEX;
D O I
10.1109/TPAMI.2021.3071759
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The research in image quality assessment (IQA) has a long history, and significant progress has been made by leveraging recent advances in deep neural networks (DNNs). Despite high correlation numbers on existing IQA datasets, DNN-based models may be easily falsified in the group maximum differentiation (gMAD) competition. Here we show that gMAD examples can be used to improve blind IQA (BIQA) methods. Specifically, we first pre-train a DNN-based BIQA model using multiple noisy annotators, and fine-tune it on multiple synthetically distorted images, resulting in a "top-performing" baseline model. We then seek pairs of images by comparing the baseline model with a set of full-reference IQA methods in gMAD. The spotted gMAD examples are most likely to reveal the weaknesses of the baseline, and suggest potential ways for refinement. We query human quality annotations for the selected images in a well-controlled laboratory environment, and further fine-tune the baseline on the combination of human-rated images from gMAD and existing databases. This process may be iterated, enabling active fine-tuning from gMAD examples for BIQA. We demonstrate the feasibility of our active learning scheme on a large-scale unlabeled image set, and show that the fine-tuned quality model achieves improved generalizability in gMAD, without destroying performance on previously seen databases.
引用
收藏
页码:4577 / 4590
页数:14
相关论文
共 50 条
  • [21] Augmenting Blind Image Quality Assessment using Image Semantics
    Siahaan, Ernestasia
    Hanjalic, Alan
    Redi, Judith A.
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 307 - 312
  • [22] Multitask Deep Neural Network With Knowledge-Guided Attention for Blind Image Quality Assessment
    Zhou, Tianwei
    Tan, Songbai
    Zhao, Baoquan
    Yue, Guanghui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7577 - 7588
  • [23] CDINet: Content Distortion Interaction Network for Blind Image Quality Assessment
    Zheng, Limin
    Luo, Yu
    Zhou, Zihan
    Ling, Jie
    Yue, Guanghui
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 7089 - 7100
  • [24] Fine-Grained Image Quality Assessment: A Revisit and Further Thinking
    Zhang, Xinfeng
    Lin, Weisi
    Huang, Qingming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 2746 - 2759
  • [25] From Pixels to Rich-Nodes: A Cognition-Inspired Framework for Blind Image Quality Assessment
    He, Tian
    Shi, Lin
    Xu, Wenjia
    Wang, Yu
    Qiu, Weijie
    Guo, Houbang
    Jiang, Zhuqing
    IEEE TRANSACTIONS ON BROADCASTING, 2025, 71 (01) : 229 - 239
  • [26] Blind Image Quality Assessment by Visual Neuron Matrix
    Chang, Hua-Wen
    Bi, Xiao-Dong
    Kai, Chen
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1803 - 1807
  • [27] Semantic-aware blind image quality assessment
    Siahaan, Ernestasia
    Hanjalic, Alan
    Redi, Judith A.
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 60 : 237 - 252
  • [28] Blind image quality assessment by simulating the visual cortex
    Cai, Rongtai
    Fang, Ming
    VISUAL COMPUTER, 2023, 39 (10) : 4639 - 4656
  • [29] Blind Image Quality Assessment via Transformer Predicted Error Map and Perceptual Quality Token
    Shi, Jinsong
    Gao, Pan
    Smolic, Aljosa
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4641 - 4651
  • [30] Local Feature Aggregation for Blind Image Quality Assessment
    Xu, Jingtao
    Li, Qiaohong
    Ye, Peng
    Du, Haiqing
    Liu, Yong
    2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2015,