From Pixels to Rich-Nodes: A Cognition-Inspired Framework for Blind Image Quality Assessment

Cited by: 0
Authors
He, Tian [1]
Shi, Lin [2]
Xu, Wenjia [3]
Wang, Yu [1]
Qiu, Weijie [1]
Guo, Houbang [4]
Jiang, Zhuqing [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] China Acad Informat & Commun, Artificial Intelligence Res Ctr, Secur & Metaverse Dept, Beijing 100083, Peoples R China
[3] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
[4] UCL, Phys & Astron Dept, London WC1E 6BT, England
Keywords
Image quality; Feature extraction; Distortion; Cognition; Neurons; Semantics; Graph neural networks; Deep learning; Training; Topology; Blind image quality assessment; rich club; graph-inspired feature integrator; frequency prior; ranking prior
DOI
10.1109/TBC.2024.3464418
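Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract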
Blind image quality assessment (BIQA) is a subjective, perception-driven task whose results must be consistent with human cognition. The human cognitive system inherently involves both separation and integration mechanisms. Recent work has shown the success of deep learning methods in separating distortion features. Nonetheless, traditional deep-learning-based BIQA methods depend predominantly on a fixed topology to mimic information integration in the brain, which gives rise to scale sensitivity and low flexibility. To address this challenge, we delve into the dynamic interactions among neurons and propose a cognition-inspired BIQA model. Drawing insights from the rich club structure in network neuroscience, a graph-inspired feature integrator is devised to reconstruct the network topology. Specifically, we argue that the activity of individual neurons (pixels) tends to exhibit random fluctuation with ambiguous meaning, while clear and coherent cognition arises from neurons with high connectivity (rich-nodes). Therefore, a self-attention mechanism is employed to establish strong semantic associations between pixels and rich-nodes. Subsequently, we design intra- and inter-layer graph structures to promote feature interaction across spatial and scale dimensions. Such dynamic circuits endow the BIQA method with efficient, flexible, and robust information processing capabilities, yielding assessment results that align more closely with human subjective judgments. Moreover, since the limited samples in existing IQA datasets make models prone to overfitting, we devise two prior hypotheses: a frequency prior and a ranking prior. The former progressively augments high-frequency components that reflect the degree of distortion during multilevel feature extraction, while the latter encourages the model to learn the differences in quality between samples in depth. Extensive experiments on five public datasets reveal that the proposed algorithm achieves competitive results.
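As a rough illustration of two ideas named in the abstract, the sketch below shows (a) attention-based aggregation of pixel features into a small set of rich-node features and (b) a pairwise hinge loss that rewards correct quality ordering. It is not the authors' implementation: the function names, the random node queries, the margin value, and the tensor shapes are all assumptions made for the example, and the record does not specify how the paper realizes either component.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def pixels_to_rich_nodes(pixels, num_nodes, rng):
    """Hypothetical sketch: pool N pixel features into K rich-node features.

    pixels: (N, C) array of per-pixel features.
    Returns (nodes, attn) with shapes (K, C) and (K, N).
    """
    _, c = pixels.shape
    # Assumption: node queries are random here; a trained model would learn them.
    queries = rng.standard_normal((num_nodes, c))
    scores = queries @ pixels.T / np.sqrt(c)  # (K, N) scaled dot products
    attn = softmax(scores, axis=-1)           # each node's weights over pixels sum to 1
    nodes = attn @ pixels                     # (K, C) attention-weighted pixel averages
    return nodes, attn


def pairwise_ranking_loss(pred, mos, margin=0.1):
    """Hypothetical hinge loss over all sample pairs with different target scores.

    pred: (B,) predicted quality scores; mos: (B,) subjective target scores.
    A pair is penalized unless the prediction gap has the same sign as the
    target gap and is at least `margin` wide.
    """
    diff_pred = pred[:, None] - pred[None, :]
    sign = np.sign(mos[:, None] - mos[None, :])
    losses = np.maximum(0.0, margin - sign * diff_pred)
    mask = sign != 0  # ignore self-pairs and ties in the targets
    return float(losses[mask].mean()) if mask.any() else 0.0


rng = np.random.default_rng(0)
pixels = rng.standard_normal((16, 8))        # 16 pixels, 8 channels (toy sizes)
nodes, attn = pixels_to_rich_nodes(pixels, num_nodes=4, rng=rng)

mos = np.array([1.0, 2.0, 3.0])
loss_correct_order = pairwise_ranking_loss(mos, mos)
loss_reversed_order = pairwise_ranking_loss(-mos, mos)
```

In this toy setup, predictions that preserve the target ordering with a gap larger than the margin incur zero loss, while reversed predictions are penalized on every pair.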
Pages: 229-239
Number of pages: 11