From Pixels to Rich-Nodes: A Cognition-Inspired Framework for Blind Image Quality Assessment

被引：0

作者：

He, Tian ^{[1
]}

Shi, Lin ^{[2
]}

Xu, Wenjia ^{[3
]}

Wang, Yu ^{[1
]}

Qiu, Weijie ^{[1
]}

Guo, Houbang ^{[4
]}

Jiang, Zhuqing ^{[1
]}

机构：

[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China

[2] China Acad Informat & Commun, Artificial Intelligence Res Ctr, Secur & Metaverse Dept, Beijing 100083, Peoples R China

[3] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China

[4] UCL, Phys & Astron Dept, London WC1E 6BT, England

来源：

IEEE TRANSACTIONS ON BROADCASTING | 2025年 / 71卷 / 01期

关键词：

Image quality; Feature extraction; Distortion; Cognition; Neurons; Semantics; Graph neural networks; Deep learning; Training; Topology; Blind image quality assessment; rich club; graph-inspired feature integrator; frequency prior; ranking prior;

D O I：

10.1109/TBC.2024.3464418

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Blind image quality assessment (BIQA) is a subjective perception-driven task, which necessitates assessment results consistent with human cognition. The human cognitive system inherently involves both separation and integration mechanisms. Recent works have witnessed the success of deep learning methods in separating distortion features. Nonetheless, traditional deep-learning-based BIQA methods predominantly depend on fixed topology to mimic the information integration in the brain, which gives rise to scale sensitivity and low flexibility. To handle this challenge, we delve into the dynamic interactions among neurons and propose a cognition-inspired BIQA model. Drawing insights from the rich club structure in network neuroscience, a graph-inspired feature integrator is devised to reconstruct the network topology. Specifically, we argue that the activity of individual neurons (pixels) tends to exhibit a random fluctuation with ambiguous meaning, while clear and coherent cognition arises from neurons with high connectivity (rich-nodes). Therefore, a self-attention mechanism is employed to establish strong semantic associations between pixels and rich-nodes. Subsequently, we design intra-and inter-layer graph structures to promote the feature interaction across spatial and scale dimensions. Such dynamic circuits endow the BIQA method with efficient, flexible, and robust information processing capabilities, so as to achieve more human-subjective assessment results. Moreover, since the limited samples in existing IQA datasets are prone to model overfitting, we devise two prior hypotheses: frequency prior and ranking prior. The former stepwise augments high-frequency components that reflect the distortion degree during the multilevel feature extraction, while the latter seeks to motivate the model's in-depth comprehension of differences in sample quality. Extensive experiments on five publicly datasets reveal that the proposed algorithm achieves competitive results.

引用

页码：229 / 239

页数：11

共 12 条

[1] Blind Image Quality Assessment for a Single Image From Text-to-Image Synthesis
Yu, Wenxin
Zhang, Xuewen
Zhang, Yunye
Zhang, Zhiqiang
Zhou, Jinjia
IEEE ACCESS, 2021, 9 : 94656 - 94667
[2] FreqAlign: Excavating Perception-Oriented Transferability for Blind Image Quality Assessment From a Frequency Perspective
Li, Xin
Lu, Yiting
Chen, Zhibo
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4652 - 4666
[3] A hybrid learning-based framework for blind image quality assessment
Wu, Meiyin
Chen, Li
Tian, Jing
MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2018, 29 (03) : 839 - 849
[4] Active Fine-Tuning From gMAD Examples Improves Blind Image Quality Assessment
Wang, Zhihua
Ma, Kede
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 4577 - 4590
[5] A hybrid learning-based framework for blind image quality assessment
Meiyin Wu
Li Chen
Jing Tian
Multidimensional Systems and Signal Processing, 2018, 29 : 839 - 849
[6] Blind stereo image quality assessment inspired by brain sensory-motor fusion
Karimi, Maryam
Soltanian, Najmeh
Samavi, Shadrokh
Najarian, Kayvan
Karimi, Nader
Soroushmehr, S. M. Reza
DIGITAL SIGNAL PROCESSING, 2019, 91 : 91 - 104
[7] BLIND IMAGE QUALITY ASSESSMENT BY LEARNING FROM MULTIPLE ANNOTATORS
Ma, Kede
Liu, Xuelin
Fang, Yuming
Simoncelli, Eero P.
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2344 - 2348
[8] Blind Image Quality Assessment: From Natural Scene Statistics to Perceptual Quality
Moorthy, Anush Krishna
Bovik, Alan Conrad
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (12) : 3350 - 3364
[9] PATCH-IQ: A patch based learning framework for blind image quality assessment
Manap, Redzuan Abdul
Shao, Ling
Frangi, Alejandro F.
INFORMATION SCIENCES, 2017, 420 : 329 - 344
[10] A PROPOSAL PROJECT FOR A BLIND IMAGE QUALITY ASSESSMENT BY LEARNING DISTORTIONS FROM THE FULL REFERENCE IMAGE QUALITY ASSESSMENTS
Paris, Stefane
2012 FOURTH INTERNATIONAL WORKSHOP ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2012, : 29 - 30

← 1 2 →