No-Reference Video Quality Assessment Using Voxel-Wise fMRI Models of the Visual Cortex

被引：5

作者：

Mahankali, Naga Sailaja ^{[1
]}

Raghavan, Mohan ^{[2
]}

Channappayya, Sumohana S. ^{[1
]}

机构：

[1] Indian Inst Technol Hyderabad, Dept Elect Engn, Hyderabad 500020, Telangana, India

[2] Indian Inst Technol Hyderabad, Dept Biomed Engn, Hyderabad 500020, Telangana, India

来源：

IEEE SIGNAL PROCESSING LETTERS | 2022年 / 29卷

关键词：

Visualization; Functional magnetic resonance imaging; Brain modeling; Predictive models; Prediction algorithms; Encoding; Quality assessment; Human visual system (HVS); functional magnetic resonance imaging (fMRI); blood oxygen level-dependent (BOLD); haemodynamic response function (HRF); video quality assessment (VQA);

D O I：

10.1109/LSP.2021.3136487

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The performance of the human visual system is very efficient in many visual tasks such as identifying visual scenes, anticipating future actions based on the past observations, assessing the quality of visual stimuli, etc. A significant amount of effort has been directed towards finding quality aware representations of natural videos to solve the quality prediction task. In this work we present a novel no reference video quality assessment (NR-VQA) algorithm based on the functional Magnetic Resonance Imaging (fMRI) Blood Oxygen Level Dependent (BOLD) signal prediction with voxel-wise encoding models of the human brain. The voxel encoding models are learnt using deep features extracted from the AlexNet model to predict the fMRI response to natural video stimuli. We show that the curvature in the predicted voxel response time series provides good quality discriminability, and forms an important feature for quality prediction. Further, we show that the proposed curvature features in combination with the spatial index, temporal index and NIQE features deliver acceptable performance on the Video Quality Assessment (VQA) task on both synthetic and authentic distortion data-sets.

引用

页码：319 / 323

页数：5

共 38 条

[11]

Hosu Vlad, 2017, QOMEX, P1

[12] RECEPTIVE FIELDS, BINOCULAR INTERACTION AND FUNCTIONAL ARCHITECTURE IN CATS VISUAL CORTEX [J].

HUBEL, DH ;

WIESEL, TN .

JOURNAL OF PHYSIOLOGY-LONDON, 1962, 160 (01) :106-&

[13] Identifying natural images from human brain activity [J].

Kay, Kendrick N. ;

Naselaris, Thomas ;

Prenger, Ryan J. ;

Gallant, Jack L. .

NATURE, 2008, 452 (7185) :352-U7

[14] Two-Level Approach for No-Reference Consumer Video Quality Assessment [J].

Korhonen, Jari .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (12) :5923-5938

[15] ImageNet Classification with Deep Convolutional Neural Networks [J].

Krizhevsky, Alex ;

Sutskever, Ilya ;

Hinton, Geoffrey E. .

COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90

[16]

Kroupi E, 2014, EUR SIGNAL PR CONF, P2135

[17] VIDEO QUALITY PREDICTION USING VOXEL-WISE FMRI MODELS OF THE VISUAL CORTEX [J].

Mahankali, Naga Sailaja ;

Channappayya, Sumohana S. .

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :2125-2129

[18] FUNCTIONAL-PROPERTIES OF NEURONS IN MIDDLE TEMPORAL VISUAL AREA OF THE MACAQUE MONKEY .1. SELECTIVITY FOR STIMULUS DIRECTION, SPEED, AND ORIENTATION [J].

MAUNSELL, JHR ;

VANESSEN, DC .

JOURNAL OF NEUROPHYSIOLOGY, 1983, 49 (05) :1127-1147

[19] A Completely Blind Video Integrity Oracle [J].

Mittal, Anish ;

Saad, Michele A. ;

Bovik, Alan C. .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (01) :289-300

[20] Making a "Completely Blind" Image Quality Analyzer [J].

Mittal, Anish ;

Soundararajan, Rajiv ;

Bovik, Alan C. .

IEEE SIGNAL PROCESSING LETTERS, 2013, 20 (03) :209-212

← 1 2 3 4 →