State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model

Cited by: 5
Authors
Luna, Raul [1]
Zabaleta, Itziar [2]
Bertalmio, Marcelo [1]
Affiliations
[1] CSIC, Inst Opt, Madrid, Spain
[2] Univ Pompeu Fabra, Dept Informat & Commun Technol, Barcelona, Spain
Funding
EU Horizon 2020;
Keywords
visual perception; visual neuroscience; receptive field; INRF; computational modeling; image quality assessment; video quality assessment; high frame rate videos; SIMILARITY;
DOI
10.3389/fnins.2023.1222815
Chinese Library Classification number
Q189 [Neuroscience];
Discipline classification code
071006;
Abstract
The development of automatic methods for image and video quality assessment that correlate well with the perception of human observers is a very challenging open problem in vision science, with numerous practical applications in disciplines such as image processing and computer vision, as well as in the media industry. In the past two decades, the goal of image quality research has been to improve upon classical metrics by developing models that emulate some aspects of the visual system. While progress has been considerable, state-of-the-art quality assessment methods still share a number of shortcomings: their performance drops considerably when they are tested on a database that differs substantially from the one used to train them, and they have significant limitations in predicting observer scores for high frame rate videos. In this work we propose a novel objective method for image and video quality assessment based on the recently introduced Intrinsically Non-linear Receptive Field (INRF) formulation, a neural summation model that has been shown to be better at predicting neural activity and visual perception phenomena than the classical linear receptive field. We start by optimizing, on a classic image quality database, the four parameters of a very simple INRF-based metric, and then test this metric on three other databases, showing that its performance equals or surpasses that of state-of-the-art methods, some of which have millions of parameters. Next, we extend this INRF image quality metric to the temporal domain and test it on several popular video quality datasets; again, the results of our proposed INRF-based video quality metric are shown to be very competitive.
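As an illustration of what "correlating well with the perception of human observers" can mean in practice, the sketch below computes the Spearman rank-order correlation between a quality metric's outputs and observer scores. It is a minimal sketch under stated assumptions, not the evaluation procedure of the paper: the function names, the sample values, and the convention that the metric is an error measure (lower means better quality) are all hypothetical.

```python
from math import sqrt


def ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result


def pearson(x, y):
    """Pearson linear correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)


def spearman(x, y):
    """Spearman rank-order correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


# Hypothetical data: one metric output and one mean opinion score per image.
# The metric is assumed to be an error measure, so it should fall as the
# observer score rises.
metric_scores = [0.12, 0.35, 0.20, 0.80, 0.55]
observer_scores = [90, 62, 75, 20, 41]

srocc = spearman(metric_scores, observer_scores)
print(f"Spearman correlation: {srocc:.3f}")
```

With an error-type metric, a correlation close to -1 indicates strong monotonic agreement with observers; a rank-based measure is used here because it does not assume a linear relation between metric values and opinion scores.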
Pages: 11