No-Reference Video quality assessment of H.264 video streams based on semantic saliency maps

被引:2
作者
Boujut, H. [1 ]
Benois-Pineau, J. [1 ]
Ahmed, T. [1 ]
Hadar, O. [2 ]
Bonnet, P. [3 ]
机构
[1] Univ Bordeaux 1, IPB Matmeca Enseirb, LABRI UMR CNRS 5800, 351 Cours Liberat, F-33405 Talence, France
[2] Ben Gurion Univ Negev, Commun Syst Engn Dept, IL-84105 Beer Sheva, Israel
[3] Audemat WorldCast Syst Grp, F-33700 Bordeaux, France
来源
IMAGE QUALITY AND SYSTEM PERFORMANCE IX | 2012年 / 8293卷
关键词
IMAGE; FACES;
D O I
10.1117/12.905379
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The paper contributes to No-Reference video quality assessment of broadcasted HD video over IP networks and DVB. In this work we have enhanced our bottom-up spatio-temporal saliency map model by considering semantics of the visual scene. Thus we propose a new saliency map model based on face detection that we called semantic saliency map. A new fusion method has been proposed to merge the bottom-up saliency maps with the semantic saliency map. We show that our NR metric WMBER weighted by the spatio-temporal-semantic saliency map provides higher results then the WMBER weighted by the bottom-up spatio-temporal saliency map. Tests are performed on two H.264/AVC video databases for video quality assessment over lossy networks.
引用
收藏
页数:9
相关论文
共 17 条
[1]   Axiomatization of an exponential similarity function [J].
Billot, Antoine ;
Gilboa, Itzhak ;
Schmeidler, David .
MATHEMATICAL SOCIAL SCIENCES, 2008, 55 (02) :107-115
[2]  
Boujut H., 2011, ICME 2011 WORKSH HOT
[3]  
Boujut H., 2011, IS T SPIE EL IM JAN
[4]   Faces and text attract gaze independent of the task: Experimental data and computer model [J].
Cerf, Moran ;
Frady, E. Paxon ;
Koch, Christof .
JOURNAL OF VISION, 2009, 9 (12)
[5]  
Engelke U., 2010, QOMEX
[6]  
Feng XZ, 2008, I C WIREL COMM NETW, P12560
[7]   No-reference image and video quality estimation: Applications and human-motivated design [J].
Hemami, Sheila S. ;
Reibman, Amy R. .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2010, 25 (07) :469-481
[8]  
International Telecommunication Union, 2007, G1050 ITUT
[9]  
International Telecommunication Union, 2002, BT50011 ITUT
[10]   Detection of human faces in color image sequences with arbitrary motions for very low bit-rate videophone coding [J].
Kapfer, M ;
Benois-Pineau, J .
PATTERN RECOGNITION LETTERS, 1997, 18 (14) :1503-1518