The Accuracy of Subjects in a Quality Experiment: A Theoretical Subject Model

被引：57

作者：

Janowski, Lucjan ^{[1
]}

Pinson, Margaret ^{[2
]}

机构：

[1] AGH Univ Sci & Technol, PL-30059 Krakow, Poland

[2] Inst Telecommun Sci, Boulder, CO 80305 USA

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2015年 / 17卷 / 12期

关键词：

Design of experiments; mean opinion score; quality of experience (QoE); subject model; subjective ratings; video quality assessment;

D O I：

10.1109/TMM.2015.2484963

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

How accurately are people able to use the absolute category rating (ACR) 5-level scale? Put another way, how repeatable are an individual subject's scores? Several subjective experiments have asked subjects to rate the same sequences a couple of times. Analyses indicate that none of the subjects exactly repeated their prior scores for these sequences. We would like to better understand this imperfection. This paper uses ACR subjective video quality tests to explore the precision of subjective ratings. To make formal measurements possible, we propose a theoretical subject model that is the main contribution of this paper. The proposed subject model indicates three major factors that influence accuracy: subject bias, subject inaccuracy, and stimulus scoring difficulty. These appear to be separate random effects and their existence is a reason why none of the subjects were able to perfectly repeat scores. There are three key consequences. First, subject scoring behavior includes a random component that spans approximately half of the rating scale. Second, the sensitivity and accuracy of most subjective analyses can be improved if the subject scores are normalized by removing subject bias. Third, to some extent, multiple subjects can be replaced with a single subject who rates each sequence multiple times.

引用

页码：2210 / 2224

页数：15

共 26 条

[1]

[Anonymous], 2013, PROCEEDING EUROPEAN

[2]

[Anonymous], 1994, CONTRIBUTION TO ANSI

[3]

[Anonymous], 2003, BS1534 ITUR

[4]

Connors L., 2014, HB14501 NAT TEL INF

[5]

Dong Shi, 2010, 2010 2nd International Conference on Industrial Mechatronics and Automation (ICIMA 2010), P229, DOI 10.1109/ICINDMA.2010.5538328

[6] Automated qualitative assessment of multi-modal distortions in digital images based on GLZ [J].

Gowacz, Andrzej ;

Grega, Micha ;

Gwiazda, Przemysaw ;

Janowski, Lucjan ;

Leszczuk, Mikoaj ;

Romaniak, Piotr ;

Romano, Simon Pietro .

ANNALS OF TELECOMMUNICATIONS-ANNALES DES TELECOMMUNICATIONS, 2010, 65 (1-2) :3-17

[7]

Hoene C., 2013, SUMMARY OPUS LISTENI

[8]

Hossfeld T, 2011, INT WORK QUAL MULTIM, P131, DOI 10.1109/QoMEX.2011.6065690

[9] Modelling of spatio-temporal interaction for video quality assessment [J].

Huynh-Thu, Quan ;

Ghanbari, Mohammed .

SIGNAL PROCESSING-IMAGE COMMUNICATION, 2010, 25 (07) :535-546

[10]

Janowski L, 2014, TM14505 NTIA

← 1 2 3 →