Data Analysis in Multimedia Quality Assessment: Revisiting the Statistical Tests

被引:18
作者
Narwaria, Manish [1 ]
Krasula, Lukas [2 ]
Le Callet, Patrick [2 ]
机构
[1] Dhirubhai Ambani Inst Informat & Commun Technol, Gandhinagar 382007, India
[2] Univ Nantes, IPI Grp LS2N, F-44306 Nantes, France
关键词
Assumption of normality; homogeneity of variance; multimedia quality; statistical analysis; VIDEO;
D O I
10.1109/TMM.2018.2794266
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Assessment of multimedia quality relies heavily on subjective assessment, and is typically done by human subjects in the form of preferences or continuous ratings. Such data are crucial for analysis of different multimedia-processing algorithms as well as validation of objective (computational) methods for the said purpose. To that end, statistical testing provides a theoretical framework toward drawing meaningful inferences, and making well-grounded conclusions and recommendations. While parametric tests (such as t test, ANOVA, and error estimates like confidence intervals) are popular and widely used in the community, there appears to be a certain degree of confusion in the application of such tests. Specifically, the assumptions of normality and homogeneity of variance are often not well understood, leading to incorrect application and/or interpretation of the statistical test results. Therefore, the main goal of this paper is to present new guidelines toward proper use of statistical tests and, hence, fix some of the issues in multimedia quality assessment. The said guidelines are derived based on theoretical analysis of sampling distribution of test statistics, and consider practical aspects of data analysis in the said domain. Experimental results on both simulated and real data are presented to support the arguments made. Software that implements the said recommendations is also made publicly available, in order to help researchers and practitioners perform correct statistical comparison of models.
引用
收藏
页码:2063 / 2072
页数:10
相关论文
共 40 条
[1]  
[Anonymous], P WORKSH QOE MULT CO
[2]  
[Anonymous], 2012, TECH REP
[3]  
[Anonymous], 2003, Final report from the video quality experts group on the validation of objective models of video quality assessment
[4]  
[Anonymous], 2015, TECH REP
[5]  
[Anonymous], 2009, TECH REP
[6]  
Belmudez B., 2016, T LABS SERIES TELECO
[7]   Multimedia Quality Assessment Standards in ITU-T SG12 [J].
Coverdale, Paul ;
Moeller, Sebastian ;
Raake, Alexander ;
Takahashi, Akira .
IEEE SIGNAL PROCESSING MAGAZINE, 2011, 28 (06) :91-97
[8]   No-Reference Quality Assessment of Screen Content Pictures [J].
Gu, Ke ;
Zhou, Jun ;
Qiao, Jun-Fei ;
Zhai, Guangtao ;
Lin, Weisi ;
Bovik, Alan Conrad .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (08) :4005-4018
[9]   Analysis of Distortion Distribution for Pooling in Image Quality Prediction [J].
Gu, Ke ;
Wang, Shiqi ;
Zhai, Guangtao ;
Lin, Weisi ;
Yang, Xiaokang ;
Zhang, Wenjun .
IEEE TRANSACTIONS ON BROADCASTING, 2016, 62 (02) :446-456
[10]  
Hossfeld T, 2011, INT WORK QUAL MULTIM, P131, DOI 10.1109/QoMEX.2011.6065690