Perception and Automated Assessment of Audio Quality in User Generated Content: An Improved Model

Cited by: 0
Authors
Fazenda, Bruno M. [1]
Kendrick, Paul [1]
Cox, Trevor J. [1]
Li, Francis [1]
Jackson, Iain [2]
Affiliations
[1] Univ Salford, Acoust Res Ctr, Salford, Lancs, England
[2] Univ Manchester, Sch Psychol Sci, Ctr Child Study, Manchester, Lancs, England
Source
2016 EIGHTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX) | 2016
Keywords
audio quality; perception; audio quality of experience; distortion; wind noise; handling noise; ITU-T STANDARD; ASSESSMENT POLQA; INDEX HASQI; PART II; SPEECH; PESQ;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, methods]
Subject classification code
081202
Abstract
Technology to record sound, available in personal devices such as smartphones or video recording devices, is now ubiquitous. However, the production quality of the sound in such user-generated content is often very poor: distorted, noisy, with garbled speech or indistinct music. Our interest lies in the causes of the poor recording, especially what happens between the sound source and the electronic signal emerging from the microphone, and in finding an automated method to warn the user of such problems. Typical problems, such as distortion, wind noise, microphone handling noise and frequency response, were tested. A perceptual model has been developed from subjective tests on the perceived quality of such errors and data measured from a training dataset composed of various audio files. It is shown that perceived quality is associated with distortion and frequency response, with wind and handling noise being just slightly less important. In addition, the contextual content of the audio sample was found to modulate perceived quality at levels similar to those of degradations such as wind noise, rendering the degradations introduced by handling noise negligible.
Pages: 6
Related papers
22 in total
[11]  
Kates JM, 2010, J AUDIO ENG SOC, V58, P363
[12]  
Kendrick P, 2013, MEASURING PORT UNPUB
[13]   Microphone Handling Noise: Measurements of Perceptual Threshold and Effects on Audio Quality [J].
Kendrick, Paul ;
Jackson, Iain R. ;
Fazenda, Bruno M. ;
Cox, Trevor J. ;
Li, Francis F. .
PLOS ONE, 2015, 10 (10)
[14]  
Kendrick P, 2015, J AUDIO ENG SOC, V63, P698
[15]   Evaluating the Generalization of the Hearing Aid Speech Quality Index (HASQI) [J].
Kressner, Abigail A. ;
Anderson, David V. ;
Rozell, Christopher J. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (02) :407-415
[16]   Perceived naturalness of spectrally distorted speech and music [J].
Moore, BCJ ;
Tan, CT .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2003, 114 (01) :408-419
[17]  
Rix AW, 2002, J AUDIO ENG SOC, V50, P755
[18]  
Scheirer E, 1997, INT CONF ACOUST SPEE, P1331, DOI 10.1109/ICASSP.1997.596192
[19]  
Tan C., 2003, J AUDIO ENG SOC, V51
[20]  
Tan CT, 2004, J AUDIO ENG SOC, V52, P699