The Impact of Tone Language and Non-Native Language Listening on Measuring Speech Quality

被引:0
作者
Ebem, Deborah U. [1 ]
Beerends, John G. [2 ]
Van Vugt, Jeroen [2 ]
Schmidmer, Christian [3 ]
Kooij, Robert E. [2 ,4 ]
Uguru, Joy O. [5 ]
机构
[1] Univ Nigeria, Dept Comp Sci, Nsukka, Enugu State, Nigeria
[2] TNO, NL-2600 GB Delft, Netherlands
[3] OPTICOM GmbH, D-91052 Erlangen, Germany
[4] Delft Univ Technol, Fac Elect Engn Math & Comp Sci, NL-2600 GA Delft, Netherlands
[5] Univ Nigeria, Dept Linguist Igbo & Other Nigerian Languages, Humanities Unit, Sch Gen Studies, Nsukka, Nigeria
来源
JOURNAL OF THE AUDIO ENGINEERING SOCIETY | 2011年 / 59卷 / 09期
关键词
PERCEPTUAL EVALUATION; ITU STANDARD; PESQ;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The extent to which the modeling used in objective speech quality algorithms depends on the cultural background of listeners as well as on the language characteristics using American English and Igbo, an African tone language is investigated. Two different approaches were used in order to separate behavioral aspects from speech signal aspects. In the first approach degraded American English sentences were presented to Igbo listeners and American listeners, showing that Igbo subjects are more disturbed by additive noise in comparison to other degradations than American subjects. In the second approach objective modeling, using ITU-T P.863 (POLQA), showed that Igbo subjects listening to degraded Igbo speech are more disturbed by background noise and low-level listening than predicted by the P.863 standard, which was trained on Western languages using native listeners. The most likely conclusion is that low-level signal parts of the Igbo tone language are relatively more important than low-level signal parts of American English. In judging the quality of their own language Igbo listeners thus need more signal level and more signal-to-noise ratio for perceiving high quality than American subjects require in judging their own language. When Igbo subjects judge the quality of American speech samples the impact of noise is overestimated but low-level listening does not have a significant impact on the perceived speech quality. The results show that one cannot build a universal objective speech quality measurement system but that adaptation toward the behavior of a set of subjects is necessary. Further investigation into the impact of tone language signal characteristics and the behavior of subjects who are raised in a specific cultural environment is necessary before a new speech quality measure for assessing voice quality in that environment can be developed. The results also suggest that speech communication systems have to be optimized dependent on the cultural context where the system is used and/or the languages for which the system is intended.
引用
收藏
页码:647 / 655
页数:9
相关论文
共 18 条
[1]  
[Anonymous], 2011, P863 ITUT
[2]  
[Anonymous], 2003, ITU-T Recommendation G.984.2 - Gigabit-capable Passive Optical Networks
[3]  
[Anonymous], 2001, REC ITU T P 862
[4]  
[Anonymous], 2017, P.862.2
[5]  
Beerends JG, 2002, J AUDIO ENG SOC, V50, P765
[6]  
BEERENDS JG, 1994, J AUDIO ENG SOC, V42, P115
[7]   Non-native speech perception in adverse conditions: A review [J].
Garcia Lecumberri, Maria Luisa ;
Cooke, Martin ;
Cutler, Anne .
SPEECH COMMUNICATION, 2010, 52 (11-12) :864-886
[8]  
HOLUB J, 2010, P 9 WIR TEL S
[9]  
Hoth DF, 1941, J ACOUST SOC AM, V12, P499, DOI 10.1121/1.1916129
[10]  
IKEKEONWU C, 1993, 40 LUND U DEP LING, P95