The Influence of Reverberation on the Perceptual Judgment of Cross-Lingual Speakers' Timbre

被引:0
作者
Liu, Yali [1 ]
Wu, Mian [1 ]
机构
[1] Commun Univ China, Minist Educ, Key Lab Media Audio & Video, Commun Acoust Lab, Beijing 100024, Peoples R China
来源
2021 IEEE/ACIS 20TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS 2021-SUMMER) | 2021年
关键词
Reverberation; Cross-language; Perceptual judgment; Average spectrum;
D O I
10.1109/ICIS51600.2021.9516857
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper aims at the influence of reverberation on the perceptual judgment of the cross-lingual (English and Mandarin Chinese) speakers' timbre from the two aspects: perception experiment and acoustic features. The results show that: (1) The reverberation can reduce the identification accuracy of cross-lingual speaker's timbre, but there is no significant difference in reverberation time. (2) The subjects are more likely to misjudge the same speaker as the different speakers. (3) The perception of the speaker's timbre has significant gender difference, and the misjudgment rate of females is higher than that of males. (4) After adding reverberation, the difference of the average spectrum value between the English and Mandarin Chinese speech has been decreased in the full frequency band. And the difference is reduced to less than 10 dB, in which subjects are more likely to confuse the cross-lingual speakers.
引用
收藏
页码:116 / 120
页数:5
相关论文
共 12 条
[1]  
Askar R, 2015, 2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, P519, DOI 10.1109/ChinaSIP.2015.7230457
[2]  
Auckenthaler R., 2001, ICASSP 2001
[3]  
BAO Z. W., 1978, J PHYS SCI, V27, P476
[4]  
Jianglin Wang, 2013, 2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP), P170, DOI 10.1109/ChinaSIP.2013.6625321
[5]  
Jing Zhang, 2019, 2019 IEEE 19th International Conference on Communication Technology (ICCT), P193, DOI 10.1109/ICCT46805.2019.8947173
[6]  
Li LT, 2017, ASIAPAC SIGN INFO PR, P1040, DOI 10.1109/APSIPA.2017.8282182
[7]  
Lu L, 2009, INT CONF ACOUST SPEE, P4217, DOI 10.1109/ICASSP.2009.4960559
[8]  
Misra A, 2014, IEEE W SP LANG TECH, P372, DOI 10.1109/SLT.2014.7078603
[9]  
Rozi Askar, 2016, 2016 Conference of The Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques (O-COCOSDA), P161, DOI 10.1109/ICSDA.2016.7919004
[10]   Evaluating automatic speech recognition systems as quantitative models of cross-lingual phonetic category perception [J].
Schatz, Thomas ;
Bach, Francis ;
Dupoux, Emmanuel .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 143 (05) :EL372-EL378