Loudness Contour Can Influence Mandarin Tone Recognition: Vocoder Simulation and Cochlear Implants

被引:12
作者
Meng, Qinglin [1 ,2 ]
Zheng, Nengheng [3 ,4 ]
Li, Xia [3 ]
机构
[1] SCUT, Sch Phys & Optoelect, Acoust Lab, Guangzhou 510641, Guangdong, Peoples R China
[2] Shenzhen Univ, Coll Informat Engn, Shenzhen 518060, Peoples R China
[3] Shenzhen Univ, Coll Informat Engn, Shenzhen Key Lab Modern Commun & Informat Proc, Shenzhen 518060, Peoples R China
[4] Univ New South Wales, Sydney, NSW 2052, Australia
基金
中国博士后科学基金;
关键词
Cochlear implant; loudness contour; Mandarin tone recognition; pitch; TEMPORAL PERIODICITY CUES; FINE-STRUCTURE; FUNDAMENTAL-FREQUENCY; IMPROVED PERCEPTION; SPEECH RECOGNITION; RELATIVE PITCH; LEXICAL TONES; IDENTIFICATION; ENVELOPE; CHINESE;
D O I
10.1109/TNSRE.2016.2593489
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Lexical tone recognition with current cochlear implants (CI) remains unsatisfactory due to significantly degraded pitch-related acoustic cues, which dominate the tone recognition by normal-hearing (NH) listeners. Several secondary cues (e.g., amplitude contour, duration, and spectral envelope) that influence tone recognition in NH listeners and CI users have been studied. This work proposes a loudness contour manipulation algorithm, namely Loudness-Tone (L-Tone), to investigate the effects of loudness contour on Mandarin tone recognition and the effectiveness of using loudness cue to enhance tone recognition for CI users. With L-Tone, the intensity of sound samples is multiplied by gain values determined by instantaneous fundamental frequencies (F0s) and pre-defined gain-F0 mapping functions. Perceptual experiments were conducted with a four-channel noise-band vocoder simulation in NH listeners and with CI users. The results suggested that 1) loudness contour is a useful secondary cue for Mandarin tone recognition, especially when pitch cues are significantly degraded; 2) L-Tone can be used to improve Mandarin tone recognition in both simulated and actual CI-hearing without significant negative effect on vowel and consonant recognition. L-Tone is a promising algorithm for incorporation into real-time CI processing and off-line CI rehabilitation training software.
引用
收藏
页码:641 / 649
页数:9
相关论文
共 37 条
[1]  
Boersma P., 2009, Praat: doing phonetics by computer (version 5.1.13), DOI DOI 10.1097/AUD.0B013-31821473F7
[2]  
Brookes M., 2014, VOICEBOX: Speech processing toolbox for MATLAB
[3]   The perception of speech modulation cues in lexical tones is guided by early language-specific experience [J].
Cabrera, Laurianne ;
Tsao, Feng-Ming ;
Liu, Huei-Mei ;
Li, Lu-Yang ;
Hu, You-Hsin ;
Lorenzi, Christian ;
Bertoncini, Josiane .
FRONTIERS IN PSYCHOLOGY, 2015, 6
[4]   The role of spectro-temporal fine structure cues in lexical-tone discrimination for French and Mandarin listeners [J].
Cabrera, Laurianne ;
Tsao, Feng-Ming ;
Gnansia, Dan ;
Bertoncini, Josiane ;
Lorenzi, Christian .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 136 (02) :877-882
[5]   The perception of Cantonese lexical tones by early-deafened cochlear implantees [J].
Ciocca, V ;
Francis, AL ;
Aisha, R ;
Wong, L .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 111 (05) :2250-2256
[6]   What breaks a melody: Perceiving FO and intensity sequences with a cochlear implant [J].
Cousineau, Marion ;
Demany, Laurent ;
Meyer, Bernard ;
Pressnitzer, Daniel .
HEARING RESEARCH, 2010, 269 (1-2) :34-41
[7]  
Duanmu S., 2002, PHONOLOGY STANDARD C, P225
[8]  
Fu Q.J., 2000, Asia Pac J Speech, Lang Hear, V5, P45
[9]   Importance of tonal envelope cues in Chinese speech recognition [J].
Fu, QJ ;
Zeng, FG ;
Shannon, RV ;
Soli, SD .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1998, 104 (01) :505-510
[10]   Coding of the fundamental frequency in continuous interleaved sampling processors for cochlear implants [J].
Geurts, L ;
Wouters, J .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 109 (02) :713-726