A Novel Speech Processing Algorithm Based on Harmonicity Cues in Cochlear Implant

Cited by: 0
Authors
Wang, Jian [1]
Chen, Yousheng [1]
Zhang, Zongping [1]
Chen, Yan [1]
Zhang, Weifeng [1]
Affiliations
[1] Shenzhen Inst Informat Technol, Sch Elect & Commun, Shenzhen 518172, Guangdong, Peoples R China
Source
GREEN ENERGY AND SUSTAINABLE DEVELOPMENT I | 2017 / Vol. 1864
Funding
National Natural Science Foundation of China;
Keywords
Cochlear Implant; Mandarin Speech Recognition; Harmonicity Cues; Fundamental Frequency; Continuous Interleaved Sampling Strategy; PERCEPTION; PITCH; RECOGNITION; INFORMATION; ENVELOPE; TONES;
DOI
10.1063/1.4993000
Chinese Library Classification number
X [Environmental Science, Safety Science];
Discipline classification code
08 ; 0830 ;
Abstract
This paper proposed a novel speech processing algorithm for cochlear implants that used harmonicity cues to enhance tonal information in Mandarin Chinese speech recognition. The input speech was filtered by a 4-channel band-pass filter bank whose bands covered 300-621, 621-1285, 1285-2657, and 2657-5499 Hz. In each pass band, temporal envelope and periodicity cues (TEPCs) below 400 Hz were extracted by full-wave rectification and low-pass filtering. The TEPCs then modulated a sinusoidal carrier whose frequency was the fundamental frequency (F0) or whichever of its harmonics lay closest to the center frequency of the band. The signals from all bands were summed to obtain the output speech. Mandarin tone, word, and sentence recognition in quiet listening conditions was tested for the widely used continuous interleaved sampling (CIS) strategy and for the novel F0-harmonic algorithm. The results showed that the F0-harmonic algorithm performed consistently better than the CIS strategy in Mandarin tone, word, and sentence recognition. In addition, the sentence recognition rate was higher than the word recognition rate, as a result of the contextual information available in sentences. Moreover, tones 3 and 4 were recognized better than tones 1 and 2, owing to the easily identified features of the former. In conclusion, the F0-harmonic algorithm could enhance tonal information in cochlear implant speech processing through its use of harmonicity cues, thereby improving Mandarin tone, word, and sentence recognition. Further study will focus on testing the F0-harmonic algorithm in noisy listening conditions.
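The carrier-selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The band edges come from the abstract. The function names are hypothetical, and the use of the geometric mean as the band "center frequency" is an assumption, since the abstract does not define it. F0 estimation, band-pass filtering, envelope extraction, and modulation are omitted.

```python
import math

# Band edges in Hz, as listed in the abstract (four adjacent bands).
BAND_EDGES_HZ = [300, 621, 1285, 2657, 5499]


def band_centers(edges):
    """Center of each band, taken here as the geometric mean of its edges.

    Assumption: the abstract does not say how the center frequency is defined.
    """
    return [math.sqrt(lo * hi) for lo, hi in zip(edges[:-1], edges[1:])]


def nearest_harmonic(f0_hz, center_hz):
    """Return the multiple k * F0 (k >= 1) that lies closest to center_hz."""
    k = max(1, round(center_hz / f0_hz))  # k = 1 is F0 itself
    return k * f0_hz


def carrier_frequencies(f0_hz, edges=BAND_EDGES_HZ):
    """One sinusoidal carrier frequency per band for a given F0 estimate."""
    return [nearest_harmonic(f0_hz, c) for c in band_centers(edges)]


if __name__ == "__main__":
    # Example: a voiced frame with an F0 estimate of 200 Hz.
    print(carrier_frequencies(200))
```

Under these assumptions, each carrier is an integer multiple of F0, so the four carriers stay harmonically related as F0 moves with the tone contour.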
Pages: 7
Related papers
25 in total
[1]   Effects of stimulation rates on Cantonese lexical tone perception by cochlear implant users in Hong Kong [J].
Au, DKK .
CLINICAL OTOLARYNGOLOGY, 2003, 28 (06) :533-538
[2]   HEARING THEORIES AND COMPLEX SOUNDS [J].
BEKESY, GV .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1963, 35 (04) :588-&
[3]   The identification of speech in noise by cochlear implant patients and normal-hearing listeners using 6-channel signal processors [J].
Dorman, MF ;
Loizou, PC ;
Fitzke, J .
EAR AND HEARING, 1998, 19 (06) :481-484
[4]   Fu Q.J., 2000, Asia Pac J Speech, Lang Hear, V5, P45
[5]   Development and validation of the Mandarin speech perception test [J].
Fu, Qian-Jie ;
Zhu, Meimei ;
Wang, Xiaosong .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 129 (06) :EL267-EL273
[6]   Importance of tonal envelope cues in Chinese speech recognition [J].
Fu, QJ ;
Zeng, FG ;
Shannon, RV ;
Soli, SD .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1998, 104 (01) :505-510
[7]   Coding of the fundamental frequency in continuous interleaved sampling processors for cochlear implants [J].
Geurts, L ;
Wouters, J .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 109 (02) :713-726
[8]   Enhancement of temporal periodicity cues in cochlear implants: Effects on prosodic perception and vowel identification [J].
Green, T ;
Faulkner, A ;
Rosen, S ;
Macherey, O .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 118 (01) :375-385
[9]   Enhancing temporal cues to voice pitch in continuous interleaved sampling cochlear implants [J].
Green, T ;
Faulkner, A ;
Rosen, S .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (04) :2298-2310
[10]   KONG YY, 2003, 26 ANN MIDW RES M, V26, P213