Effects of lowpass and highpass filtering on the intelligibility of speech based on temporal fine structure or envelope cues

Cited by: 39
Authors
Ardoint, Marine [1]
Lorenzi, Christian [1]
Affiliations
[1] Univ Paris 05, DEC, Ecole Normale Super, CNRS,Lab Psychol Percept, F-75005 Paris, France
Keywords
Filtering; Intelligibility; Speech; Temporal envelope; Temporal fine structure; AUDITORY-NERVE FIBERS; FUNDAMENTAL-FREQUENCY; PERCEPTION; RECOGNITION; AMPLITUDE; TONES; RESPONSES; HEARING; SOUNDS;
DOI
10.1016/j.heares.2009.12.002
Chinese Library Classification number
R36 [Pathology]; R76 [Otorhinolaryngology];
Discipline classification code
100104 ; 100213 ;
Abstract
This study aimed to assess whether or not temporal envelope (E) and fine structure (TFS) cues in speech convey distinct phonetic information. Syllables uttered by a male and a female speaker were (i) processed to retain either E or TFS within 16 frequency bands, (ii) lowpass or highpass filtered at different cut-off frequencies, and (iii) presented for identification to seven listeners. Psychometric functions were fitted using a sigmoid function, and used to determine crossover frequencies (cut-off frequencies at which lowpass and highpass filtering yielded equivalent performance), and gradients at each point of the psychometric functions (change in performance with respect to cut-off frequency). Crossover frequencies and gradients were not significantly different across speakers. Crossover frequencies were not significantly different between E and TFS speech (approximately 1.5 kHz). Gradients were significantly different between E and TFS speech in various filtering conditions. When stimuli were highpass filtered above 2.5 kHz, performance was significantly above chance level and gradients were significantly different from 0 for E speech only. These findings suggest that E and TFS convey important but distinct phonetic cues between 1 and 2 kHz. Unlike TFS, E conveys information up to 6 kHz, consistent with the characteristics of neural phase locking to E and TFS. © 2009 Elsevier B.V. All rights reserved.
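The crossover frequency and gradient described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact sigmoid form, the fitting procedure, or the frequency axis, so everything below is an assumption: a logistic function on a log2 frequency axis, hypothetical midpoints (1.2 and 1.8 kHz) and slope (4.0), a chance level of 0, and hypothetical function names. No fitting to data is performed; the sketch only shows how a crossover and a gradient could be read off two already-parameterised psychometric functions.

```python
import math


def lowpass_score(fc_khz, midpoint_khz, slope, chance=0.0, ceiling=1.0):
    """Assumed logistic: identification score rises as the lowpass cut-off increases."""
    x = math.log2(fc_khz / midpoint_khz)
    return chance + (ceiling - chance) / (1.0 + math.exp(-slope * x))


def highpass_score(fc_khz, midpoint_khz, slope, chance=0.0, ceiling=1.0):
    """Assumed logistic: identification score falls as the highpass cut-off increases."""
    x = math.log2(fc_khz / midpoint_khz)
    return chance + (ceiling - chance) / (1.0 + math.exp(slope * x))


def crossover_khz(lp, hp, lo_khz=0.1, hi_khz=10.0, iterations=60):
    """Bisection on log2 frequency for the cut-off where lp(f) equals hp(f)."""
    a, b = math.log2(lo_khz), math.log2(hi_khz)
    for _ in range(iterations):
        mid = 0.5 * (a + b)
        if lp(2.0 ** mid) < hp(2.0 ** mid):
            a = mid  # crossover lies at a higher cut-off
        else:
            b = mid  # crossover lies at a lower cut-off
    return 2.0 ** (0.5 * (a + b))


def gradient(score, fc_khz, step_octaves=1e-3):
    """Central finite difference: change in score per octave of cut-off frequency."""
    up = score(fc_khz * 2.0 ** step_octaves)
    down = score(fc_khz * 2.0 ** -step_octaves)
    return (up - down) / (2.0 * step_octaves)


# Hypothetical parameters, chosen only for illustration (not taken from the paper).
lp = lambda f: lowpass_score(f, midpoint_khz=1.2, slope=4.0)
hp = lambda f: highpass_score(f, midpoint_khz=1.8, slope=4.0)

fx = crossover_khz(lp, hp)
print(f"crossover: {fx:.3f} kHz")
print(f"lowpass gradient at crossover: {gradient(lp, fx):.3f} per octave")
print(f"highpass gradient at crossover: {gradient(hp, fx):.3f} per octave")
```

With equal slopes, the crossover in this sketch falls at the geometric mean of the two midpoints; with real data the two functions would first have to be fitted to the listeners' scores.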
Pages: 89-95
Page count: 7
Related papers
34 items in total
[1]  
ARDOINT M, EFFECTS COMBIN UNPUB
[2]  
ARDOINT M, PERCEPTION TEM UNPUB
[3]   YIN, a fundamental frequency estimator for speech and music [J].
de Cheveigné, A ;
Kawahara, H .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 111 (04) :1917-1930
[4]   FACTORS GOVERNING THE INTELLIGIBILITY OF SPEECH SOUNDS [J].
FRENCH, NR ;
STEINBERG, JC .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1947, 19 (01) :90-119
[5]   On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception [J].
Ghitza, O .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 110 (03) :1628-1640
[6]   The ability of listeners to use recovered envelope cues from speech fine structure [J].
Gilbert, G ;
Lorenzi, C .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (04) :2438-2444
[7]   Effects of periodic interruptions on the intelligibility of speech based on temporal fine-structure or envelope cues (L) [J].
Gilbert, Gaëtan ;
Bergeras, Isabelle ;
Voillery, Dorothee ;
Lorenzi, Christian .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 122 (03) :1336-1339
[8]   EVALUATING THE ARTICULATION INDEX FOR AUDITORY VISUAL INPUT [J].
GRANT, KW ;
BRAIDA, LD .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1991, 89 (06) :2952-2960
[9]   The role of contrasting temporal amplitude patterns in the perception of speech [J].
Healy, EW ;
Warren, RM .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2003, 113 (03) :1676-1688
[10]  
HEINZ MG, 2009, J ASS RES OTOLARYNGO