Using auditory classification images for the identification of fine acoustic cues used in speech perception

Cited by: 21
Authors
Varnet, Leo [1 ,2 ]
Knoblauch, Kenneth [3 ]
Meunier, Fanny [1 ,2 ,4 ]
Hoen, Michel [1 ,2 ]
Affiliations
[1] CNRS UMR5292, INSERM U1028, Brain Dynam & Cognit Team, Neurosci Res Ctr, Lyon, France
[2] Univ Lyon 1, Ecole Doctorale Neurosci & Cognit, F-69365 Lyon, France
[3] INSERM U846, Stem Cell & Brain Res Inst, Integrat Neurosci Dept, Bron, France
[4] CNRS UMR5304, Lab Langage Cerveau & Cognit, Lyon, France
Source
FRONTIERS IN HUMAN NEUROSCIENCE | 2013 / Volume 7
Funding
European Research Council;
Keywords
classification images; GLM; phoneme recognition; speech perception; acoustic cues; phonetics; STIMULUS FEATURES; RECEPTIVE-FIELDS; FREQUENCY; NOISE; DISCRIMINATION; NUMBER; SOUNDS; STOP; TIME;
DOI
10.3389/fnhum.2013.00865
Chinese Library Classification number
Q189 [Neuroscience];
Discipline classification code
071006 ;
Abstract
An essential step in understanding the processes underlying the general mechanism of perceptual categorization is to identify which portions of a physical stimulation modulate the behavior of our perceptual system. More specifically, in the context of speech comprehension, it is still a major open challenge to understand which information is used to categorize a speech stimulus as one phoneme or another, as the auditory primitives relevant for the categorical perception of speech remain unknown. Here we propose to adapt to auditory experiments a method relying on a Generalized Linear Model (GLM) with smoothness priors, already used in the visual domain for the estimation of so-called classification images. This statistical model offers a rigorous framework for dealing with non-Gaussian noise, as is often the case in the auditory modality, and limits the amount of noise in the estimated template by enforcing smoother solutions. By applying this technique to a specific two-alternative forced choice experiment between stimuli aba and ada in noise with an adaptive SNR, we confirm that the second formantic transition is key for classifying phonemes into /b/ or /d/ in noise, and that its estimation by the auditory system is a relative measurement across spectral bands and in relation to the perceived height of the second formant in the preceding syllable. Through this example, we show how the GLM with smoothness priors approach can be applied to the identification of fine functional acoustic cues in speech perception. Finally, we discuss some assumptions of the model in the specific case of speech perception.
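The idea of a GLM with a smoothness prior can be illustrated with a small simulation. The sketch below is not the authors' implementation: it assumes a one-dimensional template instead of a time-frequency one, Gaussian noise fields, a fixed rather than adaptive SNR, a logistic link, and a second-difference penalty standing in for the smoothness prior. All names (`fit_smooth_glm`, `lam`, `true_template`) and all numeric settings are hypothetical.

```python
import numpy as np

def second_difference_penalty(n):
    """Return D^T D, where D takes second differences along a 1-D template."""
    D = np.diff(np.eye(n), n=2, axis=0)  # shape (n - 2, n)
    return D.T @ D

def fit_smooth_glm(X, y, lam, n_iter=50):
    """Penalized logistic regression (MAP estimate) fitted by Newton's method.

    X   : (trials, features) noise field presented on each trial
    y   : (trials,) binary responses of the observer
    lam : weight of the smoothness penalty (assumed hyperparameter)
    """
    n_trials, n_feat = X.shape
    Xb = np.hstack([np.ones((n_trials, 1)), X])  # prepend a bias column
    P = np.zeros((n_feat + 1, n_feat + 1))
    P[1:, 1:] = lam * second_difference_penalty(n_feat)  # bias is unpenalized
    w = np.zeros(n_feat + 1)
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(Xb @ w)))       # predicted response probability
        grad = Xb.T @ (p - y) + P @ w             # gradient of penalized neg. log-likelihood
        H = Xb.T @ (Xb * (p * (1.0 - p))[:, None]) + P
        H += 1e-8 * np.eye(n_feat + 1)            # small ridge for numerical stability
        step = np.linalg.solve(H, grad)
        w -= step
        if np.max(np.abs(step)) < 1e-8:
            break
    return w[0], w[1:]

def roughness(w):
    """Sum of squared second differences: the quantity the prior penalizes."""
    return float(np.sum(np.diff(w, n=2) ** 2))

# Simulated observer: responds according to a smooth two-lobed template (assumed).
rng = np.random.default_rng(0)
n_feat, n_trials = 40, 4000
axis = np.arange(n_feat)
true_template = 0.5 * (np.exp(-0.5 * ((axis - 12) / 3.0) ** 2)
                       - np.exp(-0.5 * ((axis - 27) / 3.0) ** 2))
X = rng.normal(size=(n_trials, n_feat))
p_resp = 1.0 / (1.0 + np.exp(-(X @ true_template)))
y = (rng.random(n_trials) < p_resp).astype(float)

_, w_raw = fit_smooth_glm(X, y, lam=0.0)      # plain GLM, no prior
_, w_smooth = fit_smooth_glm(X, y, lam=50.0)  # GLM with smoothness penalty

corr_smooth = float(np.corrcoef(w_smooth, true_template)[0, 1])
print("roughness without prior:", roughness(w_raw))
print("roughness with prior:   ", roughness(w_smooth))
print("correlation with the simulated template:", corr_smooth)
```

In this toy setting the penalized estimate is smoother than the unpenalized one by construction. In practice the penalty weight would have to be chosen from the data, for example by cross-validation, which this sketch does not do.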
Pages: 12