Nonlinear auditory models yield new insights into representations of vowels

被引:17
作者
Carney, Laurel H. [1 ,2 ]
McDonough, Joyce M. [3 ]
机构
[1] Univ Rochester, Dept Biomed Engn, 601 Elmwood Ave,Box 603, Rochester, NY 14642 USA
[2] Univ Rochester, Dept Neurosci, 601 Elmwood Ave,Box 603, Rochester, NY 14642 USA
[3] Univ Rochester, Dept Linguist, Rochester, NY USA
基金
美国国家卫生研究院;
关键词
Audition; Speech perception; Physiological psychology; NERVE FIBERS; PHENOMENOLOGICAL MODEL; HAIR-CELLS; VERTICAL-BAR; RESPONSES; INNER; SIMULATION; EPSILON; COCHLEA; CAT;
D O I
10.3758/s13414-018-01644-w
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Studies of vowel systems regularly appeal to the need to understand how the auditory system encodes and processes the information in the acoustic signal. The goal of this study is to present computational models to address this need, and to use the models to illustrate responses to vowels at two levels of the auditory pathway. Many of the models previously used to study auditory representations of speech are based on linear filter banks simulating the tuning of the inner ear. These models do not incorporate key nonlinear response properties of the inner ear that influence responses at conversational-speech sound levels. These nonlinear properties shape neural representations in ways that are important for understanding responses in the central nervous system. The model for auditory-nerve (AN) fibers used here incorporates realistic nonlinear properties associated with the basilar membrane, inner hair cells (IHCs), and the IHC-AN synapse. These nonlinearities set up profiles of f0-related fluctuations that vary in amplitude across the population of frequency-tuned AN fibers. Amplitude fluctuations in AN responses are smallest near formant peaks and largest at frequencies between formants. These f0-related fluctuations strongly excite or suppress neurons in the auditory midbrain, the first level of the auditory pathway where tuning for low-frequency fluctuations in sounds occurs. Formant-related amplitude fluctuations provide representations of the vowel spectrum in discharge rates of midbrain neurons. These representations in the midbrain are robust across a wide range of sound levels, including the entire range of conversational-speech levels, and in the presence of realistic background noise levels.
引用
收藏
页码:1034 / 1046
页数:13
相关论文
共 61 条
[1]  
Becker-Kristal R., 2010, THESIS U CALIF
[2]   AN INTERNATIONAL COMPARISON OF LONG-TERM AVERAGE SPEECH SPECTRA [J].
BYRNE, D ;
DILLON, H ;
TRAN, K ;
ARLINGER, S ;
WILBRAHAM, K ;
COX, R ;
HAGERMAN, B ;
HETU, R ;
KEI, J ;
LUI, C ;
KIESSLING, J ;
KOTBY, MN ;
NASSER, NHA ;
ELKHOLY, WAH ;
NAKANISHI, Y ;
OYER, H ;
POWELL, R ;
STEPHENS, D ;
MEREDITH, R ;
SIRIMANNA, T ;
TAVARTKILADZE, G ;
FROLENKOV, GI ;
WESTERMAN, S ;
LUDVIGSEN, C .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 96 (04) :2108-2120
[3]  
Carlson R, 1982, REPRESENTATION SPEEC, P109
[4]   Supra-Threshold Hearing and Fluctuation Profiles: Implications for Sensorineural and Hidden Hearing Loss [J].
Carney, Laurel H. .
JARO-JOURNAL OF THE ASSOCIATION FOR RESEARCH IN OTOLARYNGOLOGY, 2018, 19 (04) :331-352
[5]   Speech Coding in the Brain: Representation of Vowel Formants by Midbrain Neurons Tuned to Sound Fluctuations [J].
Carney, Laurel H. ;
Li, Tianhao ;
McDonough, Joyce M. .
ENEURO, 2015, 2 (04)
[6]   Speech Coding in the Midbrain: Effects of Sensorineural Hearing Loss [J].
Carney, Laurel H. ;
Kim, Duck O. ;
Kuwada, Shigeyuki .
PHYSIOLOGY, PSYCHOACOUSTICS AND COGNITION IN NORMAL AND IMPAIRED HEARING, 2016, 894 :427-435
[7]   A MODEL FOR THE RESPONSES OF LOW-FREQUENCY AUDITORY-NERVE FIBERS IN CAT [J].
CARNEY, LH .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1993, 93 (01) :401-417
[8]   ACOUSTIC LESIONS IN THE MAMMALIAN COCHLEA - IMPLICATIONS FOR THE SPATIAL-DISTRIBUTION OF THE ACTIVE PROCESS [J].
CODY, AR .
HEARING RESEARCH, 1992, 62 (02) :166-172
[9]  
Crothers J, 1978, UNIVERSALS HUMAN LAN, V2, P99
[10]   NEUROBIOLOGY OF COCHLEAR INNER AND OUTER HAIR-CELLS - INTRACELLULAR-RECORDINGS [J].
DALLOS, P .
HEARING RESEARCH, 1986, 22 (1-3) :185-198