Linking speech perception and neurophysiology: speech decoding guided by cascaded oscillators locked to the input rhythm

被引:245
作者
Ghitza, Oded [1 ,2 ]
机构
[1] Boston Univ, Hearing Res Ctr, Boston, MA 02215 USA
[2] Boston Univ, Ctr Biodynam, Boston, MA 02215 USA
基金
美国国家科学基金会;
关键词
speech perception; memory access; decoding time; brain rhythms; cascaded cortical oscillations; phase locking; parsing; decoding; TIME-COMPRESSED SPEECH; BRAIN-WAVE RECOGNITION; NEURONAL OSCILLATIONS; SENSORY INPUT; INTELLIGIBILITY; COMPREHENSION; INTEGRATION; SYNCHRONY; DYNAMICS; PATTERNS;
D O I
10.3389/fpsyg.2011.00130
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
The premise of this study is that current models of speech perception, which are driven by acoustic features alone, are incomplete, and that the role of decoding time during memory access must be incorporated to account for the patterns of observed recognition phenomena. It is postulated that decoding time is governed by a cascade of neuronal oscillators, which guide template-matching operations at a hierarchy of temporal scales. Cascaded cortical oscillations in the theta, beta, and gamma frequency bands are argued to be crucial for speech intelligibility. Intelligibility is high so long as these oscillations remain phase locked to the auditory input rhythm. A model (Tempo) is presented which is capable of emulating recent psychophysical data on the intelligibility of speech sentences as a function of "packaging" rate (Ghitza and Greenberg, 2009). The data show that intelligibility of speech that is time-compressed by a factor of 3 (i.e., a high syllabic rate) is poor (above 50% word error rate), but is substantially restored when the information stream is re-packaged by the insertion of silent gaps in between successive compressed-signal intervals - a counterintuitive finding, difficult to explain using classical models of speech perception, but emerging naturally from the Tempo architecture.
引用
收藏
页数:13
相关论文
共 47 条
[1]   Speech comprehension is correlated with temporal response patterns recorded from auditory cortex [J].
Ahissar, E ;
Nagarajan, S ;
Ahissar, M ;
Protopapas, A ;
Mahncke, H ;
Merzenich, MM .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (23) :13367-13372
[2]   Decoding temporally encoded sensory input by cortical oscillations and thalamic phase comparators [J].
Ahissar, E ;
Haidarliu, S ;
Zacksenhouse, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (21) :11633-11638
[3]   Oscillatory neuronal dynamics during language comprehension [J].
Bastiaansen, Marcel ;
Hagoort, Peter .
EVENT-RELATED DYNAMICS OF BRAIN OSCILLATIONS, 2006, 159 :179-196
[4]   Background gamma rhythmicity and attention in cortical local circuits:: A computational study [J].
Börgers, C ;
Epstein, S ;
Kopell, NJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (19) :7002-7007
[5]  
Buzaki G., 2006, Rhythms of the Brain, DOI 10.1093/acprof:oso/9780195301069.001.0001
[6]   Theta rhythm of navigation:: Link between path integration and landmark navigation, episodic and semantic memory [J].
Buzsáki, G .
HIPPOCAMPUS, 2005, 15 (07) :827-840
[7]   Spatiotemporal dynamics of word processing in the human brain [J].
Canolty, Ryan T. ;
Soltani, Maryam ;
Dalal, Sarang S. ;
Edwards, Erik ;
Dronkers, Nina F. ;
Nagarajan, Srikantan S. ;
Kirsch, Heidi E. ;
Barbaro, Nicholas M. ;
Knight, Robert T. .
FRONTIERS IN NEUROSCIENCE, 2007, 1 (01) :185-196
[8]   Spectro-temporal modulation transfer functions and speech intelligibility [J].
Chi, TS ;
Gao, YJ ;
Guyton, MC ;
Ru, PW ;
Shamma, S .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (05) :2719-2732
[9]   Modeling auditory processing of amplitude modulation .1. Detection and masking with narrow-band carriers [J].
Dau, T ;
Kollmeier, B ;
Kohlrausch, A .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 102 (05) :2892-2905
[10]   Altering Context Speech Rate Can Cause Words to Appear or Disappear [J].
Dilley, Laura C. ;
Pitt, Mark A. .
PSYCHOLOGICAL SCIENCE, 2010, 21 (11) :1664-1670