Linking speech perception and neurophysiology: speech decoding guided by cascaded oscillators locked to the input rhythm

被引:236
作者
Ghitza, Oded [1 ,2 ]
机构
[1] Boston Univ, Hearing Res Ctr, Boston, MA 02215 USA
[2] Boston Univ, Ctr Biodynam, Boston, MA 02215 USA
来源
FRONTIERS IN PSYCHOLOGY | 2011年 / 2卷
基金
美国国家科学基金会;
关键词
speech perception; memory access; decoding time; brain rhythms; cascaded cortical oscillations; phase locking; parsing; decoding; TIME-COMPRESSED SPEECH; BRAIN-WAVE RECOGNITION; NEURONAL OSCILLATIONS; SENSORY INPUT; INTELLIGIBILITY; COMPREHENSION; INTEGRATION; SYNCHRONY; DYNAMICS; PATTERNS;
D O I
10.3389/fpsyg.2011.00130
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
The premise of this study is that current models of speech perception, which are driven by acoustic features alone, are incomplete, and that the role of decoding time during memory access must be incorporated to account for the patterns of observed recognition phenomena. It is postulated that decoding time is governed by a cascade of neuronal oscillators, which guide template-matching operations at a hierarchy of temporal scales. Cascaded cortical oscillations in the theta, beta, and gamma frequency bands are argued to be crucial for speech intelligibility. Intelligibility is high so long as these oscillations remain phase locked to the auditory input rhythm. A model (Tempo) is presented which is capable of emulating recent psychophysical data on the intelligibility of speech sentences as a function of "packaging" rate (Ghitza and Greenberg, 2009). The data show that intelligibility of speech that is time-compressed by a factor of 3 (i.e., a high syllabic rate) is poor (above 50% word error rate), but is substantially restored when the information stream is re-packaged by the insertion of silent gaps in between successive compressed-signal intervals - a counterintuitive finding, difficult to explain using classical models of speech perception, but emerging naturally from the Tempo architecture.
引用
收藏
页数:13
相关论文
共 47 条
  • [1] Speech comprehension is correlated with temporal response patterns recorded from auditory cortex
    Ahissar, E
    Nagarajan, S
    Ahissar, M
    Protopapas, A
    Mahncke, H
    Merzenich, MM
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (23) : 13367 - 13372
  • [2] Decoding temporally encoded sensory input by cortical oscillations and thalamic phase comparators
    Ahissar, E
    Haidarliu, S
    Zacksenhouse, M
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (21) : 11633 - 11638
  • [3] Oscillatory neuronal dynamics during language comprehension
    Bastiaansen, Marcel
    Hagoort, Peter
    [J]. EVENT-RELATED DYNAMICS OF BRAIN OSCILLATIONS, 2006, 159 : 179 - 196
  • [4] Background gamma rhythmicity and attention in cortical local circuits:: A computational study
    Börgers, C
    Epstein, S
    Kopell, NJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (19) : 7002 - 7007
  • [5] Buzaki G., 2006, Rhythms of the Brain, DOI 10.1093/acprof:oso/9780195301069.001.0001
  • [6] Theta rhythm of navigation:: Link between path integration and landmark navigation, episodic and semantic memory
    Buzsáki, G
    [J]. HIPPOCAMPUS, 2005, 15 (07) : 827 - 840
  • [7] Spatiotemporal dynamics of word processing in the human brain
    Canolty, Ryan T.
    Soltani, Maryam
    Dalal, Sarang S.
    Edwards, Erik
    Dronkers, Nina F.
    Nagarajan, Srikantan S.
    Kirsch, Heidi E.
    Barbaro, Nicholas M.
    Knight, Robert T.
    [J]. FRONTIERS IN NEUROSCIENCE, 2007, 1 (01): : 185 - 196
  • [8] Spectro-temporal modulation transfer functions and speech intelligibility
    Chi, TS
    Gao, YJ
    Guyton, MC
    Ru, PW
    Shamma, S
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (05) : 2719 - 2732
  • [9] Modeling auditory processing of amplitude modulation .1. Detection and masking with narrow-band carriers
    Dau, T
    Kollmeier, B
    Kohlrausch, A
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 102 (05) : 2892 - 2905
  • [10] Altering Context Speech Rate Can Cause Words to Appear or Disappear
    Dilley, Laura C.
    Pitt, Mark A.
    [J]. PSYCHOLOGICAL SCIENCE, 2010, 21 (11) : 1664 - 1670