Temporal context in speech processing and attentional stream selection: A behavioral and neural perspective

Cited by: 123
Authors
Golumbic, Elana M. Zion [1 ,2 ,3 ]
Poeppel, David [4 ]
Schroeder, Charles E. [1 ,2 ,3 ]
Affiliations
[1] Columbia Univ, Dept Neurol, Med Ctr, New York, NY 10032 USA
[2] Columbia Univ, Dept Psychiat, Med Ctr, New York, NY 10032 USA
[3] Nathan S Kline Inst Psychiat Res, Orangeburg, NY 10962 USA
[4] NYU, Dept Psychol, New York, NY 10003 USA
Keywords
Oscillations; Entrainment; Attention; Speech; Auditory; Time; Rhythm; PRIMARY AUDITORY-CORTEX; MODULATION TRANSFER-FUNCTIONS; TIME-COMPRESSED SPEECH; NEURONAL OSCILLATIONS; AUDIOVISUAL SPEECH; COCKTAIL PARTY; VISUAL-PERCEPTION; ACOUSTIC STIMULI; RECEPTIVE-FIELDS; HUMAN NEOCORTEX;
DOI
10.1016/j.bandl.2011.12.010
Chinese Library Classification (CLC) number
R36 [Pathology]; R76 [Otorhinolaryngology];
Discipline classification code
100104 ; 100213 ;
Abstract
The human capacity for processing speech is remarkable, especially given that information in speech unfolds over multiple time scales concurrently. Similarly notable is our ability to filter out extraneous sounds and focus our attention on one conversation, epitomized by the 'Cocktail Party' effect. Yet, the neural mechanisms underlying on-line speech decoding and attentional stream selection are not well understood. We review findings from behavioral and neurophysiological investigations that underscore the importance of the temporal structure of speech for achieving these perceptual feats. We discuss the hypothesis that entrainment of ambient neuronal oscillations to speech's temporal structure, across multiple time-scales, serves to facilitate its decoding and underlies the selection of an attended speech stream over other competing input. In this regard, speech decoding and attentional stream selection are examples of 'Active Sensing', emphasizing an interaction between proactive and predictive top-down modulation of neuronal dynamics and bottom-up sensory input. (C) 2012 Elsevier Inc. All rights reserved.
Pages: 151 - 161
Page count: 11
Related papers
192 in total
  • [1] Right-hemisphere auditory cortex is dominant for coding syllable patterns in speech
    Abrams, Daniel A.
    Nicol, Trent
    Zecker, Steven
    Kraus, Nina
    [J]. JOURNAL OF NEUROSCIENCE, 2008, 28 (15) : 3958 - 3965
  • [2] Ahissar E, 2005, AUDITORY CORTEX: SYNTHESIS OF HUMAN AND ANIMAL RESEARCH, P295
  • [3] Speech comprehension is correlated with temporal response patterns recorded from auditory cortex
    Ahissar, E
    Nagarajan, S
    Ahissar, M
    Protopapas, A
    Mahncke, H
    Merzenich, MM
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (23) : 13367 - 13372
  • [4] Human cortical responses to the speech envelope
    Aiken, Steven J.
    Picton, Terence W.
    [J]. EAR AND HEARING, 2008, 29 (02) : 139 - 157
  • [5] Selectively attending to auditory objects
    Alain, C
    Arnott, SR
    [J]. FRONTIERS IN BIOSCIENCE-LANDMARK, 2000, 5 : D202 - D212
  • [6] Allen J.B., 2005, ARTICULATION INTELLI
  • [7] How Do Humans Process and Recognize Speech?
    Allen, Jont B.
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04) : 567 - 577
  • [8] [Anonymous], REANALYSIS SENTENCE
  • [9] Arai T., 1998, IEEE INT C AC SPEECH
  • [10] The where and when of linguistic word-level prosody
    Arciuli, Joanne
    Slowiaczek, Louisa M.
    [J]. NEUROPSYCHOLOGIA, 2007, 45 (11) : 2638 - 2642