On the role of theta-driven syllabic parsing in decoding speech: intelligibility of speech with a manipulated modulation spectrum

被引:109
作者
Ghitza, Oded [1 ]
机构
[1] Boston Univ, Hearing Res Ctr, Boston, MA 02215 USA
基金
美国国家科学基金会;
关键词
speech perception; intelligibility; syllabic parsing; modulation spectrum; cascaded neuronal oscillations; theta band; hierarchical window structure; synchronization;
D O I
10.3389/fpsyg.2012.00238
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Recent hypotheses on the potential role of neuronal oscillations in speech perception propose that speech is processed on multi-scale temporal analysis windows formed by a cascade of neuronal oscillators locked to the input pseudo-rhythm. In particular, Ghitza (2011) proposed that the oscillators are in the theta, beta, and gamma frequency bands with the theta oscillator the master, tracking the input syllabic rhythm and setting a time-varying, hierarchical window structure synchronized with the input. In the study described here the hypothesized role of theta was examined by measuring the intelligibility of speech with a manipulated modulation spectrum. Each critical-band signal was manipulated by controlling the degree of temporal envelope flatness. Intelligibility of speech with critical-band envelopes that are flat is poor; inserting extra information, restricted to the input syllabic rhythm, markedly improves intelligibility. It is concluded that flattening the critical-band envelopes prevents the theta oscillator from tracking the input rhythm, hence the disruption of the hierarchical window structure that controls the decoding process. Reinstating the input-rhythm information revives the tracking capability, hence restoring the synchronization between the window structure and the input, resulting in the extraction of additional information from the flat modulation spectrum.
引用
收藏
页数:12
相关论文
共 16 条
[1]  
Ahissar E, 2005, AUDITORY CORTEX: SYNTHESIS OF HUMAN AND ANIMAL RESEARCH, P295
[2]   Speech comprehension is correlated with temporal response patterns recorded from auditory cortex [J].
Ahissar, E ;
Nagarajan, S ;
Ahissar, M ;
Protopapas, A ;
Mahncke, H ;
Merzenich, MM .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (23) :13367-13372
[3]   Decoding temporally encoded sensory input by cortical oscillations and thalamic phase comparators [J].
Ahissar, E ;
Haidarliu, S ;
Zacksenhouse, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (21) :11633-11638
[4]  
Chait M, 2005, ISCA WORKSH PLAST SP
[5]   EFFECT OF TEMPORAL ENVELOPE SMEARING ON SPEECH RECEPTION [J].
DRULLMAN, R ;
FESTEN, JM ;
PLOMP, R .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 95 (02) :1053-1064
[6]   On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception [J].
Ghitza, O .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 110 (03) :1628-1640
[7]   Linking speech perception and neurophysiology: speech decoding guided by cascaded oscillators locked to the input rhythm [J].
Ghitza, Oded .
FRONTIERS IN PSYCHOLOGY, 2011, 2
[8]   On the Possible Role of Brain Rhythms in Speech Perception: Intelligibility of Time-Compressed Speech with Periodic and Aperiodic Insertions of Silence [J].
Ghitza, Oded ;
Greenberg, Steven .
PHONETICA, 2009, 66 (1-2) :113-126
[9]   Cortical oscillations and speech processing: emerging computational principles and operations [J].
Giraud, Anne-Lise ;
Poeppel, David .
NATURE NEUROSCIENCE, 2012, 15 (04) :511-517
[10]   Temporal context in speech processing and attentional stream selection: A behavioral and neural perspective [J].
Golumbic, Elana M. Zion ;
Poeppel, David ;
Schroeder, Charles E. .
BRAIN AND LANGUAGE, 2012, 122 (03) :151-161