Speech recognition with amplitude and frequency modulations

被引:293
|
作者
Zeng, FG [1 ]
Nie, K
Stickney, GS
Kong, YY
Vongphoe, M
Bhargave, A
Weit, CG
Cao, K
机构
[1] Univ Calif Irvine, Dept Anat & Neurobiol, Irvine, CA 92697 USA
[2] Univ Calif Irvine, Dept Biomed Engn, Irvine, CA 92697 USA
[3] Univ Calif Irvine, Dept Cognit Sci, Irvine, CA 92697 USA
[4] Univ Calif Irvine, Dept Otolaryngol Head & Neck Surg, Irvine, CA 92697 USA
[5] Beijing Union Med Coll Hosp, Dept Otolaryngol Head & Neck Surg, Beijing 100730, Peoples R China
关键词
auditory analysis; cochlear implant; neural code; phase; scene analysis;
D O I
10.1073/pnas.0406460102
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Amplitude modulation (AM) and frequency modulation (FM) are commonly used in communication, but their relative contributions to speech recognition have not been fully explored. To bridge this gap, we derived slowly varying AM and FM from speech sounds and conducted listening tests using stimuli with different modulations in normal-hearing and cochlear-implant subjects. We found that although AM from a limited number of spectral bands may be sufficient for speech recognition in quiet, FM significantly enhances speech recognition in noise, as well as speaker and tone recognition. Additional speech reception threshold measures revealed that FM is particularly critical for speech recognition with a competing voice and is independent of spectral resolution and similarity. These results suggest that AM and FM provide independent yet complementary contributions to support robust speech recognition under realistic listening situations. Encoding FM may improve auditory scene analysis, cochlear-implant, and audiocoding performance.
引用
收藏
页码:2293 / 2298
页数:6
相关论文
共 50 条
  • [1] Frequency modulations in the speech signal
    A. S. Leonov
    I. S. Makarov
    V. N. Sorokin
    Acoustical Physics, 2009, 55 : 876 - 887
  • [2] Frequency modulations in the speech signal
    Leonov, A. S.
    Makarov, I. S.
    Sorokin, V. N.
    ACOUSTICAL PHYSICS, 2009, 55 (06) : 876 - 887
  • [3] Extracting amplitude modulations from speech in the time domain
    Prendergast, Garreth
    Johnson, Sam R.
    Green, Gary G. R.
    SPEECH COMMUNICATION, 2011, 53 (06) : 903 - 913
  • [4] AMPLITUDE AND FREQUENCY MODULATIONS OF SPONTANEOUS OTOACOUSTIC EMISSIONS
    Bian, Lin
    CONCEPTS AND CHALLENGES IN THE BIOPHYSICS OF HEARING, 2009, : 183 - 189
  • [5] Amplitude and Frequency Modulations with Cellular Neural Networks
    Tander, Baran
    Ozmen, Atilla
    NEURAL PROCESSING LETTERS, 2015, 41 (02) : 259 - 272
  • [6] Amplitude and Frequency Modulations with Cellular Neural Networks
    Baran Tander
    Atilla Özmen
    Neural Processing Letters, 2015, 41 : 259 - 272
  • [7] EEG frequency-amplitude characteristics of the successful recognition of emotional speech
    Kislova O.O.
    Rusalova M.N.
    Neuroscience and Behavioral Physiology, 2010, 40 (6) : 639 - 644
  • [8] EEG Frequency-Amplitude Characteristics of Successful Recognition of Emotion in Speech
    Kislova, O. O.
    Rusalova, M. N.
    ZHURNAL VYSSHEI NERVNOI DEYATELNOSTI IMENI I P PAVLOVA, 2009, 59 (03) : 281 - 287
  • [9] The role of phase synchronisation between low frequency amplitude modulations in child phonology and morphology speech tasks
    Flanagan, Sheila
    Goswami, Usha
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 143 (03): : 1366 - 1375
  • [10] Hemispheric difference in processing tone frequency and amplitude modulations
    Pardo, PJ
    Mäkelä, JP
    Sams, M
    NEUROREPORT, 1999, 10 (14) : 3081 - 3086