Psychoacoustical Approach to Sinusoidal Modeling of Speech

被引:0
|
作者
Nagy, Martin Turi [1 ]
Minarik, Ivan [1 ]
机构
[1] Slovak Univ Technol Bratislava, Dept Telecommun, Ilkovicova 3, Bratislava 81219, Slovakia
关键词
Psychoacoustics; sinusoidal modeling; SN model; speech synthesis;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the psychoacoustical approach to the sinusoidal analysis/synthesis of the speech. With the SN (sinusoids plus noise) model, the periodic part of the speech signal is described by time-varying sinusoids and the non-periodic part by noise. The number of sinusoids needed for reconstruction can be then reduced by application of the psychoacoustical principles. The proposed compression scheme allows us to reduce the amount of physical data needed to store the parameters. The whole method can be used in speech synthesis for storing speech corpuses that are prepared for easy prosodic manipulation.
引用
收藏
页码:217 / 220
页数:4
相关论文
共 50 条
  • [1] Rate-distortion optimal sinusoidal modeling of audio and speech using psychoacoustical matching pursuits
    Heusdens, R
    van de Par, S
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 1809 - 1812
  • [2] Sinusoidal modeling and modification of unvoiced speech
    Macon, MW
    Clements, MA
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (06): : 557 - 560
  • [3] SPEECH AS TEMPORAL PATTERN - A PSYCHOACOUSTICAL PROFILE
    LAUTER, JL
    HIRSH, IJ
    SPEECH COMMUNICATION, 1985, 4 (1-3) : 41 - 54
  • [4] FREQUENCY-VARYING SINUSOIDAL MODELING OF SPEECH
    MARQUES, JS
    ALMEIDA, LB
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (05): : 763 - 765
  • [5] Exponential sinusoidal modeling of transitional speech segments
    Aalborg Univ, Aalborg, Denmark
    ICASSP IEEE Int Conf Acoust Speech Signal Process Proc, (473-476):
  • [6] Exponential sinusoidal modeling of transitional speech segments
    Jensen, J
    Jensen, SH
    Hansen, E
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 473 - 476
  • [7] Psychoacoustical enhancement of speech based on multitaper spectrum
    School of Information Science and Engineering, Southeast University, Nanjing 210096, China
    不详
    Shengxue Xuebao, 2007, 3 (275-281):
  • [8] A perceptual subspace method for sinusoidal speech and audio modeling
    Jensen, J
    Heusdens, R
    Jensen, SH
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 401 - 404
  • [9] Single Channel Speech Separation Based on Sinusoidal Modeling
    Wiem, Belhedi
    anouar, Ben messaoud Mohamed
    Aicha, Bouzid
    2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 672 - 676
  • [10] Separation of harmonic and speech signals using sinusoidal modeling
    Jancovic, Peter
    Kokuer, Munevver
    2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, : 261 - 264