HYBRID HARMONIC CODING OF SPEECH AT LOW BIT-RATES

被引:5
|
作者
MARQUES, JS
ABRANTES, AJ
机构
[1] INES, ISEL, R ALVES REDOL 9, P-1000 LISBON, PORTUGAL
[2] INESC, IST, P-1000 LISBON, PORTUGAL
关键词
SPEECH MODELING; SINUSODAL MODELING; CODING;
D O I
10.1016/0167-6393(94)90064-7
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a novel approach to sinusoidal coding of speech which avoids the use of a voicing detector. The proposed model represents the speech signal as a sum of sinusoids and bandpass random signals and it is denoted hybrid harmonic model in this paper. The use of two different sets of basis functions increases the robustness of the model since there is no need to switch between techniques tailored to particular classes of sounds. Sinusoidal basis functions with harmonically related frequencies allow an accurate representation of the quasi-periodic structure of voiced speech but show difficulties in representing unvoiced sounds. On the other hand, the bandpass random functions are well suited for high quality representation of unvoiced speech sounds, since their bandwidth is larger than the bandwidth of sinusoids. The amplitudes of both sets of basis functions are simultaneously estimated by a least squares algorithm and the output speech signal is synthesized in the time domain by the superposition of all basis functions multiplied by their amplitudes. Experimental tests confirm an improved performance of the hybrid model for operation with noise-corrupted input speech, relative to classic sinusoidal models which exhibit a strong dependency on voicing decision. Finally, the implementation and test of a fully quantized hybrid coder at 4.8 kbit/s is described.
引用
收藏
页码:231 / 247
页数:17
相关论文
共 50 条
  • [1] Speech coding at very low bit-rates for mobile communication
    Gandhi, AG
    Dhekane, SS
    APCC 2003: 9TH ASIA-PACIFIC CONFERENCE ON COMMUNICATION, VOLS 1-3, PROCEEDINGS, 2003, : 358 - 362
  • [2] A Long Term Harmonic plus Noise Model for Narrow-Band Speech Coding at Very Low Bit-Rates
    Ben Ali, Faten
    Djaziri-Larbi, Sonia
    2017 40TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2017, : 372 - 376
  • [3] Combined harmonic and waveform coding of speech at low bit rates
    Shlomot, E
    Cuperman, V
    Gersho, A
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 585 - 588
  • [4] Fast video coding at low bit-rates for mobile devices
    Jindal, M
    Prasad, RSV
    Ramkishor, K
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 483 - 487
  • [5] CODING SPEECH AT LOW BIT RATES
    JAYANT, NS
    IEEE SPECTRUM, 1986, 23 (08) : 58 - 63
  • [6] PREDICTIVE CODING OF SPEECH AT LOW BIT RATES
    ATAL, BS
    IEEE TRANSACTIONS ON COMMUNICATIONS, 1982, 30 (04) : 600 - 614
  • [7] Speech coding at low and very low bit rates
    Baudoin, G
    Cernocky, J
    Gournay, P
    Chollet, G
    ANNALS OF TELECOMMUNICATIONS, 2000, 55 (9-10) : 462 - 482
  • [8] Speech coding at low and very low bit rates
    Baudoin, Geneviève
    Cernocky, Jan
    Gournay, Philippe
    Chollet, Gérard
    Annales des Telecommunications/Annals of Telecommunications, 2000, 55 (9-10): : 462 - 482
  • [9] Contourless region-based video coding for very low bit-rates
    Salgado, L
    Garcia, N
    Menendez, JM
    Rendon, E
    1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, 1998, : 299 - 303
  • [10] TRELLIS EXCITATION SPEECH CODING AT LOW BIT RATES
    KANG, SW
    FISCHER, TR
    IEEE TRANSACTIONS ON COMMUNICATIONS, 1994, 42 (2-4) : 1902 - 1910