Experimental Study on Noise Pre-Processing for a Low Bit Rate Speech Coder

Cited by: 0
Authors
Shi, Wenhua [1 ,2 ]
Zhang, Xiongwei [1 ]
Zou, Xia [1 ]
Song, Xiaodong [2 ]
Affiliations
[1] PLA Univ Sci & Technol, Lab Intelligent Informat Proc, Nanjing, Jiangsu, Peoples R China
[2] Air Force Aviat Univ, Flight Instructor Training Base, Bengbu, Peoples R China
Keywords
parametric speech coding; MELP; SMV; NPP; speech enhancement; PARAMETERS; DATABASE;
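DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code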
0808 ; 0809 ;
Abstract
This paper examines how well speech coding parameters can be extracted under clean and noisy conditions, and how speech enhancement affects the quality of the parameters extracted for a low bit rate speech coder. The MELP vocoder is used to estimate three parameters: the fundamental frequency, the voicing decision, and the linear prediction coefficients. The de-noising methods of the MELPe vocoder and of SMV are each applied as a preprocessor under different noise environments. Pitch accuracy rate, voicing decision error rate, and average spectral distortion are used to quantify the quality and intelligibility improvements for degraded speech with and without the noise pre-processing system. The experimental results show that noise pre-processing improves parameter estimation, especially at low SNR, and that the MELPe speech enhancement algorithm yields better parameter extraction performance than the SMV algorithm. These findings should help in designing noise pre-processing algorithms tailored to low bit rate parametric coding.
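The abstract names three evaluation measures but does not define them. The sketch below shows one common way such measures are computed; it is not the authors' implementation. The function names, the 20% pitch tolerance, and the restriction of the pitch measure to frames voiced in both tracks are assumptions made for illustration.

```python
import cmath
import math


def lpc_log_spectrum_db(lpc, n_freqs=256):
    """Log-magnitude spectrum (dB) of the all-pole filter 1/A(z).

    Assumes A(z) = 1 + sum_k a_k z^-k, with lpc = [a_1, ..., a_p].
    """
    spectrum = []
    for i in range(n_freqs):
        w = math.pi * i / n_freqs  # frequency in [0, pi)
        a = 1 + sum(c * cmath.exp(-1j * w * (k + 1)) for k, c in enumerate(lpc))
        spectrum.append(-20.0 * math.log10(abs(a)))
    return spectrum


def spectral_distortion_db(lpc_ref, lpc_test, n_freqs=256):
    """RMS difference (dB) between two LPC log spectra for one frame."""
    ref = lpc_log_spectrum_db(lpc_ref, n_freqs)
    test = lpc_log_spectrum_db(lpc_test, n_freqs)
    mse = sum((r - t) ** 2 for r, t in zip(ref, test)) / n_freqs
    return math.sqrt(mse)


def average_spectral_distortion(ref_frames, test_frames):
    """Mean per-frame spectral distortion over paired frames."""
    values = [spectral_distortion_db(r, t) for r, t in zip(ref_frames, test_frames)]
    return sum(values) / len(values)


def voicing_error_rate(ref_voiced, test_voiced):
    """Fraction of frames whose voiced/unvoiced decision differs."""
    errors = sum(r != t for r, t in zip(ref_voiced, test_voiced))
    return errors / len(ref_voiced)


def pitch_accuracy_rate(ref_f0, test_f0, ref_voiced, test_voiced, tolerance=0.2):
    """Fraction of jointly voiced frames whose pitch is within the tolerance.

    The 20% relative tolerance is an assumed threshold, not taken from the paper.
    """
    pairs = [
        (r, t)
        for r, t, rv, tv in zip(ref_f0, test_f0, ref_voiced, test_voiced)
        if rv and tv
    ]
    if not pairs:
        return 0.0
    correct = sum(abs(t - r) <= tolerance * r for r, t in pairs)
    return correct / len(pairs)
```

In an experiment of the kind described, the reference tracks would come from clean speech and the test tracks from noisy speech with or without pre-processing, so that each measure can be compared across the two conditions.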
Pages: 5
Related papers
50 in total
  • [1] AN EXPERIMENTAL STUDY OF NOISE ON THE PERFORMANCE OF A LOW BIT RATE PARAMETRIC SPEECH CODER
    Shi, Wenhua
    Zhang, Xiongwei
    Zou, Xia
    Han, Wei
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [2] Low bit-rate CELP speech coder with low delay
    Hayashi, S
    Suguimoto, M
    Erinnoviar
    SIGNAL PROCESSING, 1999, 72 (02) : 97 - 105
  • [3] TTS based very low bit rate speech coder
    Lee, Ki-Seung
    Cox, Richard V.
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 181 - 184
  • [4] TTS based very low bit rate speech coder
    Lee, KS
    Cox, RV
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 181 - 184
  • [5] HCELP: Low bit rate speech coder for voice storage applications
    Bouraoui, M
    Druilhe, FB
    Feng, G
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 739 - 742
  • [6] Pre-processing of the speech data
    [Author unknown]
    ROBUST ADAPTATION TO NON-NATIVE ACCENTS IN AUTOMATIC SPEECH RECOGNITION, 2002, 2560 : 15 - 19
  • [7] A very low bit rate speech coder based on a recognition/synthesis paradigm
    Lee, KS
    Cox, RV
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (05): : 482 - 491
  • [8] A SPEAKER ADAPTABLE VERY LOW BIT RATE SPEECH CODER BASED ON HMM
    彭煳
    朱杰
    Journal of Shanghai Jiaotong University, 2000, (02) : 1 - 5
  • [9] Diphone-like units for very low bit rate speech coder
    Motlicek, P
    Cernocky, J
    Baudoin, G
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4023 - 4023
  • [10] Selection of glottal excitation for low bit rate speech coder with speaker recognizability
    Wu, CH
    IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 617 - 620