400bps High-Quality Speech Coding Algorithm

被引:2
作者
Ma, Xiaofeng [1 ]
Li, Ye [1 ]
Jiang, Jingsai [1 ]
Zhang, Peng [1 ]
Fan, Yanhong [1 ]
Hao, Qiuyun [1 ]
机构
[1] Natl Supercomp Ctr Jinan, Shandong Prov Key Lab Comp Networks, Shandong Comp Sci Ctr, Jinan, Peoples R China
来源
2016 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C) | 2016年
关键词
speech coding; super-frame; vector quantization; MELP; mean opinion score;
D O I
10.1109/IS3C.2016.75
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Low bit rate speech coding is important to speech communications over band-limited or harsh channels. In this paper, based on the mixed excitation linear prediction (MELP) model, we propose a high-quality 400bps low bit rate speech coding algorithm which introduces multi-frame joint vector quantization, adaptive spectral enhancement and multi-band sinusoidal mixed excitation. Efficient parameter quantization schemes are employed on the basis of the super-frame structure. It is verified that the synthesized speech has fairly high intelligibility and naturalness, and the mean opinion score (MOS) is about 2.52.
引用
收藏
页码:256 / 259
页数:4
相关论文
共 14 条
  • [1] Atal B. S., 1982, Proceedings of ICASSP 82. IEEE International Conference on Acoustics, Speech and Signal Processing, P614
  • [2] Brady K., 2004, IEEE INT C AC SPEECH, V1, P1
  • [3] Duta CL, 2015, INT SYMP IMAGE SIG, P250, DOI 10.1109/ISPA.2015.7306067
  • [4] Hardwick J. C., 1988, ICASSP 88: 1988 International Conference on Acoustics, Speech, and Signal Processing (Cat. No.88CH2561-9), P374, DOI 10.1109/ICASSP.1988.196595
  • [5] Kleijn W. B., 1988, ICASSP 88: 1988 International Conference on Acoustics, Speech, and Signal Processing (Cat. No.88CH2561-9), P155, DOI 10.1109/ICASSP.1988.196536
  • [6] Kohler MA, 1997, INT CONF ACOUST SPEE, P1587, DOI 10.1109/ICASSP.1997.596256
  • [7] Li Q, 2012, 2012 5TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), P1580, DOI 10.1109/CISP.2012.6469908
  • [8] Li Y, 2015, AER ADV ENG RES, V21, P91
  • [9] McAulay R. J., 1988, MOB SAT C, V1, P503
  • [10] McCree A, 1996, INT CONF ACOUST SPEE, P200, DOI 10.1109/ICASSP.1996.540325