A very low bit rate speech coder using HMM-based speech recognition/synthesis techniques

Cited by: 0

Authors:

Tokuda, K [1]

Masuko, T [1]

Hiroi, J [1]

Kobayashi, T [1]

Kitamura, T [1]

Affiliations:

[1] Nagoya Inst Technol, Dept Comp Sci, Nagoya, Aichi 466, Japan

Source:

PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6 | 1998

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

This paper presents a very low bit rate speech coder based on hidden Markov models (HMMs). The encoder performs phoneme recognition and transmits phoneme indexes, state durations, and pitch information to the decoder. In the decoder, phoneme HMMs are concatenated according to the phoneme indexes, and a sequence of mel-cepstral coefficient vectors is generated from the concatenated HMM using an ML-based (maximum likelihood) speech parameter generation technique. Finally, synthetic speech is obtained by exciting the MLSA (Mel Log Spectrum Approximation) filter, whose coefficients are given by the mel-cepstral coefficients, according to the pitch information. A subjective listening test shows that the performance of the proposed coder at about 150 bit/s (for test data including 26% silence) is comparable to that of a VQ-based vocoder at 400 bit/s (= 8 bit/frame × 50 frame/s), with pitch left unquantized in both coders.
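The bit-rate comparison in the abstract can be checked with a few lines of arithmetic. The sketch below uses only the figures quoted above (8 bit/frame, 50 frame/s, about 150 bit/s); the function and variable names are illustrative and do not come from the paper, and pitch bits are excluded, as in the abstract's comparison.

```python
def frame_based_bit_rate(bits_per_frame: int, frames_per_second: int) -> int:
    """Bit rate in bit/s of a coder that sends a fixed number of bits per frame."""
    return bits_per_frame * frames_per_second


# Figures quoted in the abstract (names are illustrative, not from the paper).
VQ_BITS_PER_FRAME = 8
VQ_FRAMES_PER_SECOND = 50
HMM_CODER_BIT_RATE = 150  # "about 150 bit/s", test data with 26% silence

# Baseline VQ-based vocoder: 8 bit/frame x 50 frame/s.
vq_bit_rate = frame_based_bit_rate(VQ_BITS_PER_FRAME, VQ_FRAMES_PER_SECOND)

# Fraction of the baseline rate used by the HMM-based coder.
rate_ratio = HMM_CODER_BIT_RATE / vq_bit_rate

print(f"VQ-based vocoder: {vq_bit_rate} bit/s")
print(f"HMM-based coder uses about {rate_ratio:.1%} of that rate")
```

Under these figures the HMM-based coder needs a little over a third of the baseline rate, which is the saving the listening test is said to support.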

Pages: 609-612

Page count: 4

50 items in total

[1] An HMM-based speaker adaptable very low bit rate speech coder
Peng, H
Zhu, J
CHINESE JOURNAL OF ELECTRONICS, 2000, 9 (02): : 135 - 139
[2] A very low bit rate speech coder based on a recognition/synthesis paradigm
Lee, KS
Cox, RV
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (05): : 482 - 491
[3] A SPEAKER ADAPTABLE VERY LOW BIT RATE SPEECH CODER BASED ON HMM
彭煳
朱杰
Journal of Shanghai Jiaotong University, 2000, (02) : 1 - 5
[4] Improving the performance of HMM-based very low bit rate speech coding
Hoshiya, T
Sako, S
Zen, H
Tokuda, K
Masuko, T
Kobayashi, T
Kitamura, T
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 800 - 803
[5] TTS based very low bit rate speech coder
Lee, Ki-Seung
Cox, Richard V.
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 181 - 184
[6] TTS based very low bit rate speech coder
Lee, KS
Cox, RV
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 181 - 184
[7] An HMM-based speech recognition IC
Han, W
Hon, KW
Chan, CF
Lee, T
Choy, CS
Pun, KP
Ching, PC
PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II: COMMUNICATIONS-MULTIMEDIA SYSTEMS & APPLICATIONS, 2003, : 744 - 747
[8] Very low bit rate speech coding based on HMM with speaker adaptation
Masuko, Takashi
Kobayashi, Takao
Tokuda, Keiichi
Systems and Computers in Japan, 2006, 37 (02): : 67 - 78
[9] HMM-Based Speech Recognition Using Adaptive Framing
Goh, Yeh-Huann
Raveendran, Paramesran
TENCON 2009 - 2009 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2009, : 226 - 230
[10] Prediction method of speech recognition performance based on HMM-based speech synthesis technique
Terashima R.
Yoshimura T.
Wakita T.
Tokuda K.
Kitamura T.
IEEJ Transactions on Electronics, Information and Systems, 2010, 130 (04) : 557 - 564+3