Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture

被引:0
|
作者
Cernak, Milos [1 ]
Na, Xingyu [1 ,2 ]
Garner, Philip N. [1 ]
机构
[1] Idiap Reseach Inst, Martigny, Switzerland
[2] Beijing Inst Technol, Beijing, Peoples R China
关键词
speech coding; pitch analysis; speech synthesis; QUANTIZATION; FEATURES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current HMM-based low bit rate speech coding systems work with phonetic vocoders. Pitch contour coding (on frame or phoneme level) is usually fairly orthogonal to other speech coding parameters. We make an assumption in our work that the speech signal contains supra-segmental cues. Hence, we present encoding of the pitch on the syllable level, used in the framework of a recognition/synthesis speech coder with phonetic vocoder. The results imply that high accuracy pitch contour reconstruction with negligible speech quality degradation is possible. The proposed pitch encoding technique operates on 30-35 bits per second.
引用
收藏
页码:3416 / 3419
页数:4
相关论文
共 50 条
  • [1] Syllable-based automatic Arabic speech recognition
    Azmi, Mohamed Mostafa
    Tolba, Hesham
    Mahdy, Sherif
    Fashal, Mervat
    PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, ROBOTICS AND AUTOMATION: ADVANCED TOPICS ON SIGNAL PROCESSING, ROBOTICS AND AUTOMATION, 2008, : 246 - +
  • [2] Syllable-Based Speech Recognition Using EMG
    Lopez-Larraz, Eduardo
    Mozos, Oscar M.
    Antelis, Javier M.
    Minguez, Javier
    2010 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2010, : 4699 - 4702
  • [3] Syllable-based Myanmar Language Model for Speech Recognition
    Soe, Wunna
    Thein, Yadana
    2015 IEEE/ACIS 14TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2015, : 291 - 296
  • [4] Pitch quantization in low bit-rate speech coding
    Eriksson, T
    Kang, HG
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 489 - 492
  • [5] Syllable-based large vocabulary continuous speech recognition
    Ganapathiraju, A
    Hamaker, J
    Picone, J
    Ordowski, M
    Doddington, GR
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (04): : 358 - 366
  • [6] Research on Syllable-Based Language Model in Malay Speech Recognition
    Wei, Xiangfeng
    Zhang, Quan
    Yuan, Yi
    2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 150 - 155
  • [7] Syllable-Based Concatenative Speech Synthesis for Marathi Language
    Ghate, Pravin M.
    Shirbahadurkar, Suresh D.
    INFORMATION AND COMMUNICATION TECHNOLOGY FOR COMPETITIVE STRATEGIES, 2019, 40 : 615 - 624
  • [8] A syllable-based pseudo-articulatory approach to speech recognition
    Zhang, L
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 78 - 83
  • [9] Syllable-based automatic Arabic speech recognition in noisy enviroment
    Azmi, Mohamed M.
    Tolba, Hesham
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1436 - 1441
  • [10] SYLLABLE-BASED SPEECH RECOGNITION USING ELECTROMYOGRAPHY AND DECISION SET CLASSIFIER
    Topalovic, Marko
    Damnjanovic, Dorde
    Peulic, Aleksandar
    Blagojevic, Milan
    Filipovic, Nenad
    BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2015, 27 (02):