Tree-structured vector quantization for speech recognition

被引:6
|
作者
Barszcz, M [1 ]
Chen, W [1 ]
Boulianne, G [1 ]
Kenny, P [1 ]
机构
[1] INRS Telecommun, Ile Des Soeurs, PQ H3E 1H6, Canada
来源
COMPUTER SPEECH AND LANGUAGE | 2000年 / 14卷 / 03期
关键词
Acoustic signal processing - Markov processes - Mathematical models - Speech analysis - Trees (mathematics) - Vector quantization;
D O I
10.1006/csla.2000.0143
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe some new methods for constructing discrete acoustic phonetic hidden Markov models (HMMs) using tree quantizers having very large numbers (16-64 K) of leaf nodes and tree-structured smoothing techniques. We consider two criteria for constructing tree quantizers (minimum distortion and minimum entropy) and three types of smoothing (mixture smoothing, smoothing by adding 1 and Gaussian smoothing). We show that these methods are capable of achieving recognition accuracies which are generally comparable to those obtained with Gaussian mixture HMMs at a computational cost which is only marginally greater than that of conventional discrete HMMs. We present some evidence of superior performance in situations where the number of HMM distributions to be estimated is small compared with the amount of training data. We also show how our methods can accommodate feature vectors of much higher dimensionality than are traditionally used in speech recognition. (C) 2000 Academic Press.
引用
收藏
页码:227 / 239
页数:13
相关论文
共 50 条
  • [1] OPTIMAL PRUNING FOR TREE-STRUCTURED VECTOR QUANTIZATION
    LIN, JH
    STORER, JA
    COHN, M
    INFORMATION PROCESSING & MANAGEMENT, 1992, 28 (06) : 723 - 733
  • [2] Tree-Structured Vector Quantization for Similarity Queries
    Wu, Hanwei
    Wang, Qiwen
    Flierl, Markus
    2017 DATA COMPRESSION CONFERENCE (DCC), 2017, : 467 - 467
  • [3] Fine granularity scalable speech coding using embedded tree-structured vector quantization
    Djamah, Mouloud
    O'Shaughnessy, Douglas
    SPEECH COMMUNICATION, 2012, 54 (01) : 23 - 39
  • [4] Hierarchical summarization of videos by tree-structured vector quantization
    Benini, S.
    Bianchetti, A.
    Leonardi, R.
    Migliorati, R.
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 969 - +
  • [5] Variable-branch tree-structured vector quantization
    Yang, SB
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (09) : 1275 - 1285
  • [6] Fast encoding algorithms for tree-structured vector quantization
    Lai, JZC
    IMAGE AND VISION COMPUTING, 1997, 15 (11) : 867 - 871
  • [7] Tree-structured product-codebook vector quantization
    Poggi, G
    Ragozini, ARP
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2001, 16 (05) : 421 - 430
  • [8] Image classification using tree-structured discriminant vector quantization
    Ozonat, KM
    CONFERENCE RECORD OF THE THIRTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2003, : 1610 - 1614
  • [9] VARIABLE-BRANCH TREE-STRUCTURED RESIDUAL VECTOR QUANTIZATION
    Yang, Shiueng-Bien
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2008, 8 (01) : 61 - 80
  • [10] Fast texture synthesis using tree-structured vector quantization
    Wei, LY
    Levoy, M
    SIGGRAPH 2000 CONFERENCE PROCEEDINGS, 2000, : 479 - 488