Discriminative Kernel-Based Phoneme Sequence Recognition

被引:0
|
作者
Keshet, Joseph [1 ]
Shalev-Shwartz, Shai [1 ]
Bengio, Samy [2 ]
Singer, Yoram [1 ,3 ]
Chazan, Dan [4 ]
机构
[1] Hebrew Univ Jerusalem, Sch Comp Sci & Engn, Jerusalem, Israel
[2] IDIAP Res Inst, Martigny, Switzerland
[3] Google Inc, Mountain View, CA USA
[4] Technion, Dept Elect Engn, Haifa, Israel
关键词
speech recognition; phoneme recognition; acoustic modeling; support vector machines;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a new method for phoneme sequence recognition given a speech utterance, which is not based on the HMM. In contrast to HMM-based approaches, our method uses a discriminative kernel-based training procedure in which the learning process is tailored to the goal of minimizing the Levenshtein distance between the predicted phoneme sequence and the correct sequence. The phoneme sequence predictor is devised by mapping the speech utterance along with a proposed phoneme sequence to a vector-space endowed with an inner-product that is realized by a Mercer kernel. Building on large margin techniques for predicting whole sequences, we are able to devise a learning algorithm which distills to separating the correct phoneme sequence from all other sequences. We describe an iterative algorithm for learning the phoneme sequence recognizer and further describe an efficient implementation of it. We present initial encouraging experimental results with the TIMIT and compare the proposed method to an HMM-based approach.
引用
收藏
页码:593 / +
页数:2
相关论文
共 50 条
  • [31] Kernel-based nonlinear discriminant analysis for face recognition
    QingShan Liu
    Rui Huang
    HanQing Lu
    SongDe Ma
    Journal of Computer Science and Technology, 2003, 18 : 788 - 795
  • [32] Kernel-based nonlinear subspace method for pattern recognition
    Miwa, Tomoko
    Kako, Jun-Ichi
    Yamamoto, Shinji
    Matsumoto, Mitsuomi
    Tateno, Yukio
    Iinuma, Takeshi
    Matsumoto, Toru
    Systems and Computers in Japan, 2002, 33 (01) : 38 - 52
  • [33] Kernel-based convolution expansion for facial expression recognition
    Mahmoudi, M. Amine
    Chetouani, Aladine
    Boufera, Fatma
    Tabia, Hedi
    PATTERN RECOGNITION LETTERS, 2022, 160 : 128 - 134
  • [34] A Kernel-based sparse representation method for face recognition
    Ningbo Zhu
    Shengtao Li
    Neural Computing and Applications, 2014, 24 : 845 - 852
  • [35] GMM and kernel-based speaker recognition with the ISIP toolkit
    Imbiriba, T
    Klautau, A
    Parihar, N
    Raghavan, S
    Picone, J
    MACHINE LEARNING FOR SIGNAL PROCESSING XIV, 2004, : 371 - 380
  • [36] Face recognition based on kernel discriminative common vectors
    Department of Radio Engineering, Southeast University, Nanjing 210096, China
    不详
    Dianzi Yu Xinxi Xuebao, 2006, 12 (2296-2300):
  • [37] Links Between the Sequence Kernel Association and the Kernel-Based Adaptive Cluster Tests
    Zhang W.
    Epstein M.P.
    Fingerlin T.E.
    Ghosh D.
    Statistics in Biosciences, 2017, 9 (1) : 246 - 258
  • [38] Learning Kernel in Kernel-Based LDA for Face Recognition Under Illumination Variations
    Liu, Xiao-Zhang
    Yuen, Pong C.
    Feng, Guo-Can
    Chen, Wen-Sheng
    IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (12) : 1019 - 1022
  • [39] Kernel self-optimization learning for kernel-based feature extraction and recognition
    Li, Jun-Bao
    Wang, Yun-Heng
    Chu, Shu-Chuan
    Roddick, John F.
    INFORMATION SCIENCES, 2014, 257 : 70 - 80
  • [40] Kernel-Based Approaches for Sequence Modeling: Connections to Neural Methods
    Liang, Kevin J.
    Wang, Guoyin
    Li, Yitong
    Henao, Ricardo
    Carin, Lawrence
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32