Discriminative Kernel-Based Phoneme Sequence Recognition

Cited by: 0

Authors:

Keshet, Joseph [1]

Shalev-Shwartz, Shai [1]

Bengio, Samy [2]

Singer, Yoram [1,3]

Chazan, Dan [4]

Affiliations:

[1] Hebrew Univ Jerusalem, Sch Comp Sci & Engn, Jerusalem, Israel

[2] IDIAP Res Inst, Martigny, Switzerland

[3] Google Inc, Mountain View, CA USA

[4] Technion, Dept Elect Engn, Haifa, Israel

Source:

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006

Keywords:

speech recognition; phoneme recognition; acoustic modeling; support vector machines;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We describe a new method for phoneme sequence recognition given a speech utterance, which is not based on the HMM. In contrast to HMM-based approaches, our method uses a discriminative kernel-based training procedure in which the learning process is tailored to the goal of minimizing the Levenshtein distance between the predicted phoneme sequence and the correct sequence. The phoneme sequence predictor is devised by mapping the speech utterance along with a proposed phoneme sequence to a vector-space endowed with an inner-product that is realized by a Mercer kernel. Building on large margin techniques for predicting whole sequences, we are able to devise a learning algorithm which distills to separating the correct phoneme sequence from all other sequences. We describe an iterative algorithm for learning the phoneme sequence recognizer and further describe an efficient implementation of it. We present initial encouraging experimental results on the TIMIT corpus and compare the proposed method to an HMM-based approach.
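The abstract names the Levenshtein distance between the predicted and the correct phoneme sequence as the quantity that training aims to minimize. The following is a minimal sketch of that distance only, using the standard dynamic-programming recurrence; it is not the authors' learning algorithm, and the function name and the phoneme labels in the usage note are illustrative assumptions.

```python
def levenshtein(predicted, correct):
    """Edit distance between two phoneme sequences given as lists of labels.

    Illustrative helper (hypothetical name); counts the minimum number of
    insertions, deletions, and substitutions turning `predicted` into `correct`.
    """
    # prev[j] = distance between the empty prefix of `predicted`
    # and the first j labels of `correct`
    prev = list(range(len(correct) + 1))
    for i, p in enumerate(predicted, start=1):
        curr = [i]  # distance from first i predicted labels to empty sequence
        for j, c in enumerate(correct, start=1):
            cost = 0 if p == c else 1
            curr.append(min(
                prev[j] + 1,         # delete p from the prediction
                curr[j - 1] + 1,     # insert c into the prediction
                prev[j - 1] + cost,  # match or substitute p with c
            ))
        prev = curr
    return prev[-1]
```

For example, with the made-up label sequences `["sil", "k", "ae", "t"]` and `["sil", "k", "ah", "t", "s"]`, the function counts one substitution and one insertion.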

Citation

Pages: 593 / +

Page count: 2
