Speech recognition using fractals

被引：6

作者：

Bohez, ELJ ^{[1
]}

Senevirathne, TR ^{[1
]}

机构：

[1] Asian Inst Technol, Dept Mfg Syst Engn, Klongluang 12120, Pathumthani, Thailand

来源：

PATTERN RECOGNITION | 2001年 / 34卷 / 11期

关键词：

speech recognition; fractals; iterated function systems (IFS); clustering;

D O I：

10.1016/S0031-3203(00)00137-0

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The use of fractal theory for speech recognition is investigated. First, the possibility of using iterated function systems for speech recognition is discussed. Next, the use of fractal dimension for phoneme recognition and word segmentation is presented. A phoneme recognition method is presented based on fractal theory. Fractal dimension (FD) and iterated function system (IFS) parameters are investigated for word segmentation. The IFS matrices and the eigenvalues of the covariance matrix are proposed for phoneme recognition. (C) 2001 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.

引用

页码：2227 / 2243

页数：17

共 50 条

[41] SPEECH RECOGNITION WITH NO SPEECH OR WITH NOISY SPEECH
Krishna, Gautam
Co Tran
Yu, Jianguo
Tewfik, Ahmed H.
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1090 - 1094
[42] Speech enhancement using PCA and variance of the reconstruction error in distributed speech recognition
Abolhassani, Amin Haji
Selouani, Sid-Ahmed
O'Shaughnessy, Douglas
2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 19 - +
[43] Speaker Adaptive Model for Hindi Speech using Kaldi Speech Recognition toolkit
Upadhyaya, Prashant
Mittal, Sanjeev Kumar
Varshney, Yash Vardhan
Farooq, Omar
Abidi, Musiur Raza
PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES (IMPACT), 2017, : 222 - 226
[44] SPEECH SUPPORT SYSTEM USING BODY-CONDUCTED SPEECH RECOGNITION FOR DISORDERS
Nakayama, Masashi
Ishimitsu, Shunsuke
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (11B): : 4255 - 4265
[45] Temporal Speech Normalization Methods Comparison in Speech Recognition Using Neural Network
Salam, Md Sah Bin Hj
Mohamad, Dzulkifli
Salleh, Sheikh Hussain Shaikh
2009 INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION, 2009, : 442 - 447
[46] Robust Speech Recognition using Generalized Distillation Framework
Markov, Konstantin
Matsui, Tomoko
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2364 - 2368
[47] Investigation Amazigh speech recognition using CMU tools
Satori, Hassan
ElHaoussi, Fatima
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (03) : 235 - 243
[48] Depression Detection in Arabic Using Speech Language Recognition
Alsharif, Zainab
Elhag, Salma
Alfakeh, Sulhi
2022 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MACHINE LEARNING APPLICATIONS (CDMA 2022), 2022, : 61 - 66
[49] Stressed speech recognition using a warped frequency scale
Gharavian, D.
Ahadi, S. M.
IEICE ELECTRONICS EXPRESS, 2008, 5 (06) : 187 - 191
[50] Speech recognition using randomized relational decision trees
Amit, Y
Murua, A
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (04): : 333 - 341

← 1 2 3 4 5 →