Speech recognition using fractals

Cited by: 6
Authors
Bohez, ELJ [1]
Senevirathne, TR [1]
Affiliations
[1] Asian Inst Technol, Dept Mfg Syst Engn, Klongluang 12120, Pathumthani, Thailand
Keywords
speech recognition; fractals; iterated function systems (IFS); clustering
DOI
10.1016/S0031-3203(00)00137-0
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The use of fractal theory for speech recognition is investigated. First, the possibility of using iterated function systems for speech recognition is discussed. Next, the use of fractal dimension for phoneme recognition and word segmentation is presented. A phoneme recognition method is presented based on fractal theory. Fractal dimension (FD) and iterated function system (IFS) parameters are investigated for word segmentation. The IFS matrices and the eigenvalues of the covariance matrix are proposed for phoneme recognition. (C) 2001 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
Pages: 2227-2243
Number of pages: 17
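
The abstract mentions fractal dimension (FD) as a feature for phoneme recognition and word segmentation but does not say how it is estimated. As a rough illustration only, the sketch below computes a frame-wise FD with the Katz estimator. The choice of estimator, the function names `katz_fd` and `framewise_fd`, and the frame parameters are all assumptions for this example and are not taken from the paper.

```python
import math


def katz_fd(samples):
    """Katz fractal dimension of a waveform (an assumed estimator, not the paper's).

    Sample i is treated as the planar point (i, samples[i]).
    """
    if len(samples) < 3:
        raise ValueError("need at least three samples")
    first = samples[0]
    # Total curve length: sum of distances between consecutive points.
    length = sum(math.hypot(1.0, b - a) for a, b in zip(samples, samples[1:]))
    # Planar extent: largest distance from the first point to any other point.
    extent = max(math.hypot(i, s - first) for i, s in enumerate(samples))
    steps = len(samples) - 1
    # Note: for extremely spiky inputs the denominator can approach zero.
    return math.log10(steps) / (math.log10(steps) + math.log10(extent / length))


def framewise_fd(samples, frame_len, hop):
    """FD of each full frame; frame_len and hop are illustrative parameters."""
    last_start = len(samples) - frame_len
    return [katz_fd(samples[s:s + frame_len]) for s in range(0, last_start + 1, hop)]


if __name__ == "__main__":
    ramp = [0.5 * i for i in range(100)]      # smooth segment
    zigzag = [i % 2 for i in range(100)]      # rough segment
    print(framewise_fd(ramp + zigzag, frame_len=50, hop=50))
```

A straight ramp gives an FD of about 1, and a rougher waveform gives a larger value, so a jump in the frame-wise FD marks a change in signal character. Whether the paper detects segment boundaries this way is not stated in the abstract.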