The fractal properties of vocal sounds and their application in the speech recognition model

被引:27
|
作者
Sabanal, S
Nakagawa, M
机构
[1] Department of Electrical Engineering, Faculty of Engineering, Nagaoka University of Technology, Nagaoka, Niigata 940-21
关键词
D O I
10.1016/S0960-0779(96)00043-4
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In this work, we shall examine the fractal properties of simple vocal sounds such as Japanese vowels by evaluating the fractal dimension related to the self-affine property. We shall also examine the existence of chaos in the attractors reconstructed from the vocal sound waveforms by evaluating the Lyapunov exponents. The reconstructed attractors are also examined for multifractal properties. To characterize the fractal properties of complicated vocal sounds, such as speech utterances composed of several vowels, phonemes, etc., we shall propose the time-dependent fractal dimensions (TDFDs), where the fractal dimensions are evaluated based on the self-affine:property, and the time-dependent multifractal dimensions (TDMFDs). We shall then use these fractal properties in a speech recognition model to examine if our method is able to characterize complicated vocal sounds effectively. For comparison, we shall utilize the running power spectrum (RPS) as a recognition parameter. It was found that utilizing the fractal properties of vocal sounds as recognition parameters gives a high recognition rate, showing that complicated vocal sounds can be effectively characterized by their fractal properties. Copyright (C) 1996 Elsevier Science Ltd.
引用
收藏
页码:1825 / 1843
页数:19
相关论文
共 50 条
  • [22] Effects of nonlinear frequency compression on the acoustic properties and recognition of speech sounds in Mandarin Chinese
    Yang, Jing
    Qian, Jinyu
    Chen, Xueqing
    Kuehnel, Volker
    Rehmann, Julia
    von Buol, Andreas
    Li, Yulin
    Ren, Cuncun
    Liu, Bo
    Xu, Li
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 143 (03): : 1578 - 1590
  • [23] Setting the Stage for Speech Production: Infants Prefer Listening to Speech Sounds With Infant Vocal Resonances
    Polka, Linda
    Masapollo, Matthew
    Menard, Lucie
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2022, 65 (01): : 109 - 120
  • [24] The application of the additive model in the feature extraction of speech recognition
    Xi, WB
    Fang, L
    ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 753 - 756
  • [25] Environmental sounds recognition system using the speech recognition system techniques
    Uribe, OA
    Meana, HMP
    Miyatake, MN
    2005 2nd International Conference on Electrical & Electronics Engineering (ICEEE), 2005, : 13 - 16
  • [26] Contemporary Speech/Speaker Recognition with Speech from Impaired Vocal Apparatus
    Nidhyananthan, S. Selva
    Selvakumari, R. Shantha
    Shenbagalakshmi, V.
    2014 INTERNATIONAL CONFERENCE ON COMMUNICATION AND NETWORK TECHNOLOGIES (ICCNT), 2014, : 198 - 202
  • [27] Detecting breathing sounds in realistic Japanese telephone conversations and its application to automatic speech recognition
    Fukuda, Takashi
    Ichikawa, Osamu
    Nishimura, Masafumi
    SPEECH COMMUNICATION, 2018, 98 : 95 - 103
  • [28] Sounds and speech: Individual differences in unfamiliar voice recognition
    Sunilkumar, Dolly
    Kelly, Steve W. W.
    Stevenage, Sarah V. V.
    Rankine, Dillon
    Robertson, David J. J.
    APPLIED COGNITIVE PSYCHOLOGY, 2023, 37 (03) : 507 - 519
  • [29] Development of speech recognition system for remote vocal music teaching based on Markov model
    Fumei Xu
    Yu Xia
    Soft Computing, 2023, 27 : 10237 - 10248
  • [30] The effect of prior visual information on recognition of speech and sounds
    Noppeney, Uta
    Josephs, Oliver
    Hocking, Julia
    Price, Cathy J.
    Friston, Karl J.
    CEREBRAL CORTEX, 2008, 18 (03) : 598 - 609