A Fractal-based Approach for Speech Segmentation

Cited by: 8
Authors
Fantinato, Paulo Cesar [1]
Guido, Rodrigo Capobianco
Chen, Shi-Huang
Silveira Santos, Bruno Leonardo
Vieira, Lucimar Sasso
Barbon Junior, Sylvio
Rodrigues, Luciene Cavalcanti
Sanchez, Fabricio Lopes
Lemos Escola, Joao Paulo
Souza, Leonardo Mendes
Maciel, Carlos Dias
Scalassara, Paulo Rogerio
Pereira, Jose Carlos
Affiliations
[1] Univ Sao Paulo, Inst Phys Sao Carlos, SpeechLab FFI IFSC USP, Ave Trabalhador Sao Carlense 400, BR-13566590 Sao Carlos, SP, Brazil
Source
ISM: 2008 IEEE International Symposium on Multimedia | 2008
Funding
São Paulo Research Foundation (FAPESP), Brazil
DOI
10.1109/ISM.2008.123
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Fractal analysis has been applied successfully to digital speech processing, particularly to word and phoneme segmentation, which is one of the fundamental steps in automatic speech recognition systems. A practical use of fractal analysis for this purpose should satisfy two requirements: low computational cost, to allow real-time use, and accurate results, so that the segmentation is satisfactory and the correct data are sent to the classifier. To meet these two requirements, this work proposes a speech segmentation technique based on the fractal dimension, which is obtained with the discrete wavelet transform and therefore avoids the use of 1/k pre-filtering. Many families of wavelets are presented and compared, and the results confirm the efficacy of the proposed method.
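The abstract does not give the estimator itself, so the following is only a minimal sketch of one common way to obtain a fractal dimension from a discrete wavelet transform, not the authors' procedure. It assumes a Haar wavelet, a variance-scaling model in which the mean energy of the detail coefficients at level j grows roughly as 2^(j(2H+1)), and the relation D = 2 - H for a one-dimensional signal. The function names `haar_dwt_details` and `fractal_dimension` and the default of four decomposition levels are illustrative choices.

```python
import math
import random


def haar_dwt_details(signal, levels):
    """Return Haar detail coefficients per level (index 0 is the finest level)."""
    approx = list(signal)
    details = []
    scale = math.sqrt(2.0)
    for _ in range(levels):
        if len(approx) < 2:
            break
        n = len(approx) - len(approx) % 2  # ignore a trailing odd sample
        details.append([(approx[i] - approx[i + 1]) / scale for i in range(0, n, 2)])
        approx = [(approx[i] + approx[i + 1]) / scale for i in range(0, n, 2)]
    return details


def fractal_dimension(frame, levels=4):
    """Estimate D = 2 - H from the slope of log2(detail energy) versus level.

    Assumed model: mean detail energy at level j scales as 2**(j * (2H + 1)).
    Returns None when fewer than two levels carry energy (e.g. silence).
    """
    xs, ys = [], []
    for j, coeffs in enumerate(haar_dwt_details(frame, levels), start=1):
        energy = sum(c * c for c in coeffs) / len(coeffs)
        if energy > 0.0:
            xs.append(j)
            ys.append(math.log2(energy))
    if len(xs) < 2:
        return None
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Ordinary least-squares slope of log2(energy) against level.
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    hurst = min(max((slope - 1.0) / 2.0, 0.0), 1.0)  # clamp H to [0, 1]
    return 2.0 - hurst


if __name__ == "__main__":
    rng = random.Random(0)
    noise = [rng.gauss(0.0, 1.0) for _ in range(4096)]
    walk, level = [], 0.0
    for _ in range(4096):
        level += rng.gauss(0.0, 1.0)
        walk.append(level)
    # White noise should look rougher (larger D) than a random walk.
    print(fractal_dimension(noise), fractal_dimension(walk))
```

In a segmentation setting, a function like this would presumably be applied to short overlapping frames, with candidate word or phoneme boundaries placed where the per-frame dimension changes sharply. The frame length, overlap, and decision rule are not stated in the abstract and would have to be taken from the paper.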
Pages: 551 / +
Number of pages: 2