A Fractal-based Approach for Speech Segmentation

被引：8

作者：

Fantinato, Paulo Cesar ^{[1
]}

Guido, Rodrigo Capobianco

Chen, Shi-Huang

Silveira Santos, Bruno Leonardo

Vieira, Lucimar Sasso

Barbon Junior, Sylvio

Rodrigues, Luciene Cavalcanti

Sanchez, Fabricio Lopes

Lemos Escola, Joao Paulo

Souza, Leonardo Mendes

Maciel, Carlos Dias

Scalassara, Paulo Rogerio

Pereira, Jose Carlos

机构：

[1] Univ Sao Paulo, Inst Phys Sao Carlos, SpeechLab FFI IFSC USP, Ave Trabalhador Sao Carlense 400, BR-13566590 Sao Carlos, SP, Brazil

来源：

ISM: 2008 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA | 2008年

基金：

巴西圣保罗研究基金会;

关键词：

D O I：

10.1109/ISM.2008.123

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Nowadays, fractal analysis has been successfully applied to digital speech processing, particularly for word and phoneme segmentation, which represents one of the fundamental steps in automatic speech recognition systems. The practical use of fractal analysis for this purpose should match two principles: low computational cost, to allow the use in real-time, and accuracy in the results, in order to produce a satisfactoly segmentation, sending the correct data to the classifier Aiming at meeting these two requirements, this work proposes a technique for speech segmentation based on the fractal dimension, which is obtained by using the discrete wavelet transform that avoids the use of 1/k pre-filtering. Many families of wavelets are presented and compared, and the results assure the efficacy of the proposed method.

引用

页码：551 / +

页数：2

共 8 条

[1]

Addison P.S., 2002, The illustrated wavelet transform handbook: Introductory theory and applications in science, engineering, medicine, and nance

[2] The use of articulator motion information in automatic speech segmentation [J].

Akdemir, Eren ;

Ciloglu, Tolga .

SPEECH COMMUNICATION, 2008, 50 (07) :594-604

[3]

Al-Akaidi M., 2004, Fractal speech processing

[4]

BARBON S, 2007, J COMPUTATIONAL APPL

[5]

Coleman H, 2005, Language and development: Africa and beyond

[6]

Deng L., 2003, Speech processing: a dynamic and optimizationoriented approach

[7] Chinese word segmentation as morpheme-based lexical chunking [J].

Fu, Guohong ;

Kit, Chunyu ;

Webster, Jonathan J. .

INFORMATION SCIENCES, 2008, 178 (09) :2282-2296

[8] Acoustic speech unit segmentation for concatenative synthesis [J].

Torres, H. M. ;

Gurlekian, J. A. .

COMPUTER SPEECH AND LANGUAGE, 2008, 22 (02) :196-206

← 1 →