Dysphonic voice classification using wavelet packet transform and artificial neural network

被引:6
作者
Schuck, A [1 ]
Guimaraes, LV [1 ]
Wisbeck, JO [1 ]
机构
[1] Univ Fed Rio Grande Sul, Dept Elect Engn, Porto Alegre, RS, Brazil
来源
PROCEEDINGS OF THE 25TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-4: A NEW BEGINNING FOR HUMAN HEALTH | 2003年 / 25卷
关键词
Artificial Neural Networks; dysphonic voice classification; wavelet packet transform;
D O I
10.1109/IEMBS.2003.1280539
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
In [Schuck Jr. and Parraga, 2002] work, it was demonstrated the viability of the wavelet packet transform (WP) and the best basis algorithm (BBA) as a feature extractor (FE) for a dysphonic voice classification systems. It was shown the better choices of wavelet and cost functions were Symlet 5 and Shannon Entropy. Also, a linear discriminator between normal and dysphonic voices was performed. This work present the use an Artificial Neural Network (ANN) in addition to WP and BBA to perform a non-linear discriminator. The WP with 5 dilatation levels and BBA of the sustained vowel /a/ of 13 normal and 51 dysphonic previously diagnosed subjects were performed. Then the entropy values of each best tree's nodes were used for the classification. A ANN was designed with 3 layers ( 4 neurones in the hidden layer and 2 in the last layer). The non-linear function was hyperbolic tangent. The ANN was trained using backpropagation with a group of 6 normal and 21 dysphonic subjects chose at random from the database. Then, all the 61 subjects were classified. The system had a success rate 84.3%, with 4.6% of false negatives and 10.9% false positives.
引用
收藏
页码:2958 / 2961
页数:4
相关论文
共 29 条
[1]  
ALDROUBI A, 1996, WAVELETS MED BIOL
[2]  
[Anonymous], 1994, VOICE VOICE THERAPY
[3]  
[Anonymous], 1998, Time Frequency and Wavelets in Biomedical Signal Processing
[4]   Underwater target classification using wavelet packets and neural networks [J].
Azimi-Sadjadi, MR ;
Yao, D ;
Huang, Q ;
Dobeck, GJ .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (03) :784-794
[5]  
Baken R.J., 1987, CLIN MEASUREMENT SPE
[6]  
BALMELLI L, 1999, P IEEE INT C IM PROC, V4, P440
[7]  
COIFMAN RR, 1992, IEEE T INFORM THEORY, V38, P1713
[8]  
Colton R.H., 1996, UNDERSTANDING VOICE, V2nd
[9]  
Combes J-M., 1989, WAVELETS, P38, DOI [10.1007/978-3-642-97177-8_3, DOI 10.1007/978-3-642-97177-8_3]
[10]   ORTHONORMAL BASES OF COMPACTLY SUPPORTED WAVELETS [J].
DAUBECHIES, I .
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 1988, 41 (07) :909-996