A FAST NEURAL-NET TRAINING ALGORITHM AND ITS APPLICATION TO SPEECH CLASSIFICATION

Cited by: 1

Authors:

GHISELLICRIPPA, T [1]

ELJAROUDI, A [1]

Affiliation:

[1] UNIV PITTSBURGH,DEPT ELECT ENGN,PITTSBURGH,PA 15261

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 1993 / Vol. 6 / No. 6

Keywords:

NEURAL NETWORKS; CLASSIFICATION; LEARNING ALGORITHMS;

DOI:

10.1016/0952-1976(93)90051-X

Chinese Library Classification:

TP [Automation technology, computer technology];

Subject classification code:

0812 ;

Abstract:

This paper describes a fast training algorithm for feedforward neural nets, as applied to a two-layer neural network to classify segments of speech as voiced, unvoiced, or silence. The speech classification method is based on five features computed for each speech segment and used as input to the network. The network weights are trained using a new fast training algorithm which minimizes the total least squares error between the actual output of the network and the corresponding desired output. The iterative training algorithm uses a quasi-Newtonian error-minimization method and employs a positive-definite approximation of the Hessian matrix to quickly converge to a locally optimal set of weights. Convergence is fast, with a local minimum typically reached within ten iterations; in terms of convergence speed, the algorithm compares favorably with other training techniques. When used for voiced-unvoiced-silence classification of speech frames, the network performance compares favorably with current approaches. Moreover, the approach used has the advantage of requiring no assumption of a particular probability distribution for the input features.
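The abstract gives no equations, so the following is only a minimal sketch of the general idea it describes: minimizing a least-squares error for a two-layer network with a Newton-type step that uses a positive-definite Hessian approximation. It swaps in a damped Gauss-Newton (Levenberg-Marquardt style) update, where the Hessian is approximated by J^T J plus a positive multiple of the identity; this is an assumption and not necessarily the authors' algorithm. The function names, tanh hidden units, linear outputs, hidden-layer size, and damping schedule are all hypothetical choices made for illustration.

```python
import numpy as np

def forward(w, X, n_hidden, n_out):
    """Two-layer net (assumed form): tanh hidden layer, linear output layer."""
    n_in = X.shape[1]
    k = n_hidden * (n_in + 1)
    W1 = w[:k].reshape(n_hidden, n_in + 1)      # hidden weights, last column is bias
    W2 = w[k:].reshape(n_out, n_hidden + 1)     # output weights, last column is bias
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    H = np.tanh(Xb @ W1.T)
    Hb = np.hstack([H, np.ones((H.shape[0], 1))])
    return Hb @ W2.T, (Xb, H, Hb, W2)

def residuals_and_jacobian(w, X, T, n_hidden):
    """Residuals r = Y - T (flattened) and the Jacobian of r w.r.t. all weights."""
    n_out = T.shape[1]
    N = X.shape[0]
    Y, (Xb, H, Hb, W2) = forward(w, X, n_hidden, n_out)
    r = (Y - T).ravel()
    dH = 1.0 - H ** 2                           # derivative of tanh
    # dY[n,o]/dW1[h,i] = W2[o,h] * dH[n,h] * Xb[n,i]
    J1 = np.einsum('oh,nh,ni->nohi', W2[:, :n_hidden], dH, Xb)
    # dY[n,o]/dW2[p,j] = delta(o,p) * Hb[n,j]
    J2 = np.einsum('op,nj->nopj', np.eye(n_out), Hb)
    J = np.hstack([J1.reshape(N * n_out, -1), J2.reshape(N * n_out, -1)])
    return r, J

def train(X, T, n_hidden=4, n_iter=10, damping=1e-2, seed=0):
    """Damped Gauss-Newton training; returns weights and per-iteration squared error."""
    rng = np.random.default_rng(seed)
    n_out = T.shape[1]
    n_w = n_hidden * (X.shape[1] + 1) + n_out * (n_hidden + 1)
    w = 0.1 * rng.standard_normal(n_w)
    errors = []
    for _ in range(n_iter):
        r, J = residuals_and_jacobian(w, X, T, n_hidden)
        errors.append(float(r @ r))
        # J^T J is positive semi-definite; adding damping * I makes it positive definite.
        A = J.T @ J + damping * np.eye(n_w)
        step = np.linalg.solve(A, J.T @ r)
        w_new = w - step
        r_new = (forward(w_new, X, n_hidden, n_out)[0] - T).ravel()
        if r_new @ r_new < r @ r:
            w, damping = w_new, damping * 0.5   # accept step, trust the model more
        else:
            damping *= 10.0                     # reject step, move toward gradient descent
    return w, errors
```

To mirror the setting in the abstract, `X` would hold five features per speech frame and `T` a one-hot encoding of the three classes (voiced, unvoiced, silence), so a call would look like `w, errors = train(X, T)`. Because a step is accepted only when it lowers the squared error, the recorded `errors` sequence never increases.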


Pages: 549-557

Page count: 9
