Time-Varying Feature Selection and Classification of Unvoiced Stop Consonants

被引：13

作者：

Nathan, Krishna S. ^{[1
]}

Silverman, Harvey F. ^{[1
]}

机构：

[1] Brown Univ, Div Engn, LEMS, Providence, RI 02912 USA

来源：

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1994年 / 2卷 / 03期

关键词：

D O I：

10.1109/89.294353

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

A feature set that captures the dynamics of formant transitions prior to closure in a VCV environment is used to characterize and classify the unvoiced stop consonants. The feature set is derived from a time-varying, data-selective model for the speech signal. Its performance is compared with that of comparable formant data from a standard delta-LPC-based madel. The different feature sets are evaluated on a database composed of eight talkers. A 40% reduction in classification error rate is obtained by means of the time-varying model. The performance of three different classifiers is discussed. A novel adaptive algorithm, termed learning vector classifier (LVC) is compared with standard K-means and LVQ2 classifiers. LVC is a supervised learning classifier that improves performance by increasing the resolution of the decision boundaries. Error rates obtained for the three-way (p, t, and k) classification task using LVC and the time-varying analysis are comparable to that of techniques that make use of additional discriminating information contained in the burst. Further improvements are expected when an expanded time-varying feature set is utilized, coupled with information from the burst.

引用

页码：395 / 405

页数：11

共 24 条

[1] BLUMSTEIN S, 1979, J ACOUST SOC AM, V66
[2] Conover W.J., 1999, PRACTICAL NONPARAMET, P428, DOI DOI 10.1002/BIMJ.19730150311
[3] CRANEN B, 1983, SIGNAL PROCESS, V2, P343
[4] ACOUSTIC LOCI AND TRANSITIONAL CUES FOR CONSONANTS
DELATTRE, PC
LIBERMAN, AM
COOPER, FS
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1955, 27 (04) : 769 - 773
[5] COMPUTER RECOGNITION OF PLOSIVE SOUNDS USING CONTEXTUAL INFORMATION
DEMICHELIS, P
DEMORI, R
LAFACE, P
OKANE, M
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1983, 31 (02): : 359 - 377
[6] Duda R. O., 1973, PATTERN CLASSIFICATI, V3
[7] DISTANCE MEASURES FOR SPEECH PROCESSING
GRAY, AH
MARKEL, JD
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (05): : 380 - 391
[8] MINIMUM PREDICTION RESIDUAL PRINCIPLE APPLIED TO SPEECH RECOGNITION
ITAKURA, F
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1975, AS23 (01): : 67 - 72
[9] THE POLES AND ZEROS OF A LINEAR TIME-VARYING SYSTEM
KAMEN, EW
[J]. LINEAR ALGEBRA AND ITS APPLICATIONS, 1988, 98 : 263 - 289
[10] KOHONEN T, 1988, P 1988 IEEE INT C NE, pI61

← 1 2 3 →