Time-Varying Feature Selection and Classification of Unvoiced Stop Consonants

被引:13
作者
Nathan, Krishna S. [1 ]
Silverman, Harvey F. [1 ]
机构
[1] Brown Univ, Div Engn, LEMS, Providence, RI 02912 USA
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1994年 / 2卷 / 03期
关键词
D O I
10.1109/89.294353
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A feature set that captures the dynamics of formant transitions prior to closure in a VCV environment is used to characterize and classify the unvoiced stop consonants. The feature set is derived from a time-varying, data-selective model for the speech signal. Its performance is compared with that of comparable formant data from a standard delta-LPC-based madel. The different feature sets are evaluated on a database composed of eight talkers. A 40% reduction in classification error rate is obtained by means of the time-varying model. The performance of three different classifiers is discussed. A novel adaptive algorithm, termed learning vector classifier (LVC) is compared with standard K-means and LVQ2 classifiers. LVC is a supervised learning classifier that improves performance by increasing the resolution of the decision boundaries. Error rates obtained for the three-way (p, t, and k) classification task using LVC and the time-varying analysis are comparable to that of techniques that make use of additional discriminating information contained in the burst. Further improvements are expected when an expanded time-varying feature set is utilized, coupled with information from the burst.
引用
收藏
页码:395 / 405
页数:11
相关论文
共 24 条
  • [1] BLUMSTEIN S, 1979, J ACOUST SOC AM, V66
  • [2] Conover W.J., 1999, PRACTICAL NONPARAMET, P428, DOI DOI 10.1002/BIMJ.19730150311
  • [3] CRANEN B, 1983, SIGNAL PROCESS, V2, P343
  • [4] ACOUSTIC LOCI AND TRANSITIONAL CUES FOR CONSONANTS
    DELATTRE, PC
    LIBERMAN, AM
    COOPER, FS
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1955, 27 (04) : 769 - 773
  • [5] COMPUTER RECOGNITION OF PLOSIVE SOUNDS USING CONTEXTUAL INFORMATION
    DEMICHELIS, P
    DEMORI, R
    LAFACE, P
    OKANE, M
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1983, 31 (02): : 359 - 377
  • [6] Duda R. O., 1973, PATTERN CLASSIFICATI, V3
  • [7] DISTANCE MEASURES FOR SPEECH PROCESSING
    GRAY, AH
    MARKEL, JD
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (05): : 380 - 391
  • [8] MINIMUM PREDICTION RESIDUAL PRINCIPLE APPLIED TO SPEECH RECOGNITION
    ITAKURA, F
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1975, AS23 (01): : 67 - 72
  • [9] THE POLES AND ZEROS OF A LINEAR TIME-VARYING SYSTEM
    KAMEN, EW
    [J]. LINEAR ALGEBRA AND ITS APPLICATIONS, 1988, 98 : 263 - 289
  • [10] KOHONEN T, 1988, P 1988 IEEE INT C NE, pI61