Dimensionality reduction of electropalatographic data using latent variable models

被引:8
作者
Carreira-Perpiñán, MA [1 ]
Renals, S [1 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield S1 4DP, S Yorkshire, England
关键词
electropalatography; articulatory modelling; data reduction methods; dimensionality reduction; latent variable models; finite mixture distributions; mixture models; principal component analysis; factor analysis; mixtures of factor analysers; generalised topographic mapping; mixtures of multivariate Bernoulli distributions;
D O I
10.1016/S0167-6393(98)00059-4
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We consider the problem of obtaining a reduced dimension representation of electropalatographic (EPG) data. An unsupervised learning approach based on latent variable modelling is adopted, in which an underlying lower dimension representation is inferred directly from the data. Several latent variable models are investigated, including factor analysis and the generative topographic mapping (GTM). Experiments were carried out using a subset of the EUR-ACCOR database, and the results indicate that these automatic methods capture important, adaptive structure in the EPG data. Nonlinear latent variable modelling clearly outperforms the investigated linear models in terms of log-likelihood and reconstruction error and suggests a substantially smaller intrinsic dimensionality for the EPG data than that claimed by previous studies. A two-dimensional representation is produced with applications to speech therapy, language learning and articulatory dynamics. (C) 1998 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:259 / 282
页数:24
相关论文
共 35 条
  • [1] [Anonymous], 1979, PROBABILITY MATH STA
  • [2] NEURAL NETWORKS AND PRINCIPAL COMPONENT ANALYSIS - LEARNING FROM EXAMPLES WITHOUT LOCAL MINIMA
    BALDI, P
    HORNIK, K
    [J]. NEURAL NETWORKS, 1989, 2 (01) : 53 - 58
  • [3] Bartholomew DJ., 1987, LATENT VARIABLE MODE
  • [4] Berger JO., 1985, Statistical Decision Theory and Bayesian Analysis, V2, DOI DOI 10.1007/978-1-4757-4286-2
  • [5] GTM: The generative topographic mapping
    Bishop, CM
    Svensen, M
    Williams, CKI
    [J]. NEURAL COMPUTATION, 1998, 10 (01) : 215 - 234
  • [6] AUTO-ASSOCIATION BY MULTILAYER PERCEPTRONS AND SINGULAR VALUE DECOMPOSITION
    BOURLARD, H
    KAMP, Y
    [J]. BIOLOGICAL CYBERNETICS, 1988, 59 (4-5) : 291 - 294
  • [7] USING REGIONS AND INDEXES IN EPG DATA REDUCTION
    BYRD, D
    FLEMMING, E
    MUELLER, CA
    TAN, CC
    [J]. JOURNAL OF SPEECH AND HEARING RESEARCH, 1995, 38 (04): : 821 - 827
  • [8] CARREIRAPERPINA.MA, 1998, UNPUB NEURAL COMPUTA
  • [9] CARREIRAPERPINA.MA, 1997, CS9709 U SHEFF DEP C
  • [10] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38