DYNAMIC TARGET THEORIES OF VOWEL CLASSIFICATION - EVIDENCE FROM MONOPHTHONGS AND DIPHTHONGS IN AUSTRALIAN ENGLISH

被引:34
作者
HARRINGTON, J
CASSIDY, S
机构
[1] Macquarie University, Sydney
关键词
VOWEL CLASSIFICATION; DIPHTHONGS; AUSTRALIAN ENGLISH; NEURAL NETWORKS;
D O I
10.1177/002383099403700402
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Recent studies on the perception of speech have suggested that vowel identification depends on dynamic cues, rather than a single 'static' spectral slice at the vowel midpoint. The experiments reported in this paper seek both to test the extent to which vowel recognition depends on dynamic information, and to identify the nature of the dynamic cues on which such recognition might depend. Gaussian classification techniques, as well as different kinds of neural network architectures, were used to classify some 3000 vowels in /CVd/ citation-form Australian English words, following training on roughly the same number of vowel tokens produced by different talkers. The first set of experiments shows that when vowels are classified from three spectral slices taken at the vowel margins and midpoint, only diphthongs, but not monophthongs, benefit from the additional spectral information at the vowel margins. A further experiment shows that vowels are no better classified from a time-delay neural network than from the three-slice network in which time is not explicitly represented. At least for the citation-form, Australian English vowels in this study, these results are interpreted as being more consistent with a target, rather than a dynamic, theory of vowel perception.
引用
收藏
页码:357 / 373
页数:17
相关论文
共 35 条
[1]   ON THE SUFFICIENCY OF COMPOUND TARGET SPECIFICATION OF ISOLATED VOWELS AND VOWELS IN BVB SYLLABLES [J].
ANDRUSKI, JE ;
NEAREY, TM .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1992, 91 (01) :390-410
[2]   VOWEL IDENTIFICATION - ORTHOGRAPHIC, PERCEPTUAL, AND ACOUSTIC ASPECTS [J].
ASSMANN, PF ;
NEAREY, TM ;
HOGAN, JT .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1982, 71 (04) :975-989
[3]   THE EFFECT OF COARTICULATION ON THE ROLE OF TRANSITIONS IN VOWEL PERCEPTION [J].
BENGUEREL, AP ;
MCFADDEN, TU .
PHONETICA, 1989, 46 (1-3) :80-96
[4]  
Bernard JRL, 1970, STUFLANGUAGE TYPOLOG, V23, P113, DOI 10.1524/stuf.1970.23.16.113
[5]   ANALYSIS AND RECOGNITION OF ISOLATED PUTONGHUA VOWELS BY KARHUNEN-LOEVE TRANSFORMATION TECHNIQUES [J].
CHAN, LCM ;
CHEUNG, YS .
SPEECH COMMUNICATION, 1986, 5 (3-4) :299-330
[6]   A LOW-LEVEL SPEECH SYNTHESIS BY RULE SYSTEM [J].
CLARK, JE .
JOURNAL OF PHONETICS, 1981, 9 (04) :451-476
[7]  
CROOT K, 1992, 4TH P AUSTR INT C SP, P86
[8]   PERCEIVING VOWELS IN ISOLATION AND IN CONSONANTAL CONTEXT [J].
DIEHL, RL ;
MCCUSKER, SB ;
CHAPMAN, LS .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1981, 69 (01) :239-248
[9]  
FOWLER CA, 1980, J PHONETICS, V8, P113, DOI 10.1016/S0095-4470(19)31446-9
[10]  
FOWLER CA, 1986, J PHONETICS, V14, P3, DOI 10.1016/S0095-4470(19)30607-2