An Empirical Investigation of the Nonuniqueness in the Acoustic-to-Articulatory Mapping

Cited by: 0
Authors
Qin, Chao [1 ]
Carreira-Perpinan, Miguel A. [1 ]
Affiliations
[1] Oregon Hlth & Sci Univ, Dept Comp Sci & Elect Engn, OGI Sch Sci & Engn, Beaverton, OR 97006 USA
Source
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 | 2007
Keywords
acoustic-to-articulatory mapping; articulatory inversion; X-ray microbeam database;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Articulatory inversion is the problem of recovering the sequence of vocal tract shapes that produce a given acoustic speech signal. Traditionally, its difficulty has been attributed to nonuniqueness of the inverse mapping, where different vocal tract shapes can produce the same acoustics. However, evidence for the nonuniqueness has been restricted to theoretical studies, or to data from atypical speech or very specific sounds. We present a systematic large-scale study using articulatory data for normal speech from the Wisconsin X-ray microbeam database (XRDB). We find that nonuniqueness does exist for some sounds, but that the majority of normal speech is produced with a unique vocal tract shape.
Pages: 2300-2303
Page count: 4