Measurement of temporal changes in vocal tract area function from 3D cine-MRI data

被引:79
作者
Takemoto, H
Honda, K
Masaki, S
Shimada, Y
Fujimoto, I
机构
[1] ATR, Human Informat Sci Labs, Kyoto 6190288, Japan
[2] ATR, Brain Act Imaging Ctr, Kyoto 6190288, Japan
关键词
D O I
10.1121/1.2151823
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A 3D cine-MRI technique was developed based on a synchronized sampling method [Masaki et al., J. Acoust. Soc. Jpn. E 20, 375-379 (1999)] to measure the temporal changes in the vocal tract area function during a short utterance /aiueo/ in Japanese. A time series of head-neck volumes was obtained after 640 repetitions of the utterance produced by a male speaker, from which area functions were extracted frame-by-frame. A region-based analysis showed that the volumes of the front and back cavities tend to change reciprocally and that the areas near the larynx and posterior edge of the hard palate were almost constant throughout the utterance. The lower four formants were calculated from all the area functions and compared with those of natural speech sounds. The mean absolute percent error between calculated and measured formants among all the frames was 4.5%. The comparison of vocal tract shapes for the five vowels with those from the static MRI method suggested a problem of MRI observation of the vocal tract: data from static MRI tend to result in a deviation from natural vocal tract geometry because of the gravity effect. (c) 2006 Acoustical Society of America.
引用
收藏
页码:1037 / 1049
页数:13
相关论文
共 36 条
[21]   An approach to real-time magnetic resonance imaging for speech production [J].
Narayanan, S ;
Nayak, K ;
Lee, SB ;
Byrd, D .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 115 (04) :1771-1776
[22]   Geometry, kinematics, and acoustics of Tamil liquid consonants [J].
Narayanan, S ;
Byrd, D ;
Kaun, A .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (04) :1993-2007
[23]   AN ARTICULATORY STUDY OF FRICATIVE CONSONANTS USING MAGNETIC-RESONANCE-IMAGING [J].
NARAYANAN, SS ;
ALWAN, AA ;
HAKER, K .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1995, 98 (03) :1325-1347
[24]   Toward articulatory-acoustic models for liquid approximants based on MRI and EPG data .1. The laterals [J].
Narayanan, SS ;
Alwan, AA ;
Haker, K .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 101 (02) :1064-1077
[25]  
Pruessmann KP, 1999, MAGNET RESON MED, V42, P952, DOI 10.1002/(SICI)1522-2594(199911)42:5<952::AID-MRM16>3.0.CO
[26]  
2-S
[27]   QUANTIFICATION OF CARDIAC-FUNCTION BY CONVENTIONAL AND CINE MAGNETIC-RESONANCE-IMAGING [J].
SECHTEM, U ;
PFLUGFELDER, P ;
HIGGINS, CB .
CARDIOVASCULAR AND INTERVENTIONAL RADIOLOGY, 1987, 10 (06) :365-373
[28]  
SHIRAI K, 1976, ELECT COMMUN JPN, V55, P35
[29]   A 3-DIMENSIONAL MODEL OF TONGUE MOVEMENT BASED ON ULTRASOUND AND X-RAY MICROBEAM DATA [J].
STONE, M .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 87 (05) :2207-2217
[30]   Modeling the motion of the internal tongue from tagged cine-MRI images [J].
Stone, M ;
Davis, EP ;
Douglas, AS ;
NessAiver, M ;
Gullapalli, R ;
Levine, WS ;
Lundberg, A .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 109 (06) :2974-2982