Improved vocal tract reconstruction and modeling using an image super-resolution technique

被引:8
作者
Zhou, Xinhui [1 ,2 ]
Woo, Jonghye [3 ,5 ]
Stone, Maureen [3 ,4 ]
Prince, Jerry L. [5 ]
Espy-Wilson, Carol Y. [1 ,2 ]
机构
[1] Univ Maryland, Syst Res Inst, Speech Commun Lab, College Pk, MD 20742 USA
[2] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[3] Univ Maryland, Sch Dent, Dept Neural & Pain Sci, Baltimore, MD 21201 USA
[4] Univ Maryland, Sch Dent, Dept Orthodont, Baltimore, MD 21201 USA
[5] Johns Hopkins Univ, Dept Elect & Comp Engn, Baltimore, MD 21218 USA
基金
美国国家卫生研究院;
关键词
AREA FUNCTIONS; RESONANCE; MRI;
D O I
10.1121/1.4802903
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Magnetic resonance imaging has been widely used in speech production research. Often only one image stack (sagittal, axial, or coronal) is used for vocal tract modeling. As a result, complementary information from other available stacks is not utilized. To overcome this, a recently developed super-resolution technique was applied to integrate three orthogonal low-resolution stacks into one isotropic volume. The results on vowels show that the super-resolution volume produces better vocal tract visualization than any of the low-resolution stacks. Its derived area functions generally produce formant predictions closer to the ground truth, particularly for those formants sensitive to area perturbations at constrictions. (C) 2013 Acoustical Society of America
引用
收藏
页码:EL439 / EL445
页数:7
相关论文
共 10 条
[1]   ANALYSIS OF VOCAL-TRACT SHAPE AND DIMENSIONS USING MAGNETIC-RESONANCE-IMAGING - VOWELS [J].
BAER, T ;
GORE, JC ;
GRACCO, LC ;
NYE, PW .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1991, 90 (02) :799-828
[2]   MORPHOLOGICAL AND ACOUSTICAL ANALYSIS OF THE NASAL AND THE PARANASAL CAVITIES [J].
DANG, JW ;
HONDA, K ;
SUZUKI, H .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 96 (04) :2088-2100
[3]   AN ARTICULATORY STUDY OF FRICATIVE CONSONANTS USING MAGNETIC-RESONANCE-IMAGING [J].
NARAYANAN, SS ;
ALWAN, AA ;
HAKER, K .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1995, 98 (03) :1325-1347
[4]   Toward articulatory-acoustic models for liquid approximants based on MRI and EPG data .1. The laterals [J].
Narayanan, SS ;
Alwan, AA ;
Haker, K .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 101 (02) :1064-1077
[5]  
SCHMIDT M, 2005, TR0514 U ALB
[6]   Technique for "tuning" vocal tract area functions based on acoustic sensitivity functions (L) [J].
Story, BH .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (02) :715-718
[7]   Vocal tract area functions from magnetic resonance imaging [J].
Story, BH ;
Titze, IR ;
Hoffman, EA .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 100 (01) :537-554
[8]   Measurement of temporal changes in vocal tract area function from 3D cine-MRI data [J].
Takemoto, H ;
Honda, K ;
Masaki, S ;
Shimada, Y ;
Fujimoto, I .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (02) :1037-1049
[9]   Reconstruction of High-Resolution Tongue Volumes From MRI [J].
Woo, Jonghye ;
Murano, Emi Z. ;
Stone, Maureen ;
Prince, Jerry L. .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2012, 59 (12) :3511-3524
[10]   A magnetic resonance imaging-based articulatory and acoustic study of "retroflex" and "bunched" American English |r| [J].
Zhou, Xinhui ;
Espy-Wilson, Carol Y. ;
Boyce, Suzanne ;
Tiede, Mark ;
Holland, Christy ;
Choe, Ann .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (06) :4466-4481