Vocal tract morphology using real-time magnetic resonance imaging

被引:1
作者
Sampaio, Rafael de A. [1 ]
Jackowski, Marcel P. [1 ]
机构
[1] Univ Sao Paulo, Inst Matemat & Estat, Sao Paulo, Brazil
来源
2017 30TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI) | 2017年
基金
巴西圣保罗研究基金会;
关键词
SEGMENTATION; SPEECH; MRI; SHAPE;
D O I
10.1109/SIBGRAPI.2017.54
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real-time Magnetic Resonance Imaging (rtMRI) leads to the dynamic observation of hidden processes of articulation in an unprecedented way. The non-invasive image acquisition nature of MRI combined with enhanced anatomical discrimination made rtMRI the reference in capturing vocal tract configurations during speech production. However, this development also unveiled challenges, such as the shape extraction and analysis of the vocal tract contours automatically. This work describes automated techniques for the segmentation of the vocal tract and identification of articulatory structures using rtMRI. The identification of these structures is vital for modeling articulatory synthesis. The methodology is based on level set methods to outline the vocal tract shape. Changes in the vocal tract shape and its structures were investigated for different corpora in order to bind the expression of phonemes and the behavior of the anatomical shapes. These shapes were labeled from basal form invariants, whose final evolution yielded the classification of regions of interest. The methodology resulting from this work may be employed in accent-suppression systems, speech production for laryngectomized patients, and therapetic techniques for children suffering from speech apraxia.
引用
收藏
页码:359 / 366
页数:8
相关论文
共 31 条
  • [1] Image quality in non-gated versus gated reconstruction of tongue motion using magnetic resonance imaging: a comparison using automated image processing
    Alvey, Christopher
    Orphanidou, C.
    Coleman, J.
    McIntyre, A.
    Golding, S.
    Kochanski, G.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2008, 3 (05) : 457 - 464
  • [2] [Anonymous], 2011, THESIS
  • [3] Aubert G., 2006, APPL INTELL, V2nd Edition, DOI DOI 10.1007/978-0-387-44588-5
  • [4] Avila-Garcia M.S., 2004, Proceedings of the International Conference on Signal Processing (ICSP 2004), P288
  • [5] Babalola KO, 2008, LECT NOTES COMPUT SC, V5241, P409, DOI 10.1007/978-3-540-85988-8_49
  • [6] Badin P., 1998, Proc. 5th ICSLP, P417
  • [7] ANALYSIS OF VOCAL-TRACT SHAPE AND DIMENSIONS USING MAGNETIC-RESONANCE-IMAGING - VOWELS
    BAER, T
    GORE, JC
    GRACCO, LC
    NYE, PW
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1991, 90 (02) : 799 - 828
  • [8] Berry E., 2007, SERIES MED PHYS BIOM
  • [9] Seeing speech: Capturing vocal tract shaping using real-time magnetic resonance imaging
    Bresch, Erik
    Kim, Yoon-Chul
    Nayak, Krishna
    Byrd, Dani
    Narayanan, Shrikanth
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2008, 25 (03) : 123 - +
  • [10] Region Segmentation in the Frequency Domain Applied to Upper Airway Real-Time Magnetic Resonance Images
    Bresch, Erik
    Narayanan, Shrikanth
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2009, 28 (03) : 323 - 338