Tracking contours of orofacial articulators from real-time MRI of speech

被引:2
|
作者
Labrunie, Mathieu [1 ,2 ]
Badin, Pierre [1 ,2 ]
Voit, Dirk [5 ]
Joseph, Arun A. [5 ]
Lamalle, Laurent [3 ,4 ]
Vilain, Coriandre [1 ,2 ]
Boe, Louis-Jean [1 ,2 ]
Frahm, Jens [5 ]
机构
[1] Univ Grenoble Alpes, GIPSA Lab, F-38000 Grenoble, France
[2] CNRS, GIPSA Lab, F-38000 Grenoble, France
[3] Univ Grenoble Alpes, Inserm US 17, CNRS UMS 3552, Grenoble, France
[4] CHU Grenoble, UMS IRMaGe, La Tronche, France
[5] Max Planck Inst Biophys Chem, Biomed NMR Forsch GmbH, Gottingen, Germany
来源
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年
关键词
Real-time MRI; orofacial articulators; speech articulation; automatic contour tracking; multiple linear regression; Active Shape Models; SEGMENTATION; RESOLUTION;
D O I
10.21437/Interspeech.2016-78
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We introduce a method for predicting midsagittal contours of orofacial articulators from real-time MRI data. A corpus of about 26 minutes of speech has been recorded of a French speaker at a rate of 55 images / s using highly undersampled radial gradient-echo MRI with image reconstruction by nonlinear inversion. The contours of each articulator have been manually traced for a set of about 60 images selected by hierarchical clustering to optimally represent the diversity of the speaker articulations The data serve to build articulator-specific Principal Component Analysis (PCA) models of contours and associated image intensities, as well as multilinear regression (MLR) models that predict contour parameters from image parameters. The contours obtained by MLR are then refined, using the local information about pixel intensity profiles along the contours' normals, by means of modified Active Shape Models (ASM) trained on the same data. The method reaches RMS of predicted points to reference contour distances between 0.54 and 0 93 mm, depending on articulators. The processing of the corpus demonstrated the efficiency of the procedure, despite the possibility of further improvements. This work opens new perspectives for studying articulatory motion in speech.
引用
收藏
页码:470 / +
页数:3
相关论文
共 50 条
  • [1] Automatic segmentation of speech articulators from real-time midsagittal MRI based on supervised learning
    Labrunie, Mathieu
    Badin, Pierre
    Voit, Dirk
    Joseph, Arun A.
    Frahm, Jens
    Lamalle, Laurent
    Vilain, Coriandre
    Boe, Louis-Jean
    SPEECH COMMUNICATION, 2018, 99 : 27 - 46
  • [2] Recommendations for real-time speech MRI
    Lingala, Sajan Goud
    Sutton, Brad P.
    Miquel, Marc E.
    Nayak, Krishna S.
    JOURNAL OF MAGNETIC RESONANCE IMAGING, 2016, 43 (01) : 28 - 44
  • [3] Real-time MRI and articulatory coordination in speech
    Demolin, D
    Hassid, S
    Metens, T
    Soquet, A
    COMPTES RENDUS BIOLOGIES, 2002, 325 (04) : 547 - 556
  • [4] Analysis of speech production real-time MRI
    Ramanarayanan, Vikram
    Tilsen, Sam
    Proctor, Michael
    Toger, Johannes
    Goldstein, Louis
    Nayak, Krishna S.
    Narayanan, Shrikanth
    COMPUTER SPEECH AND LANGUAGE, 2018, 52 : 1 - 22
  • [5] Speech Synthesis from Articulatory Movements Recorded by Real-time MRI
    Otani, Yuto
    Sawada, Shun
    Ohmura, Hidefumi
    Katsurada, Kouichi
    INTERSPEECH 2023, 2023, : 127 - 131
  • [6] Speech production real-time MRI at 0.55 T
    Lim, Yongwan
    Kumar, Prakash
    Nayak, Krishna S.
    MAGNETIC RESONANCE IN MEDICINE, 2024, 91 (01) : 337 - 343
  • [7] Real-time head tracking from the deformation of eye contours using a piecewise affine camera
    Colombo, C
    Del Bimbo, A
    PATTERN RECOGNITION LETTERS, 1999, 20 (07) : 721 - 730
  • [8] Reconstructing Speech from Real-Time Articulatory MRI Using Neural Vocoders
    Yu, Yide
    Shandiz, Amin Honarmandi
    Toth, Laszlo
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 945 - 949
  • [9] Real-Time Visual Tracking Using Geometric Active Contours and Particle Filters
    Ha, Jincheol
    Johnson, Eric N.
    Tannenbaum, Allen
    JOURNAL OF AEROSPACE COMPUTING INFORMATION AND COMMUNICATION, 2008, 5 (10): : 361 - 379
  • [10] Deep Active Contours for Real-time 6-DoF Object Tracking
    Wang, Long
    Yan, Shen
    Zhen, Jianan
    Liu, Yu
    Zhang, Maojun
    Zhang, Guofeng
    Zhou, Xiaowei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13988 - 13998