Post-processing speech recordings during MRI

Cited by: 3
Authors
Kuortti, Juha [1]
Malinen, Jarmo [1, 2]
Ojalammi, Antti [1]
Affiliations
[1] Aalto Univ, Dept Math & Syst Anal, Espoo, Finland
[2] Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland
Keywords
Speech; MRI; Noise reduction; DSP; Helmholtz; Acoustic noise
DOI
10.1016/j.bspc.2017.07.017
Chinese Library Classification (CLC) number
R318 [Biomedical engineering]
Discipline classification code
0831
Abstract
We discuss post-processing of speech samples recorded simultaneously with Magnetic Resonance Imaging (MRI) of the upper airways. The speech recordings contain acoustic noise from the MRI scanner. The required noise reduction is based on adaptive comb filtering designed for accurate formant extraction. Two kinds of speech material were used to validate the post-processing algorithm. The primary material consists of samples of prolonged vowel productions during MRI. The comparison data was obtained from the same test subject, and it was recorded in an anechoic chamber in a configuration similar to that used during the MRI. Spectral envelopes and vowel formants were computed from the post-processed speech and from the comparison data. Vowel samples with a known formant structure were artificially contaminated with MRI scanner noise to determine the performance of the post-processing algorithm. Resonances computed from a numerical acoustic model and spectra measured from 3D-printed physical models of the vocal tract were also used as comparison data. Neither the properties of the recording instrumentation nor the post-processing algorithm explains the observed frequency-dependent discrepancy between the vowel formant data from the two kinds of experiments: recordings during MRI and comparison data. The discrepancy is shown to be statistically significant, in particular where it is largest, at ca. 1 kHz and 2 kHz. Numerical and experimental evidence suggests that the surfaces of the MRI head coil change the acoustics of speech, which results in "exterior formants" at these frequencies. The discrepancy is too large to be neglected if the recordings during MRI are to be used for parameter estimation or validation of a numerical speech model based on the MR images. However, the role of test subject adaptation to noise and to constrained-space acoustics during an MRI examination cannot be ruled out. (C) 2017 Elsevier Ltd. All rights reserved.
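The abstract names comb filtering as the basis of the noise reduction but gives no details. As a rough illustration of the general idea only, the sketch below applies a plain feedforward comb filter, y[n] = x[n] - x[n - D], which cancels any component that repeats exactly every D samples. It is not the authors' adaptive algorithm: the function name `comb_notch`, the sampling rate, the fixed period, and the synthetic three-harmonic "scanner noise" are all assumptions made for this example.

```python
import math


def comb_notch(x, period):
    """Feedforward comb filter y[n] = x[n] - x[n - period].

    Cancels components that are exactly periodic with `period` samples,
    i.e. it places notches at all multiples of fs / period.
    Samples before the start of the signal are treated as zero.
    """
    y = []
    for n, sample in enumerate(x):
        delayed = x[n - period] if n >= period else 0.0
        y.append(sample - delayed)
    return y


# Hypothetical setup: 8 kHz sampling, noise fundamental at 200 Hz,
# so the noise repeats every 40 samples.
fs = 8000
period = 40

# Synthetic stand-in for periodic scanner noise: three harmonics.
noise = [
    sum(math.sin(2 * math.pi * k * n / period) / k for k in (1, 2, 3))
    for n in range(400)
]

filtered = comb_notch(noise, period)

# After the first `period` samples the periodic noise is cancelled
# up to floating-point rounding.
residual = max(abs(v) for v in filtered[period:])
```

A fixed comb like this also attenuates any speech energy that falls on the notch frequencies, and real scanner noise is not perfectly periodic. An adaptive scheme would have to estimate the noise period from the recording itself, which this sketch does not attempt.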
Pages: 11-22
Page count: 12