Post-processing speech recordings during MRI

Cited by: 3

Authors:

Kuortti, Juha [1]

Malinen, Jarmo [1,2]

Ojalammi, Antti [1]

Affiliations:

[1] Aalto Univ, Dept Math & Syst Anal, Espoo, Finland

[2] Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland

Source:

BIOMEDICAL SIGNAL PROCESSING AND CONTROL | 2018 / Vol. 39

Keywords:

Speech; MRI; Noise reduction; DSP; Helmholtz; Acoustic noise

DOI:

10.1016/j.bspc.2017.07.017

Chinese Library Classification (CLC) number:

R318 [Biomedical Engineering];

Discipline classification code:

0831;

Abstract:

We discuss post-processing of speech samples that have been recorded simultaneously during Magnetic Resonance Imaging (MRI) of the upper airways. The speech recordings contain acoustic noise from the MRI scanner. The required noise reduction is based on adaptive comb filtering designed for accurate formant extraction. Two kinds of speech materials were used to validate the post-processing algorithm. The primary material consists of samples of prolonged vowel productions during MRI. The comparison data was obtained from the same test subject, and it was recorded in an anechoic chamber in a configuration similar to that used during the MRI. Spectral envelopes and vowel formants were computed from the post-processed speech and from the comparison data. Vowel samples (with a known formant structure) were artificially contaminated using MRI scanner noise to determine the performance of the post-processing algorithm. Resonances computed from a numerical acoustic model and spectra measured from 3D printed vocal tract physical models were used as comparison data. The properties of the recording instrumentation or the post-processing algorithm do not explain the observed frequency-dependent discrepancy between the vowel formant data from the two kinds of experiments: recordings during MRI and comparison data. It is shown that the discrepancy is statistically significant, in particular where it is largest, at ca. 1 kHz and 2 kHz. Numerical and experimental evidence suggests that the surfaces of the MRI head coil change the acoustics of speech, which results in "exterior formants" at these frequencies. The discrepancy is too large to be neglected if the recordings during MRI are to be used for parameter estimation or validation of a numerical speech model based on the MR images. However, the role of test subject adaptation to noise and constrained space acoustics during an MRI examination cannot be ruled out. © 2017 Elsevier Ltd. All rights reserved.

Citation

Pages: 11-22

Page count: 12

Total references: 36
