SPEECH SYNTHESIS USING EEG

被引：0

作者：

Krishna, Gautam ^{[1
]}

Tran, Co ^{[1
]}

Han, Yan ^{[1
]}

Carnahan, Mason ^{[1
]}

Tewfik, Ahmed H. ^{[1
]}

机构：

[1] Univ Texas Austin, Brain Machine Interface Lab, Austin, TX 78712 USA

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年

关键词：

Speech synthesis; EEG; Deep Learning;

D O I：

10.1109/icassp40776.2020.9053340

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper we demonstrate speech synthesis using different electroencephalography (EEG) feature sets recently introduced in [1]. We make use of a recurrent neural network (RNN) regression model to predict acoustic features directly from EEG features. We demonstrate our results using EEG features recorded in parallel with spoken speech as well as using EEG recorded in parallel with listening utterances. We provide EEG based speech synthesis results for four subjects in this paper and our results demonstrate the feasibility of synthesizing speech directly from EEG features.

引用

页码：1235 / 1238

页数：4

共 13 条

[1] AMERICAN-ELECTROENCEPHALOGRAPHIC-SOCIETY GUIDELINES FOR STANDARD ELECTRODE POSITION NOMENCLATURE
不详
[J]. JOURNAL OF CLINICAL NEUROPHYSIOLOGY, 1991, 8 (02) : 200 - 202
[2] Speech synthesis from neural decoding of spoken sentences
Anumanchipalli, Gopala K.
Chartier, Josh
Chang, Edward F.
[J]. NATURE, 2019, 568 (7753) : 493 - +
[3] Chung J., 2014, ARXIV14123555, DOI DOI 10.48550/ARXIV.1412.3555
[4] EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis
Delorme, A
Makeig, S
[J]. JOURNAL OF NEUROSCIENCE METHODS, 2004, 134 (01) : 9 - 21
[5] Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[6] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[7] SIGNAL ESTIMATION FROM MODIFIED SHORT-TIME FOURIER-TRANSFORM
GRIFFIN, DW
LIM, JS
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (02): : 236 - 243
[8] Kingma DP., 2017, A method for stochastic optimization, DOI DOI 10.48550/ARXIV.1412.6980
[9] Kominek J., 2008, Spoken Languages Technologies for Under-Resourced Languages
[10] Krishna G, 2019, ARXIV190805743

← 1 2 →