SPEECH SYNTHESIS USING EEG

被引:0
作者
Krishna, Gautam [1 ]
Tran, Co [1 ]
Han, Yan [1 ]
Carnahan, Mason [1 ]
Tewfik, Ahmed H. [1 ]
机构
[1] Univ Texas Austin, Brain Machine Interface Lab, Austin, TX 78712 USA
来源
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年
关键词
Speech synthesis; EEG; Deep Learning;
D O I
10.1109/icassp40776.2020.9053340
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we demonstrate speech synthesis using different electroencephalography (EEG) feature sets recently introduced in [1]. We make use of a recurrent neural network (RNN) regression model to predict acoustic features directly from EEG features. We demonstrate our results using EEG features recorded in parallel with spoken speech as well as using EEG recorded in parallel with listening utterances. We provide EEG based speech synthesis results for four subjects in this paper and our results demonstrate the feasibility of synthesizing speech directly from EEG features.
引用
收藏
页码:1235 / 1238
页数:4
相关论文
共 13 条
  • [2] Speech synthesis from neural decoding of spoken sentences
    Anumanchipalli, Gopala K.
    Chartier, Josh
    Chang, Edward F.
    [J]. NATURE, 2019, 568 (7753) : 493 - +
  • [3] Chung J., 2014, ARXIV14123555, DOI DOI 10.48550/ARXIV.1412.3555
  • [4] EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis
    Delorme, A
    Makeig, S
    [J]. JOURNAL OF NEUROSCIENCE METHODS, 2004, 134 (01) : 9 - 21
  • [5] Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
  • [6] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
  • [7] SIGNAL ESTIMATION FROM MODIFIED SHORT-TIME FOURIER-TRANSFORM
    GRIFFIN, DW
    LIM, JS
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (02): : 236 - 243
  • [8] Kingma DP., 2017, A method for stochastic optimization, DOI DOI 10.48550/ARXIV.1412.6980
  • [9] Kominek J., 2008, Spoken Languages Technologies for Under-Resourced Languages
  • [10] Krishna G, 2019, ARXIV190805743