End-to-End Deep Image Reconstruction From Human Brain Activity

被引:99
作者
Shen, Guohua [1 ]
Dwivedi, Kshitij [1 ]
Majima, Kei [2 ]
Horikawa, Tomoyasu [1 ]
Kamitani, Yukiyasu [1 ,2 ]
机构
[1] Adv Telecommun Res Inst Int, Computat Neurosci Labs, Kyoto, Japan
[2] Kyoto Univ, Grad Sch Informat, Kyoto, Japan
关键词
brain decoding; visual image reconstruction; functional magnetic resonance imaging; deep neural networks; generative adversarial networks; NEURAL-NETWORKS; REPRESENTATIONS;
D O I
10.3389/fncom.2019.00021
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Deep neural networks (DNNs) have recently been applied successfully to brain decoding and image reconstruction from functional magnetic resonance imaging (fMRI) activity. However, direct training of a DNN with fMRI data is often avoided because the size of available data is thought to be insufficient for training a complex network with numerous parameters. Instead, a pre-trained DNN usually serves as a proxy for hierarchical visual representations, and fMRI data are used to decode individual DNN features of a stimulus image using a simple linear model, which are then passed to a reconstruction module. Here, we directly trained a DNN model with fMRI data and the corresponding stimulus images to build an end-to-end reconstruction model. We accomplished this by training a generative adversarial network with an additional loss term that was defined in high-level feature space (feature loss) using up to 6,000 training data samples (natural images and fMRI responses). The above model was tested on independent datasets and directly reconstructed image using an fMRI pattern as the input. Reconstructions obtained from our proposed method resembled the test stimuli (natural and artificial images) and reconstruction accuracy increased as a function of training-data size. Ablation analyses indicated that the feature loss that we employed played a critical role in achieving accurate reconstruction. Our results show that the end-to-end model can learn a direct mapping between brain activity and perception.
引用
收藏
页数:11
相关论文
共 28 条
[21]   Visual Image Reconstruction from Human Brain Activity using a Combination of Multiscale Local Image Decoders [J].
Miyawaki, Yoichi ;
Uchida, Hajime ;
Yamashita, Okito ;
Sato, Masa-aki ;
Morito, Yusuke ;
Tanabe, Hiroki C. ;
Sadato, Norihiro ;
Kamitani, Yukiyasu .
NEURON, 2008, 60 (05) :915-929
[22]   Bayesian Reconstruction of Natural Images from Human Brain Activity [J].
Naselaris, Thomas ;
Prenger, Ryan J. ;
Kay, Kendrick N. ;
Oliver, Michael ;
Gallant, Jack L. .
NEURON, 2009, 63 (06) :902-915
[23]   Reconstructing Visual Experiences from Brain Activity Evoked by Natural Movies [J].
Nishimoto, Shinji ;
Vu, An T. ;
Naselaris, Thomas ;
Benjamini, Yuval ;
Yu, Bin ;
Gallant, Jack L. .
CURRENT BIOLOGY, 2011, 21 (19) :1641-1646
[24]   Generative adversarial networks for reconstructing natural images from brain activity [J].
Seeliger, K. ;
Guclu, U. ;
Ambrogioni, L. ;
Gucluturk, Y. ;
van Gerven, M. A. J. .
NEUROIMAGE, 2018, 181 :775-785
[25]  
Shen G., 2018, bioRxiv p, P272518, DOI DOI 10.1101/272518
[26]   Deep image reconstruction from human brain activity [J].
Shen, Guohua ;
Horikawa, Tomoyasu ;
Majima, Kei ;
Kamitani, Yukiyasu .
PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (01)
[27]   Matrix correlations for high-dimensional data: the modified RV-coefficient [J].
Smilde, A. K. ;
Kiers, H. A. L. ;
Bijlsma, S. ;
Rubingh, C. M. ;
van Erk, M. J. .
BIOINFORMATICS, 2009, 25 (03) :401-405
[28]   Image quality assessment: From error visibility to structural similarity [J].
Wang, Z ;
Bovik, AC ;
Sheikh, HR ;
Simoncelli, EP .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (04) :600-612