Reconstructing Perceived Images From Human Brain Activities With Bayesian Deep Multiview Learning

被引:77
作者
Du, Changde [1 ,2 ]
Du, Changying [3 ,4 ]
Huang, Lijie [1 ,2 ]
He, Huiguang [1 ,2 ,5 ]
机构
[1] Chinese Acad Sci, Inst Automat, Res Ctr Brain Inspired Intelligence, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] Chinese Acad Sci, Inst Software, Lab Parallel Software & Computat Sci, Beijing 100190, Peoples R China
[4] 360 Search Lab, Beijing 100015, Peoples R China
[5] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep neural network (DNN); image reconstruction; multiview learning; neural decoding; variational Bayesian inference; NEURAL-NETWORKS; NATURAL IMAGES; FMRI; REPRESENTATIONS; CATEGORIES; INFERENCE; PATTERNS; OBJECTS; MODELS; FACES;
D O I
10.1109/TNNLS.2018.2882456
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural decoding, which aims to predict external visual stimuli information from evoked brain activities, plays an important role in understanding human visual system. Many existing methods are based on linear models, and most of them only focus on either the brain activity pattern classification or visual stimuli identification. Accurate reconstruction of the perceived images from the measured human brain activities still remains challenging. In this paper, we propose a novel deep generative multiview model for the accurate visual image reconstruction from the human brain activities measured by functional magnetic resonance imaging (fMRI). Specifically, we model the statistical relationships between the two views (i.e., the visual stimuli and the evoked fMRI) by using two view-specific generators with a shared latent space. On the one hand, we adopt a deep neural network architecture for visual image generation, which mimics the stages of human visual processing. On the other hand, we design a sparse Bayesian linear model for fMRI activity generation, which can effectively capture voxel correlations, suppress data noise, and avoid overfitting. Furthermore, we devise an efficient mean-field variational inference method to train the proposed model. The proposed method can accurately reconstruct visual images via Bayesian inference. In particular, we exploit a posterior regularization technique in the Bayesian inference to regularize the model posterior. The quantitative and qualitative evaluations conducted on multiple fMRI data sets demonstrate the proposed method can reconstruct visual images more accurately than the state of the art.
引用
收藏
页码:2310 / 2323
页数:14
相关论文
共 61 条
[1]  
[Anonymous], 2015, Nature, DOI [10.1038/nature14539, DOI 10.1038/NATURE14539]
[2]  
[Anonymous], BIORXIV
[3]  
[Anonymous], 2016, CORR ABS160605579
[4]  
[Anonymous], 2017, P ASM 36 INT C OC P ASM 36 INT C OC
[5]  
[Anonymous], 2012, P 29 INT C MACHINE L
[6]  
[Anonymous], 2016, ADV NEURAL INFORM PR
[7]  
Arjovsky M., 2017, P 34 INT C MACH LEAR, V70, P214
[8]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[9]   Correlational Neural Networks [J].
Chandar, Sarath ;
Khapra, Mitesh M. ;
Larochelle, Hugo ;
Ravindran, Balaraman .
NEURAL COMPUTATION, 2016, 28 (02) :257-285
[10]   Survey of encoding and decoding of visual stimulus via FMRI: an image analysis perspective [J].
Chen, Mo ;
Han, Junwei ;
Hu, Xintao ;
Jiang, Xi ;
Guo, Lei ;
Liu, Tianming .
BRAIN IMAGING AND BEHAVIOR, 2014, 8 (01) :7-23