Neural Encoding and Decoding With Distributed Sentence Representations

被引:23
作者
Sun, Jingyuan [1 ,2 ]
Wang, Shaonan [1 ,2 ]
Zhang, Jiajun [1 ,2 ]
Zong, Chengqing [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] CAS Ctr Excellence Brain Sci & Intelligence Techn, Beijing 200031, Peoples R China
关键词
Decoding; Encoding; Semantics; Brain modeling; Linguistics; Task analysis; Brain– machine interfaces; distributed semantic representations; neural decoding; neural encoding; NETWORK;
D O I
10.1109/TNNLS.2020.3027595
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Building computational models to account for the cortical representation of language plays an important role in understanding the human linguistic system. Recent progress in distributed semantic models (DSMs), especially transformer-based methods, has driven advances in many language understanding tasks, making DSM a promising methodology to probe brain language processing. DSMs have been shown to reliably explain cortical responses to word stimuli. However, characterizing the brain activities for sentence processing is much less exhaustively explored with DSMs, especially the deep neural network-based methods. What is the relationship between cortical sentence representations against DSMs? What linguistic features that a DSM catches better explain its correlation with the brain activities aroused by sentence stimuli? Could distributed sentence representations help to reveal the semantic selectivity of different brain areas? We address these questions through the lens of neural encoding and decoding, fueled by the latest developments in natural language representation learning. We begin by evaluating the ability of a wide range of 12 DSMs to predict and decipher the functional magnetic resonance imaging (fMRI) images from humans reading sentences. Most models deliver high accuracy in the left middle temporal gyrus (LMTG) and left occipital complex (LOC). Notably, encoders trained with transformer-based DSMs consistently outperform other unsupervised structured models and all the unstructured baselines. With probing and ablation tasks, we further find that differences in the performance of the DSMs in modeling brain activities can be at least partially explained by the granularity of their semantic representations. We also illustrate the DSM's selectivity for concept categories and show that the topics are represented by spatially overlapping and distributed cortical patterns. Our results corroborate and extend previous findings in understanding the relation between DSMs and neural activation patterns and contribute to building solid brain-machine interfaces with deep neural network representations.
引用
收藏
页码:589 / 603
页数:15
相关论文
共 41 条
  • [1] Abnar S, 2019, BLACKBOXNLP WORKSHOP ON ANALYZING AND INTERPRETING NEURAL NETWORKS FOR NLP AT ACL 2019, P191
  • [2] Predicting Neural Activity Patterns Associated with Sentences Using a Neurobiologically Motivated Model of Semantic Representation
    Anderson, Andrew James
    Binder, Jeffrey R.
    Fernandino, Leonardo
    Humphries, Colin J.
    Conant, Lisa L.
    Aguilar, Mario
    Wang, Xixi
    Doko, Donias
    Raizada, Rajeev D. S.
    [J]. CEREBRAL CORTEX, 2017, 27 (09) : 4379 - 4395
  • [3] Arora S., 2016, P INT C LEARN REPR I
  • [4] Evidence for conceptual combination in the left anterior temporal lobe
    Baron, Sean G.
    Osherson, Daniel
    [J]. NEUROIMAGE, 2011, 55 (04) : 1847 - 1852
  • [5] Where Is the Semantic System? A Critical Review and Meta-Analysis of 120 Functional Neuroimaging Studies
    Binder, Jeffrey R.
    Desai, Rutvik H.
    Graves, William W.
    Conant, Lisa L.
    [J]. CEREBRAL CORTEX, 2009, 19 (12) : 2767 - 2796
  • [6] The brain's default network - Anatomy, function, and relevance to disease
    Buckner, Randy L.
    Andrews-Hanna, Jessica R.
    Schacter, Daniel L.
    [J]. YEAR IN COGNITIVE NEUROSCIENCE 2008, 2008, 1124 : 1 - 38
  • [7] Cer D., 2017, Semeval-2017 task 1: semantic textual similarity multilingual and cross-lingual focused evaluation, P1, DOI DOI 10.18653/V1/S17-2001
  • [8] Conneau A, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, P2126
  • [9] Conneau Alexis, 2017, P 2017 C EMPIRICAL M, P670, DOI [10.18653/v1/D17-1070, DOI 10.18653/V1/D17-1070]
  • [10] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171