Integrating Experiential and Distributional Data to Learn Semantic Representations

被引:269
作者
Andrews, Mark [1 ]
Vigliocco, Gabriella [1 ]
Vinson, David [1 ]
机构
[1] UCL, Div Psychol & Language Sci, London WC1 6BT, England
基金
英国生物技术与生命科学研究理事会; 英国经济与社会研究理事会;
关键词
semantic representations; probabilistic models; Bayesian models; computational models; distributional data;
D O I
10.1037/a0016261
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
The authors identify 2 major types of statistical data from which semantic representations can be learned. These are denoted as experiential data and distributional data. Experiential data are derived by way of experience with the physical world and comprise the sensory-motor data obtained through sense receptors. Distributional data, by contrast, describe the statistical distribution of words across spoken and written language. The authors claim that experiential and distributional data represent distinct data types and that each is a nontrivial source of semantic information. Their theoretical proposal is that human semantic representations are derived from an optimal statistical combination of these 2 data types. Using a Bayesian probabilistic model, they demonstrate how word meanings can be learned by treating experiential and distributional data as a single joint distribution and learning the statistical structure that underlies it. The semantic representations that are learned in this manner are measurably more realistic-as verified by comparison to a set of human-based measures of semantic representation-than those available from either data type individually or from both sources independently. This is not a result of merely using quantitatively more data, but rather it is because experiential and distributional data are qualitatively distinct, yet intercorrelated, types of data. The semantic representations that are learned are based on statistical structures that exist both within and between the experiential and distributional data types.
引用
收藏
页码:463 / 498
页数:36
相关论文
共 117 条
[1]  
Anderson J., 1983, The architecture of cognition
[2]  
[Anonymous], 2006, Data Analysis-A Bayesian Tutorial
[3]  
[Anonymous], 1993, LINGUISTIC DISORDERS, DOI DOI 10.1515/9783110113242
[4]  
[Anonymous], CONCEPTUAL CHANGE IN
[5]  
[Anonymous], DIMENSIONS MEANING
[6]   Congruent embodied representations for visually presented actions and linguistic phrases describing actions [J].
Aziz-Zadeh, Lisa ;
Wilson, Stephen M. ;
Rizzolatti, Giacomo ;
Iacoboni, Marco .
CURRENT BIOLOGY, 2006, 16 (18) :1818-1823
[7]  
Barsalou L.W., 2005, Grounding cognition: The role of perception and action in memory, language, and thought, P129, DOI DOI 10.1017/CBO9780511499968
[8]   Parallel visual motion processing streams for manipulable objects and human movements [J].
Beauchamp, MS ;
Lee, KE ;
Haxby, JV ;
Martin, A .
NEURON, 2002, 34 (01) :149-159
[9]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[10]  
Blin F, 2016, LANG STUD SCI ENGINE, V2, P41, DOI 10.1075/lsse.2.03bli