The role of corpus size and syntax in deriving lexico-semantic representations for a wide range of concepts

被引:12
作者
De Deyne, Simon [1 ]
Verheyen, Steven [1 ]
Storms, Gert [1 ]
机构
[1] Katholieke Univ Leuven, Dept Psychol, B-3000 Louvain, Belgium
基金
比利时弗兰德研究基金会;
关键词
Word associations; Text corpora; Similarity; Syntactic dependency; Semantic memory; ASSOCIATION; INFORMATION; ACQUISITION; CATEGORIES; SIMULATION; NETWORKS; OBJECTS; WORDS;
D O I
10.1080/17470218.2014.994098
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
One of the most significant recent advances in the study of semantic processing is the advent of models based on text and other corpora. In this study, we address what impact both the quantitative and qualitative properties of corpora have on mental representations derived from them. More precisely, we evaluate models with different linguistic and mental constraints on their ability to predict semantic relatedness between items from a vast range of domains and categories. We find that a model based on syntactic dependency relations captures significantly less of the variability for all kinds of words, regardless of the semantic relation between them or their abstractness. The largest difference was found for concrete nouns, which are commonly used to assess semantic processing. For both models we find that limited amounts of data suffice in order to obtain reliable predictions. Together, these findings suggest new constraints for the construction of mental models from corpora, both in terms of the corpus size and in terms of the linguistic properties that contribute to mental representations.
引用
收藏
页码:1643 / 1664
页数:22
相关论文
共 76 条
[1]  
Aitchison J., 2003, WORDS MIND INTRO MEN, V3rd
[2]  
Andrews M., 2005, P 27 M COGN SCI SOC, P127
[3]  
[Anonymous], P 34 ANN C COGN SCI
[4]  
[Anonymous], 2012, Language Grounding in Robots
[5]  
[Anonymous], P 31 ANN M OH STAT U
[6]  
[Anonymous], COMPUTATIONAL LINGUI
[7]  
[Anonymous], 2004, Advances in Neural Information Processing Systems
[8]  
Aston G., 1997, BNC HDB EXPLORING BR
[9]   Networks in Cognitive Science [J].
Baronchelli, Andrea ;
Ferrer-i-Cancho, Ramon ;
Pastor-Satorras, Romualdo ;
Chater, Nick ;
Christiansen, Morten H. .
TRENDS IN COGNITIVE SCIENCES, 2013, 17 (07) :348-360
[10]  
Barsalou L.W., 2008, Symbols, Embodiment, and Meaning, P245, DOI DOI 10.1093/ACPROF:OSO/9780199217274.003.0013