A database of orthography-semantics consistency (OSC) estimates for 15,017 English words

被引:41
作者
Marelli, Marco [1 ]
Amenta, Simona [2 ]
机构
[1] Univ Milano Bicocca, Dept Psychol, Pzza Ateneo Nuovo 1, I-20126 Milan, MI, Italy
[2] Univ Ghent, Dept Expt Psychol, Ghent, Belgium
关键词
Orthography-semantics consistency; Form-meaning mapping; Word recognition; Lexical resources; Distributional semantic models; LEXICAL DECISION; FEEDBACK SEMANTICS; FREQUENCY; NEIGHBORHOOD; RECOGNITION; SPACE; TRANSPARENCY; ACTIVATION; INDUCTION; RICHNESS;
D O I
10.3758/s13428-018-1017-8
中图分类号
B841 [心理学研究方法];
学科分类号
040201 ;
摘要
Orthography-semantics consistency (OSC) is a measure that quantifies the degree of semantic relatedness between a word and its orthographic relatives. OSC is computed as the frequency-weighted average semantic similarity between the meaning of a given word and the meanings of all the words containing that very same orthographic string, as captured by distributional semantic models. We present a resource including optimized estimates of OSC for 15,017 English words. In a series of analyses, we provide a progressive optimization of the OSC variable. We show that computing OSC from word-embeddings models (in place of traditional count models), limiting preprocessing of the corpus used for inducing semantic vectors (in particular, avoiding part-of-speech tagging and lemmatization), and relying on a wider pool of orthographic relatives provide better performance for the measure in a lexical-processing task. We further show that OSC is an important and significant predictor of reaction times in visual word recognition and word naming, one that correlates only weakly with other psycholinguistic variables (e.g., family size, word frequency), indicating that it captures a novel source of variance in lexical access. Finally, some theoretical and methodological implications are discussed of adopting OSC as one of the predictors of reaction times in studies of visual word recognition.
引用
收藏
页码:1482 / 1495
页数:14
相关论文
共 59 条
[1]  
Amenta S., 2015, 1 QUANT MORPH M BELG
[2]   From sound to meaning: Phonology-to-Semantics mapping in visual word recognition [J].
Amenta, Simona ;
Marelli, Marco ;
Sulpizio, Simone .
PSYCHONOMIC BULLETIN & REVIEW, 2017, 24 (03) :887-893
[3]   The effect of orthographic similarity on lexical retrieval: Resolving neighborhood conflicts [J].
Andrews, S .
PSYCHONOMIC BULLETIN & REVIEW, 1997, 4 (04) :439-461
[4]  
[Anonymous], 1966, Soviet Physics Doklady
[5]  
[Anonymous], NATURAL LANGUAGE PRO
[6]  
[Anonymous], 1993, The CELEX Lexical Database (Release 1) CD-ROM
[7]   Learning Topic Models - Going beyond SVD [J].
Arora, Sanjeev ;
Ge, Rong ;
Moitra, Ankur .
2012 IEEE 53RD ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2012, :1-10
[8]   An Amorphous Model for Morphological Processing in Visual Comprehension Based on Naive Discriminative Learning [J].
Baayen, R. Harald ;
Milin, Petar ;
Durdevic, Dusica Filipovic ;
Hendrix, Peter ;
Marelli, Marco .
PSYCHOLOGICAL REVIEW, 2011, 118 (03) :438-481
[9]   Visual word recognition of single-syllable words [J].
Balota, DA ;
Cortese, MJ ;
Sergent-Marshall, SD ;
Spieler, DH ;
Yap, MJ .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 2004, 133 (02) :283-316
[10]   The English Lexicon Project [J].
Balota, David A. ;
Yap, Melvin J. ;
Cortese, Michael J. ;
Hutchison, Keith A. ;
Kessler, Brett ;
Loftis, Bjorn ;
Neely, James H. ;
Nelson, Douglas L. ;
Simpson, Greg B. ;
Treiman, Rebecca .
BEHAVIOR RESEARCH METHODS, 2007, 39 (03) :445-459