EsPal: One-stop shopping for Spanish word properties

Cited by: 363
Authors
Duchon, Andrew [1]
Perea, Manuel [2]
Sebastian-Galles, Nuria [3]
Marti, Antonia [4]
Carreiras, Manuel [1, 5]
Affiliations
[1] Basque Ctr Cognit Brain & Language, Donostia San Sebastian, Spain
[2] Univ Valencia, Valencia, Spain
[3] Univ Pompeu Fabra, Barcelona, Spain
[4] Univ Barcelona, Barcelona, Spain
[5] Basque Fdn Sci, IKERBASQUE, Bilbao, Spain
Keywords
Word frequency; Subtitles; Word recognition; Corpus linguistics; Psycholinguistics; LEXICAL DECISION; ORTHOGRAPHIC NEIGHBORHOOD; CONTEXTUAL DIVERSITY; SYLLABLE FREQUENCY; EYE-MOVEMENTS; RECOGNITION; STATISTICS; PROGRAM; INFORMATION; MEMORY
DOI
10.3758/s13428-013-0326-1
Chinese Library Classification
B841 [Research methods in psychology]
Subject classification code
040201
Abstract
This article introduces EsPal: a Web-accessible repository containing a comprehensive set of properties of Spanish words. EsPal is based on an extensible set of data sources, beginning with a 300-million-token written database and a 460-million-token subtitle database. Properties available include word frequency, orthographic structure and neighborhoods, phonological structure and neighborhoods, and subjective ratings such as imageability. Subword structure properties are also available in terms of bigrams and trigrams, biphones, and bisyllables. Lemma and part-of-speech information and their corresponding frequencies are also indexed. The website enables users either to upload a set of words to receive their properties or to receive a set of words matching constraints on the properties. The set of properties is easily extensible, and more will be added over time as they become available. EsPal is freely available from the following website: http://www.bcbl.eu/databases/espal/.
Pages: 1246-1258
Page count: 13