Aralex: A lexical database for Modern Standard Arabic

被引:84
作者
Boudelaa, Sami [1 ]
Marslen-Wilson, William D. [1 ]
机构
[1] MRC Cognit & Brain Sci Unit, Cambridge CB2 2EF, England
基金
英国医学研究理事会;
关键词
PSYCHOLINGUISTIC STATISTICS; MENTAL REPRESENTATION; MORPHOLOGY; LANGUAGE; SYSTEM; PROGRAM;
D O I
10.3758/BRM.42.2.481
中图分类号
B841 [心理学研究方法];
学科分类号
040201 ;
摘要
In this article, we present a new lexical database for Modern Standard Arabic: Aralex. Based on a contemporary text corpus of 40 million words, Aralex provides information about (1) the token frequencies of roots and word patterns, (2) the type frequency, or family size, of roots and word patterns, and (3) the frequency of bigrams, trigrams in orthographic forms, roots, and word patterns. Aralex will be a useful tool for studying the cognitive processing of Arabic through the selection of stimuli on the basis of precise frequency counts. Researchers can use it as a source of information on natural language processing, and it may serve an educational purpose by providing basic vocabulary lists. Aralex is distributed under a GNU-like license, allowing people to interrogate it freely online or to download it from www.mrc-cbu.cam.ac.uk:8081/aralex.online/login.jsp.
引用
收藏
页码:481 / 487
页数:7
相关论文
共 26 条
[1]  
[Anonymous], 1999, DICT ARABE FRANCAIS
[2]  
[Anonymous], 1993, The CELEX Lexical Database (Release 1) CD-ROM
[3]  
[Anonymous], 1995, Modern Arabic: Structures, Functions, and Varieties
[4]   Discontinuous morphology in time: Incremental masked priming in Arabic [J].
Boudelaa, S ;
Marslen-Wilson, WD .
LANGUAGE AND COGNITIVE PROCESSES, 2005, 20 (1-2) :207-260
[5]   A re-examination of the default system for Arabic plurals [J].
Boudelaa, S ;
Gaskell, MG .
LANGUAGE AND COGNITIVE PROCESSES, 2002, 17 (03) :321-343
[6]   Arabic Morphology in the Neural Language System [J].
Boudelaa, Sami ;
Pulvermueller, Friedemann ;
Hauk, Olaf ;
Shtyrov, Yury ;
Marslen-Wilson, William .
JOURNAL OF COGNITIVE NEUROSCIENCE, 2010, 22 (05) :998-1010
[7]  
Buckwalter T., 2002, ARABIC TRANSLITERATI
[8]  
CONTENT A, 1990, ANN PSYCHOL, V90, P551
[9]   N-Watch: A program for deriving neighborhood size and other psycholinguistic statistics [J].
Davis, CJ .
BEHAVIOR RESEARCH METHODS, 2005, 37 (01) :65-70
[10]  
DICHY J, 1998, P 6 INT C EXH MULT C, P1