Word frequency and readability: Predicting the text-level readability with a lexical-level attribute

被引:29
作者
Chen, Xiaobin [1 ]
Meurers, Detmar [1 ]
机构
[1] Univ Tubingen, LEAD Grad Sch & Res Network, Seminar Sprachwissensch, Tubingen, Germany
关键词
COH-METRIX; VOCABULARY KNOWLEDGE; DIVERSITY; THRESHOLD;
D O I
10.1111/1467-9817.12121
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Assessment of text readability is important for assigning texts at the appropriate level to readers at different proficiency levels. The present research approached readability assessment from the lexical perspective of word frequencies derived from corpora assumed to reflect typical language experience. Three studies were conducted to test how the word-level feature of word frequency can be aggregated to characterise text-level readability. The results show that an effective use of word frequency for text readability assessment should take a range of characteristics of the distribution of words frequencies into account. For characterizing text readability, taking into account the standard deviation in addition to the mean word frequencies already significantly increases results. The best results are obtained using the mean frequencies of the words in language frequency bands or in bands obtained by agglomerative clustering of the word frequencies in the documents - though a comparison of within-corpus and cross-corpus results shows the limited generalizability of using high numbers of fine-grained frequency bands. Overall, the study advances our understanding of the relationship between word frequency and text readability and provides concrete options for more effectively making use of lexical frequency information in practice.
引用
收藏
页码:486 / 510
页数:25
相关论文
共 74 条
  • [1] Contextual diversity, not word frequency, determines word-naming and lexical decision times
    Adelman, James S.
    Brown, Gordon D. A.
    Quesada, Jose F.
    [J]. PSYCHOLOGICAL SCIENCE, 2006, 17 (09) : 814 - 823
  • [2] [Anonymous], COMM COR STAT STAND
  • [3] [Anonymous], 2014, TECHNICAL REPORT
  • [4] [Anonymous], 1975, Technical Report
  • [5] [Anonymous], 2007, TECHNICAL REPORT
  • [6] [Anonymous], 2007, HUMAN LANGUAGE TECHN
  • [7] [Anonymous], 2009, P 12 C EUROPEAN CHAP
  • [8] [Anonymous], 2001, LEARNING VOCABULARY, DOI DOI 10.1017/CB09781139524759
  • [9] [Anonymous], TECHNICAL REPORT
  • [10] ARE LEXICAL DECISIONS A GOOD MEASURE OF LEXICAL ACCESS - THE ROLE OF WORD-FREQUENCY IN THE NEGLECTED DECISION STAGE
    BALOTA, DA
    CHUMBLEY, JI
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1984, 10 (03) : 340 - 357