MTLD, vocd-D, and HD-D: A validation study of sophisticated approaches to lexical diversity assessment

被引:433
作者
McCarthy, Philip M. [1 ]
Jarvis, Scott [2 ]
机构
[1] Univ Memphis, Dept English, Memphis, TN 38152 USA
[2] Ohio Univ, Athens, OH 45701 USA
基金
美国国家科学基金会;
关键词
LANGUAGE;
D O I
10.3758/BRM.42.2.381
中图分类号
B841 [心理学研究方法];
学科分类号
040201 ;
摘要
The main purpose of this study was to examine the validity of the approach to lexical diversity assessment known as the measure of textual lexical diversity (MTLD). The index for this approach is calculated as the mean length of word strings that maintain a criterion level of lexical variation. To validate the MTLD approach, we compared it against the performances of the primary competing indices in the field, which include vocd-D, TTR, Maas, Yule's K, and an HD-D index derived directly from the hypergeometric distribution function. The comparisons involved assessments of convergent validity, divergent validity, internal validity, and incremental validity. The results of our assessments of these indices across two separate corpora suggest three major findings. First, MTLD performs well with respect to all four types of validity and is, in fact, the only index not found to vary as a function of text length. Second, HD-D is a viable alternative to the vocd-D standard. And third, three of the indices-MTLD, vocd-D (or HD-D), and Maas-appear to capture unique lexical information. We conclude by advising researchers to consider using MTLD, vocd-D (or HD-D), and Maas in their studies, rather than any single index, noting that lexical diversity can be assessed in many ways and each approach may be informative as to the construct under investigation.
引用
收藏
页码:381 / 392
页数:12
相关论文
共 48 条
  • [1] [Anonymous], 1999, STAND ED PSYCH TEST
  • [2] [Anonymous], DISCOURSE P IN PRESS
  • [3] Best R., 2006, P 7 INT C LEARNING S, P37
  • [4] A TYPOLOGY OF ENGLISH-TEXTS
    BIBER, D
    [J]. LINGUISTICS, 1989, 27 (01) : 3 - 43
  • [5] Biber Douglas., 1988, Variation across speech and writing, DOI 10.1017/CBO9780511621024
  • [6] Biggs A., 2003, GLENCOE SCI SCI LEVE
  • [7] Cohen J., 1988, Statistical power analysis for the behavioral sciences, VSecond
  • [8] Crossley S. A., J RES READI IN PRESS
  • [9] Measuring L2 Lexical Growth Using Hypernymic Relationships
    Crossley, Scott
    Salsbury, Tom
    McNamara, Danielle
    [J]. LANGUAGE LEARNING, 2009, 59 (02) : 307 - 334
  • [10] Computational assessment of lexical differences in L1 and L2 writing
    Crossley, Scott A.
    McNamara, Danielle S.
    [J]. JOURNAL OF SECOND LANGUAGE WRITING, 2009, 18 (02) : 119 - 135