MORPHOLOGICAL ORGANIZATION: THE LOW CONDITIONAL ENTROPY CONJECTURE

被引:126
作者
Ackerman, Farrell [1 ]
Malouf, Robert [2 ]
机构
[1] Univ Calif San Diego, Dept Linguist, La Jolla, CA 92093 USA
[2] San Diego State Univ, Dept Linguist & Asian Middle Eastern Languages, San Diego, CA 92181 USA
关键词
inflectional paradigms; information-theoretic measures; word-based morphology; morphological typology; word-and-paradigm models; morphological complexity; LANGUAGE; REGULARITIES; INFLECTION; GRAMMAR; GENDER; USAGE;
D O I
10.1353/lan.2013.0054
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Crosslinguistically, inflectional morphology exhibits a spectacular range of complexity in both the structure of individual words and the organization of systems that words participate in. We distinguish two dimensions in the analysis of morphological complexity. ENUMERATIVE COMPLEXITY (E-complexity) reflects the number of morphosyntactic distinctions that languages make and the strategies employed to encode them, concerning either the internal composition of words or the arrangement of classes of words into inflection classes. This, we argue, is constrained by INTEGRATIVE COMPLEXITY (I-complexity). The I-complexity of an inflectional system reflects the difficulty that a paradigmatic system poses for language users (rather than lexicographers) in information-theoretic terms. This becomes clear by distinguishing AVERAGE PARADIGM ENTROPY from AVERAGE CONDITIONAL ENTROPY. The average entropy of a paradigm is the uncertainty in guessing the realization for a particular cell of the paradigm of a particular lexeme (given knowledge of the possible exponents). This gives one a measure of the complexity of a morphological system systems with more exponents and more inflection classes will in general have higher average paradigm entropy but it presupposes a problem that adult native speakers will never encounter. In order to know that a lexeme exists, the speaker must have heard at least one word form, so in the worst case a speaker will be faced with predicting a word form based on knowledge of one other word form of that lexeme. Thus, a better measure of morphological complexity is the average conditional entropy, the average uncertainty in guessing the realization of one randomly selected cell in the paradigm of a lexeme given the realization of one other randomly selected cell. This is the I-complexity of paradigm organization. Viewed from this information-theoretic perspective, languages that appear to differ greatly in their E-complexity the number of exponents, inflectional classes, and principal parts can actually be quite similar in terms of the challenge they pose for a language user who already knows how the system works. We adduce evidence for this hypothesis from three sources: a comparison between languages of varying degrees of E-complexity, a case study from the particularly challenging conjugational system of Chiquihuitlan Mazatec, and a Monte Carlo simulation modeling the encoding of morphosyntactic properties into formal expressions. The results of these analyses provide evidence for the crucial status of words and paradigms for understanding morphological organization.*
引用
收藏
页码:429 / 464
页数:36
相关论文
共 105 条
[1]  
Ackerman F., 2004, PROJECTING MORPHOLOG, P111
[2]  
Alberch P., 1989, Geobios Memoire Special (Villeurbanne), P21
[3]   Rules vs. analogy in English past tenses: a computational/experimental study [J].
Albright, A ;
Hayes, B .
COGNITION, 2003, 90 (02) :119-161
[4]  
Albright Adam., 2005, PARADIGMS PHONOLOGIC, P17
[5]  
Anderson S. R., 1992, Amorphous morphology
[6]  
Anderson StephenR., 1985, PHONOLOGY 20 CENTURY
[7]  
[Anonymous], 2006, Self-organization in the evolution of speech
[8]  
[Anonymous], 1993, LEXICON ACQUISITION, DOI DOI 10.1017/CBO9780511554377
[9]  
[Anonymous], FUR GRAMMAR PHONOLOG
[10]  
Aronoff M., 1993, Number 22 in Linguistic Inquiry Monographs