Part-of-speech studies in Chinese

被引:6
作者
Wang, Lu [1 ]
机构
[1] Univ Trier, Computat Linguist & Digital Humanities, Trier, Germany
关键词
D O I
10.1080/09296174.2016.1169851
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
This paper studies parts of speech in Chinese on data taken from the Modern Chinese Dictionary (5th edition). First, the part-of-speech polyfunctionality (ambiguity) of words is determined; then the corresponding distribution and rank-frequency sequence are analysed. The Waring and right truncated modified Zipf-Alekseev distributions are successfully fitted to the data. Second, the 121 patterns that the total 3742 polyfunctional words yield are presented. The polyfunctionality of patterns distributes according to the positive Cohen-binomial distribution, while the rank-frequency sequence abides by the negative hypergeometric distribution. Third, we discuss the mechanism behind the polyfunctionality phenomenon: Chinese words diversify in the dimension of their function but not in their form as can be expected from an analytic language. The Popescu-Altmann function captures the distribution of the variants of each part of speech. Fourth, we analyse the polyfunctionality distributions of individual parts of speech. Out of the 12 parts of speech which the dictionary distinguishes, six can be modelled by the Poisson distribution, four by the mixed Poisson, and two by the Singh-Poisson distribution. In order to obtain a general form, we apply the mixed Poisson distribution for all the parts of speech by controlling one parameter. We make a first attempt to plot the polyfunctionality distributions of individual parts of speech in Ord's system, which surprisingly shows approximately a hyperbola.
引用
收藏
页码:235 / 255
页数:21
相关论文
共 15 条
  • [1] [Anonymous], PROBLEMS QUANTITATIV
  • [2] Best K.-H., 1994, Journal of Quantitative Linguistics, V1, P144
  • [3] Fan FX, 2008, GLOTTOMETRICS, V17, P66
  • [4] Hammerl R., 1990, Glottometrika, V11, P142
  • [5] Judt B., 1995, WORTARTENHAUFIGKEITE
  • [6] Kohler R., 1991, Diversification processes in language: Grammar, P47
  • [7] Kohler R., 2009, PROBLEMS QUANTITATIV
  • [8] Kohler Reinhard., 1986, LINGUISTISCHEN SYNER
  • [9] Probability Distribution of Dependencies Based on a Chinese Dependency Treebank
    Liu, Haitao
    [J]. JOURNAL OF QUANTITATIVE LINGUISTICS, 2009, 16 (03) : 256 - 273
  • [10] Evaluating goodness-of-fit of discrete distribution models in quantitative linguistics
    Macutek, Jan
    Wimmer, Gejza
    [J]. JOURNAL OF QUANTITATIVE LINGUISTICS, 2013, 20 (03) : 227 - 240