Speech parts as Poisson processes

被引:4
作者
Badalamenti, AF [1 ]
机构
[1] Nathan S Kline Inst Psychiat Res, Orangeburg, NY 10962 USA
关键词
speech; words; narrative; stochastic; Poisson; model;
D O I
10.1023/A:1010465529988
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
This paper presents evidence that six of the seven parts of speech occur in written text as Poisson processes, simple or recurring. The six major parts are nouns, verbs, adjectives, adverbs, prepositions, and conjunctions, with the interjection occurring too infrequently to support a model. The data consist of more than the first 5000 words of works by four major authors coded to label the parts of speech, as well as periods (sentence terminators). Sentence length is measured via the period and found to be normally distributed with no stochastic model identified for its occurrence. The models for all six speech parts but the noun significantly distinguish some pairs of authors and likewise for the joint use of all words types. Any one author is significantly distinguished from any other by at least one word type and sentence length very significantly distinguishes each from all others. The variety of word type use, measured by Shannon entropy, builds to about 90% of its maximum possible value. The rate constants for nouns are close to the fractions of maximum entropy achieved. This finding together with the stochastic models and the relations among them suggest that the noun may be a primitive organizer of written text.
引用
收藏
页码:497 / 527
页数:31
相关论文
共 25 条
[11]  
Cox DR., 1965, The Theory of Stochastic Proceesses, DOI DOI 10.1016/J.PHYSA.2011
[12]  
DAHL H, 1967, WORD FREQUENCIES SPO
[13]  
FAST J, 1970, ENTROPY
[14]  
Hollander M., 1973, Nonparametric Statistical Methods
[15]  
Kendall M., 1976, TIME SERIES
[16]  
KHINCHIN AI, 1957, MATH FDN INFORMATION
[17]  
Kucera H., 1967, COMPUTATIONAL ANAL P
[18]   STOCHASTIC-ANALYSIS OF THE DURATION OF THE SPEAKER ROLE IN PSYCHOTHERAPY [J].
LANGS, RJ ;
BADALAMENTI, AF .
PERCEPTUAL AND MOTOR SKILLS, 1990, 70 (02) :675-689
[19]   FINAL NOTE ON A CLASS OF SKEW DISTRIBUTION FUNCTIONS - ANALYSIS AND CRITIQUE OF A MODEL DUE SIMON,HA [J].
MANDELBROT, B .
INFORMATION AND CONTROL, 1961, 4 (2-3) :198-&
[20]  
Martin N, 1981, MATH THEORY ENTROPY