Speech parts as Poisson processes

Cited by: 4
Author
Badalamenti, AF [1]
Affiliation
[1] Nathan S Kline Inst Psychiat Res, Orangeburg, NY 10962 USA
Keywords
speech; words; narrative; stochastic; Poisson; model
DOI
10.1023/A:1010465529988
Chinese Library Classification (CLC) number
H0 [Linguistics]
Subject classification codes
030303; 0501; 050102
Abstract
This paper presents evidence that six of the seven parts of speech occur in written text as Poisson processes, simple or recurring. The six are nouns, verbs, adjectives, adverbs, prepositions, and conjunctions; the seventh, the interjection, occurs too infrequently to support a model. The data consist of more than the first 5000 words of works by four major authors, coded to label the parts of speech as well as periods (sentence terminators). Sentence length is measured via the period and found to be normally distributed, with no stochastic model identified for its occurrence. The models for all six parts of speech except the noun significantly distinguish some pairs of authors, as do the models for the joint use of all word types. Any one author is significantly distinguished from any other by at least one word type, and sentence length very significantly distinguishes each author from all others. The variety of word type use, measured by Shannon entropy, builds to about 90% of its maximum possible value. This finding, together with the stochastic models and the relations among them, suggests that the noun may be a primitive organizer of written text. The rate constants for nouns are close to the fractions of maximum entropy achieved.
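The entropy measure mentioned in the abstract can be illustrated with a minimal sketch. It assumes the text has already been coded as a sequence of part-of-speech labels and that the maximum possible entropy is log2(k) for k word types, which is the standard bound for a uniform distribution. The function name `entropy_fraction`, the parameter `n_types`, and the label strings are hypothetical and are not taken from the paper, whose exact procedure is not given in this record.

```python
import math
from collections import Counter


def entropy_fraction(tags, n_types=None):
    """Return Shannon entropy of the label distribution divided by log2(k).

    tags    -- sequence of part-of-speech labels (hypothetical coding scheme)
    n_types -- number of word types k used for the maximum; defaults to the
               number of distinct labels observed (an assumption of this sketch)
    """
    counts = Counter(tags)
    total = sum(counts.values())
    k = n_types if n_types is not None else len(counts)
    if total == 0 or k < 2:
        raise ValueError("need at least one label and at least two word types")
    # Shannon entropy in bits: H = -sum(p * log2(p)) over observed labels
    h = -sum((c / total) * math.log2(c / total) for c in counts.values())
    # Maximum entropy for k equally likely types is log2(k)
    return h / math.log2(k)
```

Calling `entropy_fraction` on successively longer prefixes of a coded text would show how the fraction builds up as more words are read, which is one way to read the abstract's "builds to about 90%" statement.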
Pages: 497-527
Page count: 31