Bayesian analysis of a multinomial sequence and homogeneity of literary style

被引:20
作者
Girón, J [1 ]
Ginebra, J
Riba, A
机构
[1] Univ Malaga, Fac Ciencias, Dept Estadist, E-29071 Malaga, Spain
[2] Univ Politecn Catalunya, Dept Estadist, Barcelona 08028, Spain
关键词
Gibbs sampler; multinomial change-point analysis; multinomial cluster analysis; stylometry; word length;
D O I
10.1198/000313005X21311
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
To help settle the debate around the authorship of Tirant lo Blanc, all words in each chapter are categorized according to their length, and the appearances of certain words are counted, thus forming two contingency tables of ordered rows. A Bayesian multinomial change-point analysis of the sequence of rows, reveals a clear stylistic boundary, estimated to be near chapters 371 and 382. A Bayesian cluster analysis of these rows confirms the existence of that boundary. and reveals a few chapters that are misclassified by the estimated change-point. The statistical evidence supports the hypotheses of one main author writing about four fifths of the book, with a second author finishing the book by filling in material. mainly at the end of it.
引用
收藏
页码:19 / 30
页数:12
相关论文
共 73 条
[1]  
[Anonymous], QUESTIIO
[2]  
[Anonymous], 2021, Bayesian Data Analysis
[3]  
[Anonymous], APPL STAT-J ROY ST C
[4]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[5]   A BAYESIAN-ANALYSIS FOR CHANGE POINT PROBLEMS [J].
BARRY, D ;
HARTIGAN, JA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (421) :309-319
[6]   Inference in model-based cluster analysis [J].
Bensmail, H ;
Celeux, G ;
Raftery, AE ;
Robert, CP .
STATISTICS AND COMPUTING, 1997, 7 (01) :1-10
[7]   BAYESIAN COMPUTATION AND STOCHASTIC-SYSTEMS [J].
BESAG, J ;
GREEN, P ;
HIGDON, D ;
MENGERSEN, K .
STATISTICAL SCIENCE, 1995, 10 (01) :3-41
[8]   NONPARAMETRIC TESTS FOR SHIFT AT AN UNKNOWN TIME POINT [J].
BHATTACHARYYA, GK ;
JOHNSON, RA .
ANNALS OF MATHEMATICAL STATISTICS, 1968, 39 (05) :1731-+
[9]  
BINDER DA, 1978, BIOMETRIKA, V65, P31, DOI 10.2307/2335273
[10]  
Binongo J. N. G., 1994, Literary & Linguistic Computing, V9, P267, DOI 10.1093/llc/9.4.267