Information categorization approach to literary authorship disputes

被引:48
作者
Yang, ACC [1 ]
Peng, CK
Yien, HW
Goldberger, AL
机构
[1] Harvard Univ, Beth Israel Deaconess Med Ctr, Sch Med, Margret & HA Rey Inst Nonlinear Dynam Med, Boston, MA 02215 USA
[2] Harvard Univ, Beth Israel Deaconess Med Ctr, Sch Med, Div Cardiovasc, Boston, MA 02215 USA
[3] Natl Yang Ming Univ, Sch Med, Taipei 112, Taiwan
[4] Taipei Vet Gen Hosp, Taipei, Taiwan
关键词
linguistic analysis; authorship; Shannon entropy;
D O I
10.1016/S0378-4371(03)00622-8
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Scientific analysis of the linguistic styles of different authors has generated considerable interest. We present a generic approach to measuring the similarity of two symbolic sequences that requires minimal background knowledge about a given human language. Our analysis is based on word rank order frequency statistics and phylogenetic tree construction. We demonstrate the applicability of this method to historic authorship questions related to the classic Chinese novel "The Dream of the Red Chamber," to the plays of William Shakespeare, and to the Federalist papers. This method may also provide a simple approach to other large databases based on their information content. (C) 2003 Elsevier B.V. All rights reserved.
引用
收藏
页码:473 / 483
页数:11
相关论文
共 21 条
[1]  
[Anonymous], [No title captured], P351
[2]  
Bloom Harold., 1998, Shakespeare: The Invention of the Human
[3]  
Brinegar C., 1963, J AM STAT ASSOC, V58, P85, DOI [10.1080/01621459.1963.10500834, DOI 10.1080/01621459.1963.10500834], DOI 10.1080/01621459.1963.10500834]
[4]   Questions of authorship: Attribution and beyond - A lecture delivered on the occasion of the Roberto Busa Award ACH-ALLC 2001, New York [J].
Burrows, J .
COMPUTERS AND THE HUMANITIES, 2003, 37 (01) :5-32
[5]  
FELSENSTEIN J, 1993, COMPUTER PROGRAM PHY
[6]   THE DISTANCE BETWEEN ZIPF PLOTS [J].
HAVLIN, S .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 1995, 216 (1-2) :148-150
[7]  
Holmes D. I., 1998, Literary & Linguistic Computing, V13, P111, DOI 10.1093/llc/13.3.111
[8]   A STYLOMETRIC ANALYSIS OF MORMON SCRIPTURE AND RELATED TEXTS [J].
HOLMES, DI .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1992, 155 :91-120
[9]  
HU S, 1985, COLLECTED CHINESE PA
[10]  
Merriam T., 2000, Literary & Linguistic Computing, V15, P157, DOI 10.1093/llc/15.2.157