Linguistic Variation and Change in 250 Years of English Scientific Writing: A Data-Driven Approach

被引:14
作者
Bizzoni, Yuri [1 ]
Degaetano-Ortlieb, Stefania [1 ]
Fankhauser, Peter [2 ]
Teich, Elke [1 ]
机构
[1] Saarland Univ, Language Sci & Technol, Saarbrucken, Germany
[2] Inst Deutsch Sprache, Digital Linguist, Mannheim, Germany
来源
FRONTIERS IN ARTIFICIAL INTELLIGENCE | 2020年 / 3卷
关键词
linguistic change; diachronic variation in language use; register variation; evolution of Scientific English; computational language models; EVOLUTION;
D O I
10.3389/frai.2020.00073
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We trace the evolution of Scientific English through the Late Modern period to modern time on the basis of a comprehensive corpus composed of the Transactions and Proceedings of the Royal Society of London, the first and longest-running English scientific journal established in 1665. Specifically, we explore the linguistic imprints of specialization and diversification in the science domain which accumulate in the formation of "scientific language" and field-specific sublanguages/registers (chemistry, biology etc.). We pursue an exploratory, data-driven approach using state-of-the-art computational language models and combine them with selected information-theoretic measures (entropy, relative entropy) for comparing models along relevant dimensions of variation (time, register). Focusing on selected linguistic variables (lexis, grammar), we show how we deploy computational language models for capturing linguistic variation and change and discuss benefits and limitations.
引用
收藏
页数:15
相关论文
共 78 条
[1]  
Aitchison J., 2017, HDB HIST LINGUISTICS, P736, DOI [10.1002/9781405166201.ch25, DOI 10.1002/9781405166201.CH25]
[2]  
[Anonymous], 1988, Variation across Speech and Writing, DOI [DOI 10.1017/CBO9780511621024, 10.1017/CBO9780511621024]
[3]  
[Anonymous], 2008, POSTGR C CORP LING
[4]  
[Anonymous], 2011, P 5 ACL HLT WORKSH L
[5]   Register in computational language research [J].
Argamon, Shlomo Engelson .
REGISTER STUDIES, 2019, 1 (01) :100-135
[6]  
Atkinson D., 1998, Scientific Discourse in Sociohistorical Context: The Philosophical Transactions of the Royal Society of London, 1675-1975, V1st, DOI DOI 10.4324/9781410601704
[7]  
Banks David., 2008, The Development of Scientific Writing: Linguistic Features and Historical Context
[8]  
Biber D, 2016, GRAMMATICAL COMPLEXI, DOI [10.1017/CBO9780511920776, DOI 10.1017/CBO9780511920776]
[9]  
Biber D, 2011, STUD CORPUS LINGUIST, V47, P11
[10]  
Bizzoni Y., 2019, P 22 NORD C COMP LIN