Expressing Time in English and Czech Children's Literature: A Contrastive N-gram Based Study of Typologically Distant Languages

被引：0

作者：

Sebestova, Denisa ^{[1
]}

Mala, Marketa ^{[1
]}

机构：

[1] Charles Univ Prague, Prague, Czech Republic

来源：

LANGUAGE USE AND LINGUISTIC STRUCTURE (OLINCO 2018) | 2019年 / 7卷

关键词：

n-grams; children's literature; contrastive analysis; typologically distant languages; CORPUS;

D O I：

暂无

中图分类号：

H0 [语言学];

学科分类号：

030303 ; 0501 ; 050102 ;

摘要：

The study explores the expression of time in English and Czech children's fiction using n-gram extraction. This raises the methodological question of the contribution of n-gram based approaches to language comparison. We extract 2-5-grams (i.e. continuous sequences of 2-5 words) from comparable corpora of English and Czech children's fiction. The consistently higher type/token ratios in Czech point to a higher variability of Czech, characterized by morphological variability and free word-order. The qualitative part of the analysis focuses on n-grams relating to time. While n-grams proved a useful starting point in cross-linguistic analysis, highlighting typological characteristics of the languages, the study suggests that more flexible units may be needed for exploring the means of expressing time. We propose relying on patterns which are based on partly lemmatised frequent n-grams and admit some variation.

引用

页码：469 / 483

页数：15