Research on Text Value and Linguistic Characteristics in Ancient Literature Based on Text Mining Technology

Authors
Wu Y. [1]
Affiliation
[1] College of Arts, Xi'an Siyuan University, Shaanxi, Xi'an
Keywords
Final word frequency ratio; Interclass dispersion; LDA; Linguistic features; TF-IDF
DOI
10.2478/amns-2024-0390
Abstract
This paper combines text mining algorithms into a new method for quantifying the characteristics of literary language, with the aim of reducing subjectivity in discussions of the value and language of ancient Chinese literature. First, the TF-IDF algorithm is improved by incorporating inter-class and intra-class dispersion, and the optimal number of topics K for the LDA topic model is determined using perplexity; together these yield a quantitative model of the value and linguistic characteristics of ancient literary texts. Second, the concept of a maximum word frequency ratio is introduced and integrated into the traditional information gain method, and an old academic text recognition algorithm based on the XGBoost model is constructed. The model was then applied to mining and analysis of an online corpus. The term “cherishing spring” ranked first with 7085 occurrences, followed by “hurt autumn” with 4598 occurrences. Among the eight topics, “natural imagery” (Topic 3) accounted for the highest proportion at 23.68%, followed by “landscape and pastoral” (Topic 7) at 16.29% and “euphemistic words” (Topic 2) at 14.54%. The method provides a new perspective and tool for the quantitative analysis of the linguistic characteristics of literary works, and it suggests a direction for further study of textual value and linguistic characteristics. © 2023 Yujun Wu, published by Sciendo.
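For reference, the baseline TF-IDF weighting that the abstract says is improved can be sketched as below. This is a minimal sketch of the textbook formula only (relative term frequency multiplied by the log of the inverse document frequency), not the paper's improved variant: the inter-class and intra-class dispersion terms are not specified in the abstract and are left out. The function name `tf_idf` and the toy corpus are assumptions made for illustration.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Textbook TF-IDF only; the paper's dispersion-based
    improvements are NOT implemented here (assumption).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            # Relative term frequency times log inverse document frequency.
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Toy corpus with hypothetical tokens, not the paper's data.
docs = [
    ["spring", "flower", "spring", "river"],
    ["autumn", "moon", "river"],
    ["spring", "moon", "wine"],
]
weights = tf_idf(docs)
```

With this weighting, a term that appears in every document receives a weight of zero, which is one reason class-aware corrections such as dispersion terms are often considered for classification tasks.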