Analysis of Literary Works from Turkish and World Literature with Natural Language Processing

被引:0
作者
Karaca, Uyesi Mehmet Fatih [1 ]
Bayir, Uyesi Safak [2 ]
机构
[1] Tokat Gaziosmanpasa Univ, Erbaa Meslek Yuksekokulu, Bilgisayar Teknol Bolumu, Tokat, Turkey
[2] Karabuk Univ, Edebiyat Fak, Egitim Bilimleri Bolumu, Karabuk, Turkey
来源
SELCUK UNIVERSITESI EDEBIYAT FAKULTESI DERGISI-SELCUK UNIVERSITY JOURNAL OF FACULTY OF LETTERS | 2020年 / 44卷
关键词
Turkish literature; world literature; natural language processing; Turkish; grammar;
D O I
暂无
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
There are various reading books published by many publishers for the students' utilization. Publishers prepare these literary works separately for the stages of primary, secondary and high schools by considering the characteristics of the groups. Literary works prepared for the secondary school students were used in this study. 5 literary works from Turkish and 5 from world literature having the same title out of 5 publishers were analysed by means of NLP (Natural Language Processing) techniques within the setting of Turkish grammar rules. With this study, it is aimed to determine the differences and similarities among publishers in terms of composing literary works and Turkish use in them, at the same time, it is also aimed to reveal the distinctive characteristics of the Turkish use. Developed software and Zemberek, Turkish NLP Library, were used in the analysis processes. Results which were obtained via analyses of literary works were found out as similar in terms of proportion and the differences were at the low level in general among 5 publishers. Differences were only found out in terms of unanalysable word number, average sentence number of literary works, average word number in literary works, average word number in a sentence and the ratio of the use of punctuation marks (dot and comma). Despite the fact that the titles of the literary works are the same among publishers, these distinctions occurred as a result of the difference in terms of the contents.
引用
收藏
页码:379 / 404
页数:26
相关论文
共 59 条
[1]  
Akarsu C., 2016, IEEE 24 SINYAL ISLEM
[2]  
Akcay E., 2009, THESIS
[3]  
Aktan O., 2012, THESIS
[4]  
Alemdar Ozer B., 2012, THESIS
[5]  
Amasyali M. F., 2005, TURKIYE BILISIM VAKF, V1, P37
[6]  
[Anonymous], 2012, THESIS
[7]  
[Anonymous], 2007, Elektrik Muhendisligi
[8]  
[Anonymous], 2007, THESIS
[9]  
Ari G., 2014, TURKIYE SOSYAL ARAST, V173, P307, DOI [10.20296/tsad.40333., DOI 10.20296/TSAD.40333]
[10]  
Arican S., 2010, THESIS