Two genre classification of Japanese literary works written by Ryuunosuke Akutagawa and Kenji Miyazawa based on word vectors

Cited by: 1
Authors
Takenaka, Shiori [1]
Kuroiwa, Jousuke [1]
Odaka, Tomohiro [1]
Affiliations
[1] Univ Fukui, Grad Sch Engn, 3-9-1 Bunkyo, Fukui 9108507, Japan
Source
IEICE NONLINEAR THEORY AND ITS APPLICATIONS | 2023 / Vol. 14 / No. 2
Keywords
word vector; word2vec; continuous bag-of-words; support vector machine; natural language processing;
DOI
10.1587/nolta.14.428
Chinese Library Classification
O1 [Mathematics];
Subject classification code
0701; 070101;
Abstract
Word vectors are applied to various tasks in natural language processing. However, the potential of word vectors for Japanese has not been discussed as much as that of word vectors for English. Therefore, the purpose of this paper is to classify the genre of modern Japanese literary works using characteristic features evaluated from word vectors. The classification accuracy was 95% between novels and poetic works, and 90% between novels and essays. These results indicate that word vectors are applicable to the genre classification problem for modern Japanese literary works.
Pages: 428-435
Page count: 8