Deep Neural Networks for Czech Multi-label Document Classification

被引:11
作者
Lenc, Ladislav [1 ,2 ]
Kral, Pavel [1 ,2 ]
机构
[1] Univ West Bohemia, Fac Appl Sci, Dept Comp Sci & Engn, Plzen, Czech Republic
[2] Univ West Bohemia, Fac Appl Sci, NTIS New Technol Informat Soc, Plzen, Czech Republic
来源
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT II | 2018年 / 9624卷
关键词
Czech; Deep neural networks; Document classification; Multi-label; FEATURES;
D O I
10.1007/978-3-319-75487-1_36
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper is focused on automatic multi-label document classification of Czech text documents. The current approaches usually use some pre-processing which can have negative impact (loss of information, additional implementation work, etc). Therefore, we would like to omit it and use deep neural networks that learn from simple features. This choice was motivated by their successful usage in many other machine learning fields. Two different networks are compared: the first one is a standard multi-layer perceptron, while the second one is a popular convolutional network. The experiments on a Czech newspaper corpus show that both networks significantly outperform baseline method which uses a rich set of features with maximum entropy classifier. We have also shown that convolutional network gives the best results.
引用
收藏
页码:460 / 471
页数:12
相关论文
共 25 条
  • [1] [Anonymous], 2010, P PYTH SCI C
  • [2] [Anonymous], 1996, Numerical recipes in C
  • [3] [Anonymous], USING SYNTACTIC INFO
  • [4] Novel unsupervised features for Czech multi-label document classification
    [J]. Brychcín, Tomáš (brychcin@kiv.zcu.cz), 1600, Springer Verlag (8856): : 70 - 79
  • [5] Collobert R, 2011, J MACH LEARN RES, V12, P2493
  • [6] Inducing features of random fields
    DellaPietra, S
    DellaPietra, V
    Lafferty, J
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (04) : 380 - 393
  • [7] A tutorial survey of architectures, algorithms, and applications for deep learning
    Deng, Li
    [J]. APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2014, 3
  • [8] PCA document reconstruction for email classification
    Gomez, Juan Carlos
    Moens, Marie-Francine
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (03) : 741 - 751
  • [9] Evaluation of the Document Classification Approaches
    Hrala, Michal
    Kral, Pavel
    [J]. PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2013, 2013, 226 : 877 - 885
  • [10] Hrala M, 2013, LECT NOTES COMPUT SC, V8082, P343, DOI 10.1007/978-3-642-40585-3_44