An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy

被引:10
作者
Diaz-Manriquez, Alan [1 ]
Bertha Rios-Alvarado, Ana [1 ]
Hugo Barron-Zambrano, Jose [1 ]
Yukary Guerrero-Melendez, Tania [1 ]
Carlos Elizondo-Leal, Juan [1 ]
机构
[1] Univ Autonoma Tamaulipas, Fac Ingn & Ciencias, Ciudad Victoria 87000, Mexico
来源
IEEE ACCESS | 2018年 / 6卷
关键词
Classification algorithms; genetic algorithms; evolutionary computation; optimization;
D O I
10.1109/ACCESS.2018.2815992
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The use of the Web has increased the creation of digital information in an accelerated way and about multiple subjects. Text classification is widely used to filter emails, classify Web pages, and organize results retrieved by Web browsers. In this paper, we propose to raise the problem of automatic classification of scientific texts as an optimization problem, which will allow obtaining groups from a data set. The use of evolutionary algorithms to solve classification problems has been a recurrent approach. However, there are a few approaches in which classification problems are solved, where the data attributes to be classified are text-type. In this way, it is proposed to use the association for computing machinery taxonomy to obtain the similarity between documents, where each document consists of a set of keywords. According to the results obtained, the algorithm is competitive, which indicates that the proposal of a knowledge-based genetic algorithm is a viable approach to solve the classification problem.
引用
收藏
页码:21552 / 21559
页数:8
相关论文
共 21 条
  • [1] Almuhareb A., 2004, P EMNLP, P1
  • [2] [Anonymous], ADAPTATION NATURAL A
  • [3] [Anonymous], 2014, GENETIC EVOLUTIONARY
  • [4] Barbosa Ricardo, 2015, 2015 45th IEEE International Conference on Dependable Systems and Networks Workshops (DSN-W), P39, DOI 10.1109/DSN-W.2015.20
  • [5] Bijalwan V., 2014, CORR, V6, P115
  • [6] CHANG YH, 2008, INT C MACH LEARN CYB, P3144
  • [7] Cui XH, 2005, 2005 IEEE SWARM INTELLIGENCE SYMPOSIUM, P185
  • [8] Document Clustering for Forensic Analysis: An Approach for Improving Computer Inspection
    da Cruz Nassif, Luis Filipe
    Hruschka, Eduardo Raul
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2013, 8 (01) : 46 - 54
  • [9] Eiben A. E., 2003, INTRO EVOLUTIONAIY C, V53
  • [10] Hong S.S., 2015, INT J ADV SOFT COMPU, V7, P2074