A Parallel Genetic Algorithm for Pattern Recognition in Mixed Databases

被引:2
作者
Kuri-Morales, Angel [1 ]
Sagastuy-Brena, Javier [1 ]
机构
[1] Inst Tecnol Autonomo Mexico, Rio Hondo 1, Mexico City 01000, DF, Mexico
来源
PATTERN RECOGNITION (MCPR 2017) | 2017年 / 10267卷
关键词
Categorical databases; Neural networks; Genetic algorithms; Parallel computation;
D O I
10.1007/978-3-319-59226-8_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Structured data bases may include both numerical and non-numerical attributes (categorical or CA). Databases which include CAs are called "mixed" databases (MD). Metric clustering algorithms are ineffectual when presented with MDs because, in such algorithms, the similarity between the objects is determined by measuring the differences between them, in accordance with some predefined metric. Nevertheless, the information contained in the CAs of MDs is fundamental to understand and identify the patterns therein. A practical alternative is to encode the instances of the CAs numerically. To do this we must consider the fact that there is a limited subset of codes which will preserve the patterns in the MD. To identify such pattern-preserving codes (PPC) we appeal to neural networks (NN) and genetic algorithms (GA). It is possible to identify a set of PPCs by trying out a bounded number of codes (the individuals of a GA's population) and demanding the GA to identify the best individual. Such individual is the best practical PPC for the MD. The computational complexity of this task is considerable. To decrease processing time we appeal to multi-core architectures and the implementation of multiple threads in an algorithm called ParCENG. In this paper we discuss the method and establish experimental bounds on its parameters. This will allow us to tackle larger databases in much shorter execution times.
引用
收藏
页码:13 / 21
页数:9
相关论文
共 10 条
  • [1] Barbara D., 2002, Proceedings of the Eleventh International Conference on Information and Knowledge Management. CIKM 2002, P582, DOI 10.1145/584792.584888
  • [2] Castro F., 2013, LNCS LNAI, V8266, DOI [10.1007/978-3-642-45111-9, DOI 10.1007/978-3-642-45111-9]
  • [3] Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
  • [4] Deb K., 2000, Parallel Problem Solving from Nature PPSN VI. 6th International Conference. Proceedings (Lecture Notes in Computer Science Vol.1917), P849
  • [5] Goebel Michael., 1999, SIGKDD EXPLOR NEWSL, V1, P20
  • [6] Kuri-Morales A. F., 2015, WSEAS P 6 INT C APPL, P167
  • [7] Norusis M.J., 2008, SPSS statistics 17. 0 statistical procedures companion
  • [8] CONVERGENCE ANALYSIS OF CANONICAL GENETIC ALGORITHMS
    RUDOLPH, G
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (01): : 96 - 101
  • [9] SOKAL R.R., 1985, COMPUTER ASSISTED BA, P1
  • [10] 30 YEARS OF ADAPTIVE NEURAL NETWORKS - PERCEPTRON, MADALINE, AND BACKPROPAGATION
    WIDROW, B
    LEHR, MA
    [J]. PROCEEDINGS OF THE IEEE, 1990, 78 (09) : 1415 - 1442