Formation of Fuzzy Patterns in Logical Analysis of Data Using a Multi-Criteria Genetic Algorithm

Cited by: 7
Authors
Masich, Igor S. [1 ,2 ]
Kulachenko, Margarita A. [1 ]
Stanimirovic, Predrag S. [3 ]
Popov, Aleksey M. [1 ]
Tovbis, Elena M. [1 ]
Stupina, Alena A. [1 ,4 ]
Kazakovtsev, Lev A. [1 ,4 ]
Affiliations
[1] Reshetnev Siberian State Univ Sci & Technol, Inst Informat & Telecommun, 31 Krasnoyarsky Rabochy Av, Krasnoyarsk 660037, Russia
[2] Siberian Fed Univ, Inst Space & Informat Technol, 79 Svobodny Pr, Krasnoyarsk 660041, Russia
[3] Univ Nis, Fac Sci & Math, Visegradska 33, Nish 18000, Serbia
[4] Siberian Fed Univ, Inst Business Proc Management, 79 Svobodny Pr, Krasnoyarsk 660041, Russia
Source
SYMMETRY-BASEL | 2022 / Vol. 14 / Issue 03
Keywords
logical analysis of data; pattern generation; genetic algorithm; ATRIAL-FIBRILLATION PREDICTION; FAULT-DIAGNOSIS; CLASSIFICATION; OPTIMIZATION; INDUCTION;
DOI
10.3390/sym14030600
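Chinese Library Classification number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code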
07 ; 0710 ; 09 ;
Abstract
The formation of patterns is one of the main stages in logical analysis of data. Fuzzy approaches to pattern generation in logical analysis of data allow a pattern to cover not only objects of the target class, but also a certain proportion of objects of the opposite class. In this case, pattern search is an optimization problem with the maximum coverage of the target class as the objective function and some allowed coverage of the opposite class as a constraint. We propose a more flexible and symmetric optimization model which does not impose a strict restriction on the pattern's coverage of opposite-class observations. Instead, our model converts such a restriction (the purity restriction) into an additional criterion. Coverage of the target class and coverage of the opposite class are the two objective functions of the optimization problem. The search for a balance between these criteria is the essence of the proposed optimization method. We propose a modified evolutionary algorithm based on the Non-dominated Sorting Genetic Algorithm-II (NSGA-II) to solve this problem. The new algorithm uses pattern formation as an approximation of the Pareto set and takes into account the solution's representation in logical analysis of data and the informativeness of patterns. We have tested our approach on two applied medical classification problems under conditions of sample asymmetry: one class significantly dominated the other. In terms of accuracy, the classification results were comparable to and, in some cases, better than those of commonly used machine learning algorithms, without losing interpretability.
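The two-criterion formulation in the abstract can be illustrated with a minimal sketch. It is not the authors' NSGA-II-based algorithm: it only scores candidate patterns on the two objectives (coverage of the target class, to be maximized, and coverage of the opposite class, to be minimized) and keeps the non-dominated ones. The pattern encoding (a conjunction of required values for binary features), the function names, and the toy data are all assumptions made for illustration.

```python
from typing import Dict, List, Sequence, Tuple

# Assumed encoding: a pattern is a conjunction of literals over binary
# features, stored as {feature index: required value}.
Pattern = Dict[int, int]
Observation = Sequence[int]


def covers(pattern: Pattern, obs: Observation) -> bool:
    """True if the observation satisfies every literal of the pattern."""
    return all(obs[i] == v for i, v in pattern.items())


def coverage(pattern: Pattern, observations: List[Observation]) -> float:
    """Fraction of the given observations covered by the pattern."""
    if not observations:
        return 0.0
    return sum(covers(pattern, o) for o in observations) / len(observations)


def objectives(pattern: Pattern, target: List[Observation],
               opposite: List[Observation]) -> Tuple[float, float]:
    """(target-class coverage, opposite-class coverage) of a pattern."""
    return coverage(pattern, target), coverage(pattern, opposite)


def dominates(a: Tuple[float, float], b: Tuple[float, float]) -> bool:
    """a dominates b: no worse on both criteria and not identical.

    The first criterion is maximized, the second is minimized.
    """
    return a[0] >= b[0] and a[1] <= b[1] and a != b


def pareto_front(patterns: List[Pattern], target: List[Observation],
                 opposite: List[Observation]) -> List[Pattern]:
    """Keep the patterns whose objective pair is not dominated by another."""
    scored = [(p, objectives(p, target, opposite)) for p in patterns]
    return [p for p, s in scored
            if not any(dominates(t, s) for _, t in scored)]


# Toy data (hypothetical): four observations per class, three binary features.
target = [(1, 1, 0), (1, 0, 0), (1, 1, 1), (0, 1, 0)]
opposite = [(0, 0, 1), (1, 0, 1), (0, 1, 1), (0, 0, 0)]

p1 = {0: 1}         # x0 = 1
p2 = {0: 1, 2: 0}   # x0 = 1 and x2 = 0
p3 = {2: 1}         # x2 = 1

front = pareto_front([p1, p2, p3], target, opposite)
```

In this toy example, p1 covers more of the target class but also part of the opposite class, while p2 is pure but covers less. Neither dominates the other, so both stay in the front, which is the kind of trade-off a hard purity constraint would hide.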
Pages: 24