Optimizing Twins Decision Tree Classification, Using Genetic Algorithms

被引:0
|
作者
Seifi, Farid [1 ]
Ahmadi, Hamed [2 ]
Kangavari, Mohammad Reza [1 ]
Lotfi, Ehsan [3 ]
Imaniyan, Sanaz [3 ]
Lagzian, Somayeh [3 ]
机构
[1] Iran Univ Sci & Technol, Dept Comp Engn, Tehran, Iran
[2] Natl Aerosp Univ Kharkiv, Dept Automated Syst Software, Kharkov, Ukraine
[3] Islamic Azad Univ, Dept Comp Engn, Mashhad, Iran
来源
PROCEEDINGS OF THE 2008 7TH IEEE INTERNATIONAL CONFERENCE ON CYBERNETIC INTELLIGENT SYSTEMS | 2008年
关键词
Massive Data; Decision Tree Classification; Data Mining; Genetic Algorithms;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Decision tree classification is one of the most practical and effective methods which is used in inductive learning. Many, different approaches, which are usually, used for decision making and prediction, have been invented to construct decision tree classifiers. These approaches try to optimize parameters such as accuracy, speed of classification. size of constructed trees, learning speed, and the amount of used memory. There is a trade off between these parameters. That is to say that optimization of one may cause obstruction in the other, hence all existing approaches try. to establish equilibrium. In this study, considering the effect of the whole data set on class assigning of any, data, we propose a new approach to construct not perfectly accurate, but less complex trees in a short time, using small amount of memory. To achieve this purpose, a multi-step process has been used. We trace the training data set twice in any, step. from the beginning to the end and vice versa, to extract the class pattern for attribute selection. Using the selected attribute, we make new branches in the tree. After making branches, the selected attribute and some records of training data set are deleted at the end of any, step. This process continues alternatively in several steps for remaining data and attributes until the tree is completely constructed. In order to have an optimized tree the parameters which we use in this algorithm are optimized using genetic algorithms. In order to compare this new approach with previous ones we used some known data sets which have been used in different researches. This approach has been compared with others based on the classification accuracy, and also the decision tree size. Experimental results show that it is efficient to use this approach particularly in cases of massive data sets, memory. restrictions or short learning time.
引用
收藏
页码:311 / +
页数:3
相关论文
共 50 条
  • [21] Using homomorphic encryption for privacy-preserving collaborative decision tree classification
    Zhan, Justin
    2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 637 - 645
  • [22] A fuzzy decision tree approach to start a genetic algorithm for data classification
    Espíndola, RP
    Ebecken, NFF
    DATA MINING V: DATA MINING, TEXT MINING AND THEIR BUSINESS APPLICATIONS, 2004, 10 : 133 - 142
  • [23] Decision tree classification: Ranking journals using IGIDI
    Shaheen, Muhammad
    Zafar, Tanveer
    Ali Khan, Sajid
    JOURNAL OF INFORMATION SCIENCE, 2020, 46 (03) : 325 - 339
  • [24] CMARPGA: Classification Based on Multiple Association Rules Using Parallel Genetic Algorithm Pruned Decision Tree
    HanChern-Tong
    Aziz, Izzatdin
    RECENT TRENDS IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2018, 5 : 554 - 560
  • [25] Optimizing preventive maintenance for mechanical components using genetic algorithms
    Tsai, YT
    Wang, KS
    Teng, HY
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2001, 74 (01) : 89 - 97
  • [26] Alternatives and challenges in optimizing industrial safety using genetic algorithms
    Martorell, S
    Sánchez, A
    Carlos, S
    Serradell, V
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2004, 86 (01) : 25 - 38
  • [27] Optimizing the structure of hierarchical mixture of experts using genetic algorithms
    Karras, DA
    Vlitakis, CE
    Boutalis, YS
    Mertzios, BG
    2004 2ND INTERNATIONAL IEEE CONFERENCE INTELLIGENT SYSTEMS, VOLS 1 AND 2, PROCEEDINGS, 2004, : 144 - 149
  • [28] Optimizing Propagation Models on Railway Communications using Genetic Algorithms
    Beire, Ana Rita
    Pita, Helder
    Cota, Nuno
    CONFERENCE ON ELECTRONICS, TELECOMMUNICATIONS AND COMPUTERS - CETC 2013, 2014, 17 : 50 - 57
  • [29] Using decision tree algorithms as a basis for a heart sound diagnosis decision support system
    Stasis, AC
    Loukis, EN
    Pavlopoulos, SA
    Koutsouris, D
    ITAB 2003: 4TH INTERNATIONAL IEEE EMBS SPECIAL TOPIC CONFERENCE ON INFORMATION TECHNOLOGY APPLICATIONS IN BIOMEDICINE, CONFERENCE PROCEEDINGS: NEW SOLUTIONS FOR NEW CHALLENGES, 2003, : 354 - 357
  • [30] Feature Selection For Text Classification Using Genetic Algorithms
    Bidi, Noria
    Elberrichi, Zakaria
    PROCEEDINGS OF 2016 8TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION & CONTROL (ICMIC 2016), 2016, : 806 - 810