An algorithm for classification in data mining based on classification codes

被引:0
|
作者
Sankar, H. Ravi [1 ]
Naidu, M. M. [2 ]
机构
[1] CTRI, Rajahmundry 533105, India
[2] SV Univ, Sri Venkateswara Univ Coll Engn, Tirupati, Andhra Pradesh, India
关键词
data mining; classification; decision tree; algorithm; database;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification is a key data mining technique whereby database tuples acting as training samples are analyzed in order to produce a model of the given data. A number of classification techniques from the statistics and machine learning have been proposed. A well accepted method of classification is the induction of decision trees. The efficiency of existing decision tree algorithms has been established for small data sets. in decision tree, more number of rules are to be generated to classify the given data, because the algorithm performs the testing on attribute by attribute at level by level which is a time consuming and occupies more memory to store. In rule based classification, all combination of the fields in the table is to be taken to generate more rules for classifying the given data. To overcome these, a new algorithm is proposed which modifies the consideration of the decision tree for classification at the data warehousing level by grouping the samples using classification codes in each branch of the tree. At run time, only the code field and class field are transferred to main memory, which makes the effective usage of main memory, though the database is very large. With this construction, the number of rules to be generated is decreased and the number of tests to be performed also decreased which makes execution fast and increases the throughput. The proposed algorithm proves to be effective and efficient.
引用
收藏
页码:766 / +
页数:2
相关论文
共 50 条
  • [21] Data mining for data classification based on the KNN-fuzzy method supported by genetic algorithm
    Rosa, JLA
    Ebecken, NFF
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2002, 2003, 2565 : 126 - 133
  • [22] A PSO-Based classification rule mining algorithm
    Wang, Ziqiang
    Sun, Xia
    Zhang, Dexian
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2007, 4682 : 377 - 384
  • [23] An IA-based classification rule mining algorithm
    Wang, Ziqiang
    Zhang, Qingzhou
    Zhang, Dexian
    SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 3, PROCEEDINGS, 2007, : 309 - +
  • [24] A Novel Classification Algorithm Based on Association Rules Mining
    Bay Vo
    Bac Le
    KNOWLEDGE ACQUISITION: APPROACHES, ALGORITHMS AND APPLICATIONS, 2009, 5465 : 61 - +
  • [25] AClass: Classification algorithm based on association rule mining
    Computational Science and Engineering Department, Istanbul Technical University , Maslak 34469, Turkey
    WSEAS Trans. Inf. Sci. Appl., 2006, 3 (570-575):
  • [26] A supervised clustering and classification algorithm for mining data with mixed variables
    Li, XY
    Ye, N
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2006, 36 (02): : 396 - 406
  • [27] An Incremental Classification Algorithm for Mining Data with Feature Space Heterogeneity
    Wang, Yu
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [28] Customer Relationship Management Based on SPRINT Classification Algorithm under Data Mining Technology
    Sun, Yazhou
    Tan, Xueqing
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [29] Research on the high robustness data classification and the mining algorithm based on hierarchical clustering and KNN
    Li, Haohang
    Wang, Shen
    Tang, Rui
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND ELECTRONICS SYSTEMS (ICCES), 2016, : 1049 - 1054
  • [30] Medical Health Big Data Classification Based on KNN Classification Algorithm
    Xing, Wenchao
    Bei, Yilin
    IEEE ACCESS, 2020, 8 (28808-28819) : 28808 - 28819