An algorithm for classification in data mining based on classification codes

Cited by: 0
Authors
Sankar, H. Ravi [1]
Naidu, M. M. [2]
Affiliations
[1] CTRI, Rajahmundry 533105, India
[2] SV Univ, Sri Venkateswara Univ Coll Engn, Tirupati, Andhra Pradesh, India
Keywords
data mining; classification; decision tree; algorithm; database
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Classification is a key data mining technique in which database tuples, acting as training samples, are analyzed to produce a model of the given data. A number of classification techniques from statistics and machine learning have been proposed. A well-accepted method of classification is the induction of decision trees. The efficiency of existing decision tree algorithms has been established for small data sets. In a decision tree, a large number of rules must be generated to classify the given data, because the algorithm tests attribute by attribute, level by level, which is time-consuming and requires more memory for storage. In rule-based classification, all combinations of the fields in the table must be considered, which generates still more rules for classifying the given data. To overcome these drawbacks, a new algorithm is proposed that modifies the decision tree approach to classification at the data warehousing level by grouping the samples using classification codes in each branch of the tree. At run time, only the code field and the class field are transferred to main memory, which makes effective use of main memory even when the database is very large. With this construction, both the number of rules to be generated and the number of tests to be performed decrease, which speeds up execution and increases throughput. The proposed algorithm proves to be effective and efficient.
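The abstract does not say how the classification codes are constructed, so the following is only a minimal sketch of the general idea, under stated assumptions: each sample's attribute values are concatenated into a single code string, samples are grouped by that code, and only the (code, class) pairs are held in memory. The names `make_code`, `build_code_table`, and `classify`, the `|` separator, the majority-vote rule, and the toy data are all hypothetical illustrations, not the authors' method.

```python
from collections import Counter, defaultdict


def make_code(row, attributes):
    """Build a code string from a sample's attribute values (assumed encoding)."""
    return "|".join(str(row[a]) for a in attributes)


def build_code_table(rows, attributes, class_field):
    """Group samples by code and keep one class label per code.

    Only the code and the class are retained, mirroring the idea that just
    the code field and class field need to be in main memory.
    The majority-vote rule for mixed groups is an assumption.
    """
    counts = defaultdict(Counter)
    for row in rows:
        counts[make_code(row, attributes)][row[class_field]] += 1
    return {code: c.most_common(1)[0][0] for code, c in counts.items()}


def classify(code_table, row, attributes, default=None):
    """Classify a sample with a single lookup on its code."""
    return code_table.get(make_code(row, attributes), default)


# Hypothetical toy training data.
rows = [
    {"outlook": "sunny", "windy": "no", "play": "yes"},
    {"outlook": "sunny", "windy": "yes", "play": "no"},
    {"outlook": "rainy", "windy": "yes", "play": "no"},
    {"outlook": "sunny", "windy": "no", "play": "yes"},
]
attributes = ["outlook", "windy"]
code_table = build_code_table(rows, attributes, "play")

print(classify(code_table, {"outlook": "sunny", "windy": "no"}, attributes))
```

In this sketch, four training samples collapse into three code entries, and classifying a new sample takes one lookup instead of one test per attribute per tree level. That is one plausible reading of how grouping by code could reduce the number of rules and tests.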
Pages: 766 / +
Number of pages: 2