An algorithm for classification in data mining based on classification codes

被引:0
|
作者
Sankar, H. Ravi [1 ]
Naidu, M. M. [2 ]
机构
[1] CTRI, Rajahmundry 533105, India
[2] SV Univ, Sri Venkateswara Univ Coll Engn, Tirupati, Andhra Pradesh, India
关键词
data mining; classification; decision tree; algorithm; database;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification is a key data mining technique whereby database tuples acting as training samples are analyzed in order to produce a model of the given data. A number of classification techniques from the statistics and machine learning have been proposed. A well accepted method of classification is the induction of decision trees. The efficiency of existing decision tree algorithms has been established for small data sets. in decision tree, more number of rules are to be generated to classify the given data, because the algorithm performs the testing on attribute by attribute at level by level which is a time consuming and occupies more memory to store. In rule based classification, all combination of the fields in the table is to be taken to generate more rules for classifying the given data. To overcome these, a new algorithm is proposed which modifies the consideration of the decision tree for classification at the data warehousing level by grouping the samples using classification codes in each branch of the tree. At run time, only the code field and class field are transferred to main memory, which makes the effective usage of main memory, though the database is very large. With this construction, the number of rules to be generated is decreased and the number of tests to be performed also decreased which makes execution fast and increases the throughput. The proposed algorithm proves to be effective and efficient.
引用
收藏
页码:766 / +
页数:2
相关论文
共 50 条
  • [31] Data mining classification algorithms
    Saouabi, Mohamed
    Ezzati, Abdellah
    INTERNATIONAL JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE, 2020, 15 (01): : 389 - 394
  • [32] Classification and data mining in musicology
    Beran, J
    Classification - the Ubiquitous Challenge, 2005, : 3 - 10
  • [33] Phishing detection based Associative Classification data mining
    Abdelhamid, Neda
    Ayesh, Aladdin
    Thabtah, Fadi
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (13) : 5948 - 5959
  • [34] Clustering Based Classification in Data Mining Method Recommendation
    Kazik, Ondrej
    Peskova, Klara
    Smid, Jakub
    Neruda, Roman
    2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 2, 2013, : 356 - 361
  • [35] A new classification mining model based on the data warehouse
    Zhang, SL
    Zhang, JF
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 168 - 171
  • [36] Data mining for AMD screening: A classification based approach
    Hijazi, Mohd Hanafi Ahmad
    Coenen, Frans
    Zheng, Yalin
    International Journal of Simulation: Systems, Science and Technology, 2014, 15 (02): : 57 - 69
  • [37] A Classification Based Framework for Privacy Preserving Data Mining
    Tripathy, Animesh
    Dansana, Jayanti
    Mishra, Ranjita
    PROCEEDINGS OF THE 2012 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI'12), 2012, : 660 - 666
  • [38] Data mining based Bayesian networks for best classification
    Ouali, Abdelaziz
    Cherif, Amar Ramdane
    Krebs, Marie-Odile
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 51 (02) : 1278 - 1292
  • [39] Emerging pattern based classification in relational data mining
    Ceci, Michelangelo
    Appice, Annalisa
    Malerba, Donato
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2008, 5181 : 283 - 296
  • [40] Classification of digital teaching resources based on data mining
    Cao Z.
    Ingenierie des Systemes d'Information, 2020, 25 (04): : 521 - 526