Pre-Processing Methods of Data Mining

Cited by: 0
Authors
Saleem, Asma [1]
Asif, Khadim Hussain [1]
Ali, Ahmad [2]
Awan, Shahid Mahmood [3]
AlGhamdi, Mohammed A. [4]
Affiliations
[1] Univ Engn & Technol, Dept Comp Sci & Engn, Lahore, Pakistan
[2] COMSATS Inst Informat Technol, Dept Biosci, Sahiwal, Pakistan
[3] Univ Engn & Technol, Al Khawarizmi Inst Comp Sci, Lahore, Pakistan
[4] Umm Al Qura Univ, Inst Innovat & Entrepreneurship, Mecca, Saudi Arabia
Source
2014 IEEE/ACM 7TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC) | 2014
Keywords
data pre-processing; data mining; outliers; missing values
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Generating, handling, and processing data has emerged as the most reliable source of understanding and of discovering new facts, knowledge, and products in the natural and material sciences. Applying the most efficient techniques in statistical or bioinformatics settings has therefore become routine practice in research and industry. In practice, large datasets are likely to contain inconsistencies and anomalies of all kinds that obscure the real outcomes of practical problems. For accurate data mining, computer-based data pre-processing techniques help the data conform to normal structures, which in turn considerably improves the performance of machine learning algorithms. In this process, accurately determining outliers and extreme values and filling gaps pose formidable challenges. Multiple methodologies have therefore been developed to detect these deviating or inconsistent values, called outliers. The data pre-processing techniques discussed in this paper could offer the most suitable solutions for handling missing values and outliers in all kinds of large datasets, such as electric load and weather datasets.
Pages: 451-456
Page count: 6
Related papers
50 in total
  • [1] A framework of irregularity enlightenment for data pre-processing in data mining
    Au, Siu-Tong
    Duan, Rong
    Hesar, Siamak G.
    Jiang, Wei
    ANNALS OF OPERATIONS RESEARCH, 2010, 174 (01) : 47 - 66
  • [2] A framework of irregularity enlightenment for data pre-processing in data mining
    Siu-Tong Au
    Rong Duan
    Siamak G. Hesar
    Wei Jiang
    Annals of Operations Research, 2010, 174 : 47 - 66
  • [3] Survey of Pre-processing Techniques for Mining Big Data
    Hariharakrishnan, Jayaram
    Mohanavalli, S.
    Srividya
    Kumar, Sundhara K. B.
    2017 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND SIGNAL PROCESSING (ICCCSP), 2017: 77 - 81
  • [4] Analysis of Pre-processing and Post-processing Methods and Using Data Mining to Diagnose Heart Diseases
    Hamidi, H.
    Daraei, A.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2016, 29 (07) : 921 - 930
  • [5] Importance of Data Pre-processing in Credit Scoring Models Based on Data Mining Approaches
    Nalic, Jasmina
    Svraka, Amar
    2018 41ST INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2018: 1046 - 1051
  • [6] PRESISTANT: Data Pre-processing Assistant
    Bilalli, Besim
    Abello, Alberto
    Aluja-Banet, Tomas
    Munir, Rana Faisal
    Wrembel, Robert
    INFORMATION SYSTEMS IN THE BIG DATA ERA, 2018, 317 : 57 - 65
  • [7] Intelligent assistance for data pre-processing
    Bilalli, Besim
    Abello, Alberto
    Aluja-Banet, Tomas
    Wrembel, Robert
    COMPUTER STANDARDS & INTERFACES, 2018, 57 : 101 - 109
  • [8] A data pre-processing method for web content mining based on XML
    Zhang, Zhonglin
    Chen, Zhi
    2007 International Symposium on Computer Science & Technology, Proceedings, 2007: 525 - 528
  • [9] Data Pre-Processing Evaluation for Text Mining: Transaction/Sequence Model
    Munkova, Dasa
    Munk, Michal
    Vozar, Martin
    2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2013, 18 : 1198 - 1207
  • [10] ANALYSIS OF DATA PRE-PROCESSING METHODS FOR SENTIMENT ANALYSIS OF REVIEWS
    Parlar, Tuba
    Ozel, Selma Ayse
    Song, Fei
    COMPUTER SCIENCE-AGH, 2019, 20 (01) : 123 - 141