Pre-Processing Methods of Data Mining

被引:0
|
作者
Saleem, Asma [1 ]
Asif, Khadim Hussain [1 ]
Ali, Ahmad [2 ]
Awan, Shahid Mahmood [3 ]
AlGhamdi, Mohammed A. [4 ]
机构
[1] Univ Engn & Technol, Dept Comp Sci & Engn, Lahore, Pakistan
[2] COMSATS Inst Informat Technol, Dept Biosci, Sahiwal, Pakistan
[3] Univ Engn & Technol, Al Khawarizmi Inst Comp Sci, Lahore, Pakistan
[4] Umm Al Qura Univ, Inst Innovat & Entrepreneurship, Mecca, Saudi Arabia
来源
2014 IEEE/ACM 7TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC) | 2014年
关键词
data pre-processing; data mining; outliers; missing values;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Data generation, handling and its processing have emerged as the most reliable source of understanding and discovery of new facts, knowledge and products in the world of natural and material sciences. The emergence of the most efficient techniques in statistical or bioinformatics situations has therefore become a routine practice in research and industrial sectors. Under practical conditions, dealing with large datasets, it's likely to have inconsistencies and anomalies of all kinds to prevent to know real outcomes for practical problems. For accurate data mining computer based techniques of data pre-processing offer solutions that help the data under processing to conform normal structures which in turn considerably improve the performance of machine learning algorithms. In this process, accurate determination of outliers, extreme values and filling up gaps poses formidable challenges. Multiple methodologies have therefore been developed to detect these deviated or inconsistent values called outliers. Different data pre-processing techniques discussed in this paper could offer most suitable solutions for handling missing values and outliers in all kinds of large datasets such as electric load and weather datasets.
引用
收藏
页码:451 / 456
页数:6
相关论文
共 50 条
  • [31] Data Pre-Processing by Genetic Algorithms for Bankruptcy Prediction
    Tsai, Chih-Fong
    Chou, Jui-Sheng
    2011 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2011, : 1780 - 1783
  • [32] Data Pre-Processing Method for Industrie 4.0 Applications
    Czwick, Cordula
    Anderl, Reiner
    3RD INTERNATIONAL CONFERENCE ON INDUSTRY 4.0 AND SMART MANUFACTURING, 2022, 200 : 327 - 336
  • [33] The method of data pre-processing in grey information systems
    Wu, S. X.
    Liu, S. F.
    Li, M. Q.
    2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1- 5, 2006, : 1988 - +
  • [34] Data Pre-processing Techniques for Publication Performance Analysis
    Zulkepli, Fatin Shahirah
    Ibrahin, Roliana
    Saeed, Faisal
    RECENT TRENDS IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2018, 5 : 59 - 65
  • [35] Research of VGOS baseband data pre-processing system
    Gan Jiangying
    Guo Shaoguang
    He Xuan
    Liu Cong
    Sun Zhengxiong
    Li Jiyun
    Ma Langming
    Shu Fengchun
    Zhang Xiuzhong
    CHINESE SPACE SCIENCE AND TECHNOLOGY, 2022, 42 (06) : 46 - 53
  • [36] A systematic approach for pre-processing electronic health records for mining: case study of heart disease
    Sorkhabi, Leila Baradaran
    Gharehchopogh, Farhad Soleimanian
    Shahamfar, Jafar
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2020, 24 (02) : 97 - 120
  • [37] An overview of pre-processing methods available for hyperspectral imaging applications
    Cozzolino, D.
    Williams, P. J.
    Hoffman, L. C.
    MICROCHEMICAL JOURNAL, 2023, 193
  • [38] Application for pre-processing and visualization of electrodermal activity wearable data
    Suoja, K.
    Liukkonen, J.
    Jussila, J.
    Salonius, H.
    Venho, N.
    Sillanpaa, V.
    Vuori, V.
    Helander, N.
    EMBEC & NBC 2017, 2018, 65 : 93 - 96
  • [39] A data pre-processing method based on multi-threshold
    Su-bida
    Wang-shuhua
    Wang-Jingfeng
    Zhong-Hua
    Deng-Rong
    Hua-Hao
    Yang-suhui
    INTERNATIONAL SYMPOSIUM ON OPTOELECTRONIC TECHNOLOGY AND APPLICATION 2014: OPTICAL REMOTE SENSING TECHNOLOGY AND APPLICATIONS, 2014, 9299
  • [40] Text Data Pre-Processing for Time-series Modelling
    Pomenkova, Jitka
    Korab, Petr
    Strba, David
    2023 33RD INTERNATIONAL CONFERENCE RADIOELEKTRONIKA, RADIOELEKTRONIKA, 2023,