An Enhanced Pre-Processing Model for Big Data Processing: A Quality Framework

被引:0
|
作者
Lincy, Blessy Trencia S. S. [1 ]
Kumar, N. Suresh [1 ]
机构
[1] VIT Univ, Sch Comp Sci & Engn, Vellore, Tamil Nadu, India
关键词
Big data; pre-processing; relief algorithm; fast mRMR; SparkR; FEATURE-SELECTION;
D O I
暂无
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
With the ever growing trends and technologies a huge volume of data is being evolved each and every second big data has become a supreme approach in data inception, accession, processing and analyzing the heterogeneous, huge amount of data so as to derive useful insights out of it. With data and without quality there is no point in having the data. Thus, data with quality is required to use or leverage the data in a more appropriate manner. With the evolution of big data many technologies are being developed. The input to it must be processed in such a way that the quality data yields quality effective results. An effective pre-processing model is proposed in this paper for the processing of the big data. Using relief algorithm and fast mRMR together as a hybrid approach can be used for the pre-processing of the data. Analysis shows that this hybrid approach is more effective and can greatly enhance the quality of the data. This approach can yield better performance upon the big data platform using the Spark framework.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Big Data Pre-Processing: A Quality Framework
    Taleb, Ikbal
    Dssouli, Rachida
    Serhani, Mohamed Adel
    2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 191 - 198
  • [2] Pre-Processing Data In Weather Monitoring Application By Using Big Data Quality Framework
    Labeeb, Kashshaf
    Chowdhury, Kuraish Bin Quader
    Riha, Rabea Basri
    Abedin, Mohammad Zoynul
    Yesmin, Sarmila
    Khan, Mohammad Nasfikur Rahman
    PROCEEDINGS OF 2020 6TH IEEE INTERNATIONAL WOMEN IN ENGINEERING (WIE) CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (WIECON-ECE 2020), 2020, : 292 - 295
  • [3] Big Data Pre-Processing: Closing the Data Quality Enforcement Loop
    Taleb, Ikbal
    Serhani, Mohamed Adel
    2017 IEEE 6TH INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS 2017), 2017, : 498 - 501
  • [4] The application of data pre-processing technology in the geoscience big data
    Wang ChengBin
    Ma XiaoGang
    Chen JianGuo
    ACTA PETROLOGICA SINICA, 2018, 34 (02) : 303 - 313
  • [5] Survey of Pre-processing Techniques for Mining Big Data
    Hariharakrishnan, Jayaram
    Mohanavalli, S.
    Srividya
    Kumar, Sundhara K. B.
    2017 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND SIGNAL PROCESSING (ICCCSP), 2017, : 77 - 81
  • [6] A framework of irregularity enlightenment for data pre-processing in data mining
    Au, Siu-Tong
    Duan, Rong
    Hesar, Siamak G.
    Jiang, Wei
    ANNALS OF OPERATIONS RESEARCH, 2010, 174 (01) : 47 - 66
  • [7] A framework of irregularity enlightenment for data pre-processing in data mining
    Siu-Tong Au
    Rong Duan
    Siamak G. Hesar
    Wei Jiang
    Annals of Operations Research, 2010, 174 : 47 - 66
  • [8] Pre-processing and Indexing Techniques for Constellation Queries in Big Data
    Khatibi, Amir
    Porto, Fabio
    Rittmeyer, Joao Guilherme
    Ogasawara, Eduardo
    Valduriez, Patrick
    Shasha, Dennis
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2017, 2017, 10440 : 164 - 172
  • [9] Pre-processing and quality assessment of crosshole georadar data
    Tronicke, J
    Dietrich, P
    Appel, E
    GPR 2000: PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON GROUND PENETRATING RADAR, 2000, 4084 : 579 - 583
  • [10] Methods for pre-processing smartcard data to improve data quality
    Robinson, Steve
    Narayanan, Baskaran
    Toh, Nelson
    Pereira, Francisco
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2014, 49 : 43 - 58