Prediction with Partitioning: Big Data Analytics Using Regression Techniques

被引:0
|
作者
Saritha, K. [1 ]
Abraham, Sajimon [2 ]
机构
[1] Mahatma Gandhi Univ, Sch Comp Sci, Kottayam, Kerala, India
[2] Mahatma Gandhi Univ, Sch Management & Business Studies, Kottayam, Kerala, India
关键词
Big Data Analytics; Pradictive modeling; Exploratory Data Analysis; Linear Regression; Partitioning; CHALLENGES; TRENDS;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The cumulative growth of data from various sources has led to the era of big data. Big Data analytics give rise opportunities in designing of competitive offer packages for customers to provide reliable services, but analysis must be accurate and timely for successful decision making. For testing and analyzing Big Data, various statistical methods are developed. Traditional statistical analysis focuses on sampling for generating a predictive mode. To overcome this limitation, Big Data is partition into sub data sets and statistical analysis is employed on each subsets. As the structure of data sets are to be studied initially we have to go through various steps in statistical modeling up to Exploratory Data Analysis (EDA). Dependent variable and independent variables are identified and suitable parametric modeling is suggested. Regression techniques are used to describe the relation between dependent and independent variables. Here we focused different linear regression techniques. The performance are evaluated through simulation methods in the experimental data sets from UCI machine learning repository and its seen that multivariate linear regression shows better performance in parametric modeling.
引用
收藏
页码:208 / 214
页数:7
相关论文
共 50 条
  • [1] HARDWARE PARTITIONING FOR BIG DATA ANALYTICS
    Wu, Lisa
    Barker, Raymond J.
    Kim, Martha A.
    Ross, Kenneth A.
    IEEE MICRO, 2014, 34 (03) : 109 - 119
  • [2] Big Data Analytics Using Data Mining Techniques: A Survey
    Mittal, Shweta
    Sangwan, Om Prakash
    ADVANCED INFORMATICS FOR COMPUTING RESEARCH, ICAICR 2018, PT I, 2019, 955 : 264 - 273
  • [3] Big Data Analytics using Machine Learning Techniques
    Mittal, Shweta
    Sangwan, Om Prakash
    2019 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2019), 2019, : 203 - 207
  • [4] A Framework for Pandemic Prediction Using Big Data Analytics
    Ahmed, Imran
    Ahmad, Misbah
    Jeon, Gwanggil
    Piccialli, Francesco
    BIG DATA RESEARCH, 2021, 25
  • [5] Stocks Analysis and Prediction Using Big Data Analytics
    Peng, Zhihao
    2019 INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION, BIG DATA & SMART CITY (ICITBS), 2019, : 309 - 312
  • [6] Prediction of Corrosion Rate Using Big Data Analytics
    Samudrala, Suryaprakash
    Talanki, Suresha
    Shoba, M.
    Sachin
    Varsha, S.
    Roy, Jeet
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (13): : 229 - 234
  • [7] Data-driven techniques for temperature data prediction: big data analytics approach
    Oloyede, Adamson
    Ozuomba, Simeon
    Asuquo, Philip
    Olatomiwa, Lanre
    Longe, Omowunmi Mary
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2023, 195 (02)
  • [8] Data-driven techniques for temperature data prediction: big data analytics approach
    Adamson Oloyede
    Simeon Ozuomba
    Philip Asuquo
    Lanre Olatomiwa
    Omowunmi Mary Longe
    Environmental Monitoring and Assessment, 2023, 195
  • [9] Big data analytics: six techniques
    Shu, Hong
    GEO-SPATIAL INFORMATION SCIENCE, 2016, 19 (02) : 119 - 128
  • [10] Big Data Analytics Techniques: A Survey
    Vashisht, Poonam
    Gupta, Vishal
    2015 INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND INTERNET OF THINGS (ICGCIOT), 2015, : 264 - 269