Prediction with Partitioning: Big Data Analytics Using Regression Techniques

Cited by: 0
Authors
Saritha, K. [1]
Abraham, Sajimon [2]
Affiliations
[1] Mahatma Gandhi Univ, Sch Comp Sci, Kottayam, Kerala, India
[2] Mahatma Gandhi Univ, Sch Management & Business Studies, Kottayam, Kerala, India
Keywords
Big Data Analytics; Predictive modeling; Exploratory Data Analysis; Linear Regression; Partitioning; CHALLENGES; TRENDS
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
The cumulative growth of data from various sources has led to the era of big data. Big Data analytics creates opportunities to design competitive offer packages for customers and to provide reliable services, but the analysis must be accurate and timely to support successful decision making. Various statistical methods have been developed for testing and analyzing Big Data. Traditional statistical analysis relies on sampling to generate a predictive model. To overcome this limitation, Big Data is partitioned into sub data sets and statistical analysis is applied to each subset. Because the structure of the data sets must be studied first, we go through the steps of statistical modeling up to Exploratory Data Analysis (EDA). The dependent variable and the independent variables are identified, and a suitable parametric model is suggested. Regression techniques are used to describe the relation between the dependent and independent variables. Here we focus on different linear regression techniques. Performance is evaluated through simulation on experimental data sets from the UCI Machine Learning Repository, and it is seen that multivariate linear regression shows better performance in parametric modeling.
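The partition-then-regress idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `fit_partitioned_ols`, the synthetic data, the number of partitions, and especially the step that averages the per-partition coefficients are all assumptions made for illustration, since the abstract does not say how the per-subset results are combined.

```python
import numpy as np

def fit_partitioned_ols(X, y, n_partitions):
    """Fit ordinary least squares on each partition of (X, y).

    Returns the averaged coefficient vector and the per-partition
    coefficients. Averaging is an assumed combination rule, not one
    stated in the abstract.
    """
    # Prepend an intercept column so each fit estimates [b0, b1, ..., bp].
    X1 = np.column_stack([np.ones(len(X)), X])
    coefs = []
    # np.array_split cuts along rows, so X and y stay aligned.
    for X_part, y_part in zip(np.array_split(X1, n_partitions),
                              np.array_split(y, n_partitions)):
        beta, *_ = np.linalg.lstsq(X_part, y_part, rcond=None)
        coefs.append(beta)
    coefs = np.array(coefs)
    return coefs.mean(axis=0), coefs

# Synthetic data (hypothetical; stands in for a real data set).
rng = np.random.default_rng(0)
X = rng.normal(size=(10_000, 3))
true_beta = np.array([2.0, 0.5, -1.0, 3.0])  # intercept followed by 3 slopes
y = true_beta[0] + X @ true_beta[1:] + rng.normal(scale=0.1, size=10_000)

beta_avg, beta_parts = fit_partitioned_ols(X, y, n_partitions=10)
print("averaged coefficients:", beta_avg)
```

Each partition is fitted independently, so the per-partition fits could in principle run in parallel; the spread of the rows of `beta_parts` gives a rough check on how stable the estimates are across subsets.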
Pages: 208 - 214
Number of pages: 7