Prediction with Partitioning: Big Data Analytics Using Regression Techniques

被引:0
|
作者
Saritha, K. [1 ]
Abraham, Sajimon [2 ]
机构
[1] Mahatma Gandhi Univ, Sch Comp Sci, Kottayam, Kerala, India
[2] Mahatma Gandhi Univ, Sch Management & Business Studies, Kottayam, Kerala, India
关键词
Big Data Analytics; Pradictive modeling; Exploratory Data Analysis; Linear Regression; Partitioning; CHALLENGES; TRENDS;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The cumulative growth of data from various sources has led to the era of big data. Big Data analytics give rise opportunities in designing of competitive offer packages for customers to provide reliable services, but analysis must be accurate and timely for successful decision making. For testing and analyzing Big Data, various statistical methods are developed. Traditional statistical analysis focuses on sampling for generating a predictive mode. To overcome this limitation, Big Data is partition into sub data sets and statistical analysis is employed on each subsets. As the structure of data sets are to be studied initially we have to go through various steps in statistical modeling up to Exploratory Data Analysis (EDA). Dependent variable and independent variables are identified and suitable parametric modeling is suggested. Regression techniques are used to describe the relation between dependent and independent variables. Here we focused different linear regression techniques. The performance are evaluated through simulation methods in the experimental data sets from UCI machine learning repository and its seen that multivariate linear regression shows better performance in parametric modeling.
引用
收藏
页码:208 / 214
页数:7
相关论文
共 50 条
  • [41] Survey on Sentiment Analysis based Stock Prediction using Big data Analytics
    Balaji, S. Naveen
    Paul, P. Victer
    Saravanan, R.
    2017 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2017,
  • [42] Heart Failure Prediction Models using Big Data Techniques
    Rammal, Heba F.
    Emam, Ahmed Z.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (05) : 363 - 371
  • [43] Big Educational Data Analytics, Prediction and Recommendation: A Survey
    Sun, Xuegeng
    Fu, Yuan
    Zheng, Weiyi
    Huang, Yanxia
    Li, Yuqi
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (09)
  • [44] Big Data Analytics for Prediction Modelling in Healthcare Databases
    Chauhan, Ritu
    Yafi, Eiad
    PROCEEDINGS OF THE 2021 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2021), 2021,
  • [45] A SURVEY FOR MOBILITY BIG DATA ANALYTICS FOR GEOLOCATION PREDICTION
    Xu, Guangxia
    Gao, Shiyi
    Daneshmand, Mahmoud
    Wang, Chonggang
    Liu, Yanbing
    IEEE WIRELESS COMMUNICATIONS, 2017, 24 (01) : 111 - 119
  • [46] Prediction of Heart Disease at early stage using Data Mining and Big Data Analytics: A Survey
    Banu, Salma N. K.
    Swamy, Suma
    2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2016, : 256 - 261
  • [47] Big data analytics in health care by data mining and classification techniques
    Jayasri, N. P.
    Aruna, R.
    ICT EXPRESS, 2022, 8 (02): : 250 - 257
  • [48] Tuning small analytics on Big Data: Data partitioning and secondary indexes in the Hadoop ecosystem
    Romero, Oscar
    Herrero, Victor
    Abello, Alberto
    Ferrarons, Jaume
    INFORMATION SYSTEMS, 2015, 54 : 336 - 356
  • [49] EDUCATIONAL BIG DATA ANALYTICS FOR FUTURISTIC SMART LEARNING USING DEEP LEARNING TECHNIQUES
    YU R.
    YAO T.
    BAI F.
    Scalable Computing, 2024, 25 (04): : 2728 - 2735
  • [50] Enhancing beef production and quality using big data analytics and computer vision techniques
    Rosa, Guilherme J.
    Aiken, Vera C.
    Fernandes, Arthur
    Dorea, Joao R.
    JOURNAL OF ANIMAL SCIENCE, 2020, 98 : 124 - 124