From big data to smart data: a sample gradient descent approach for machine learning

Cited by: 2
Authors
Ganie, Aadil Gani [1]
Dadvandipour, Samad [1]
Affiliations
[1] Univ Miskolc, H-3515 Miskolc, Hungary
Keywords
Big data; Gradient descent; Machine learning; PCA; Loss function
DOI
10.1186/s40537-023-00839-9
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
This paper presents a variant of gradient descent called "Sample Gradient Descent". The method modifies conventional batch gradient descent, which often suffers from high space and time complexity. Instead of the full dataset, a representative sample is selected and batch gradient descent is run on that sample. Selecting the sample is the crucial step, because it must accurately represent the entire dataset. To this end, the study applies Principal Component Analysis (PCA) to the training data and retains only those rows and columns that explain 90% of the overall variance. This approach results in a convex loss function whose global minimum can be readily attained. In the experiments, both approaches were run for 30 epochs, with each epoch taking approximately 3.41 s. "Sample Gradient Descent" converged in just 8 epochs, whereas conventional batch gradient descent required 20 epochs. The faster convergence and reduced computation time indicate that the proposed method is more efficient than conventional batch gradient descent. These findings suggest that the technique may be useful across diverse domains, ranging from machine learning to optimization problems, and make it appealing to practitioners and researchers seeking more efficient gradient descent optimization.
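The pipeline described in the abstract (PCA-based selection of a sample, then batch gradient descent on that sample) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say how rows are chosen, so `select_rows` uses a hypothetical rule (keep the highest-energy rows in the reduced space until they account for 90% of the total squared norm). The linear-regression loss, the synthetic data, the learning rate, and all function names are also assumptions made for the example.

```python
import numpy as np

def pca_reduce(X, var_keep=0.90):
    """Project X onto the fewest principal components explaining var_keep of the variance."""
    Xc = X - X.mean(axis=0)                       # center each column
    _, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    ratio = np.cumsum(S**2) / np.sum(S**2)        # cumulative explained-variance ratio
    k = min(int(np.searchsorted(ratio, var_keep) + 1), len(S))
    return Xc @ Vt[:k].T, k

def select_rows(Z, var_keep=0.90):
    """Hypothetical row rule (an assumption, not from the paper): keep the
    highest-energy rows until they hold var_keep of the total squared norm."""
    energy = np.sum(Z**2, axis=1)
    order = np.argsort(energy)[::-1]              # rows sorted by decreasing energy
    cum = np.cumsum(energy[order]) / energy.sum()
    n = min(int(np.searchsorted(cum, var_keep) + 1), len(order))
    return np.sort(order[:n])

def batch_gd(X, y, lr=0.1, epochs=30):
    """Full-batch gradient descent on mean squared error for linear regression."""
    Xb = np.hstack([np.ones((X.shape[0], 1)), X])  # prepend an intercept column
    w = np.zeros(Xb.shape[1])
    losses = []
    for _ in range(epochs):
        resid = Xb @ w - y
        losses.append(float(np.mean(resid**2)))   # loss before this update
        w -= lr * (2.0 / len(y)) * (Xb.T @ resid)
    return w, losses

# Synthetic, strongly correlated data (assumed purely for illustration).
rng = np.random.default_rng(0)
latent = rng.normal(size=(500, 3))
X = latent @ rng.normal(size=(3, 10)) + 0.1 * rng.normal(size=(500, 10))
y = X @ rng.normal(size=10) + 0.1 * rng.normal(size=500)

Z, k = pca_reduce(X)                 # column reduction at 90% variance
Z = Z / Z.std(axis=0)                # unit-variance components keep lr=0.1 stable
rows = select_rows(Z)                # row sample (hypothetical rule)

w_sample, loss_sample = batch_gd(Z[rows], y[rows])   # GD on the sample
w_full, loss_full = batch_gd(Z, y)                   # GD on all rows, for comparison
print(f"components kept: {k}, rows kept: {len(rows)} of {len(X)}")
print(f"sample loss: {loss_sample[0]:.3f} -> {loss_sample[-1]:.3f}")
print(f"full loss:   {loss_full[0]:.3f} -> {loss_full[-1]:.3f}")
```

The 30 epochs match the setting reported in the abstract. The sketch makes no attempt to reproduce the reported convergence figures, which depend on the authors' data and model.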
Pages: 13
Related papers
50 in total
  • [1] From big data to smart data: a sample gradient descent approach for machine learning
    Aadil Gani Ganie
    Samad Dadvandipour
    Journal of Big Data, 10
  • [2] RECENT TRENDS IN STOCHASTIC GRADIENT DESCENT FOR MACHINE LEARNING AND BIG DATA
    Newton, David
    Pasupathy, Raghu
    Yousefian, Farzad
    2018 WINTER SIMULATION CONFERENCE (WSC), 2018, : 366 - 380
  • [3] Coded Decentralized Learning With Gradient Descent for Big Data Analytics
    Yue, Jing
    Xiao, Ming
    IEEE COMMUNICATIONS LETTERS, 2020, 24 (02) : 362 - 366
  • [4] Big data and machine learning: A roadmap towards smart plants
    Bogdan DORNEANU
    Sushen ZHANG
    Hang RUAN
    Mohamed HESHMAT
    Ruijuan CHEN
    Vassilios S. VASSILIADIS
    Harvey ARELLANO-GARCIA
    Frontiers of Engineering Management, 2022, 9 (04) : 623 - 639
  • [5] Big data and machine learning: A roadmap towards smart plants
    Dorneanu, Bogdan
    Zhang, Sushen
    Ruan, Hang
    Heshmat, Mohamed
    Chen, Ruijuan
    Vassiliadis, Vassilios S.
    Arellano-Garcia, Harvey
    FRONTIERS OF ENGINEERING MANAGEMENT, 2022, 9 (04) : 623 - 639
  • [6] Big data and machine learning: A roadmap towards smart plants
    Bogdan Dorneanu
    Sushen Zhang
    Hang Ruan
    Mohamed Heshmat
    Ruijuan Chen
    Vassilios S. Vassiliadis
    Harvey Arellano-Garcia
    Frontiers of Engineering Management, 2022, 9 : 623 - 639
  • [7] Robust Bayesian Kernel Machine via Stein Variational Gradient Descent for Big Data
    Khanh Nguyen
    Trung Le
    Tu Dinh Nguyen
    Dinh Phung
    Webb, Geoffrey I.
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 2003 - 2011
  • [8] Machine Learning in Big Data
    Wang, Lidong
    Alexander, Cheryl Ann
    INTERNATIONAL JOURNAL OF MATHEMATICAL ENGINEERING AND MANAGEMENT SCIENCES, 2016, 1 (02) : 52 - 61
  • [9] Machine Learning on Big Data
    Condie, Tyson
    Mineiro, Paul
    Polyzotis, Neoklis
    Weimer, Markus
    2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 1242 - 1244
  • [10] Cyber Security of Smart Grids in the Context of Big Data and Machine Learning
    Dogaru, Delia Ioana
    Dumitrache, Ioan
    2019 22ND INTERNATIONAL CONFERENCE ON CONTROL SYSTEMS AND COMPUTER SCIENCE (CSCS), 2019, : 61 - 67