From big data to smart data: a sample gradient descent approach for machine learning

Cited by: 2
Authors
Ganie, Aadil Gani [1]
Dadvandipour, Samad [1]
Affiliations
[1] Univ Miskolc, H-3515 Miskolc, Hungary
Keywords
Big data; Gradient descent; Machine learning; PCA; Loss function
DOI
10.1186/s40537-023-00839-9
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
This paper presents a variant of gradient descent called "Sample Gradient Descent". The method modifies the conventional batch gradient descent algorithm, which often suffers from high space and time complexity. The proposed approach selects a representative sample of the data and then runs batch gradient descent on that sample. Selecting the sample is the crucial step, since it must accurately represent the entire dataset. To this end, the study applies Principal Component Analysis (PCA) to the training data and retains only those rows and columns that explain 90% of the overall variance. This approach results in a convex loss function, for which a global minimum can be readily attained. In our experiments, both approaches were run for 30 epochs, with each epoch taking approximately 3.41 s. Sample Gradient Descent converged in 8 epochs, whereas the conventional batch gradient descent algorithm required 20 epochs to converge. This faster convergence, together with the reduced computation time, indicates that the proposed method is more efficient than conventional batch gradient descent. These findings point to the potential utility of Sample Gradient Descent across diverse domains, ranging from machine learning to optimization problems, and make the algorithm appealing to practitioners and researchers seeking more efficient gradient descent optimization.
Pages: 13
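
The procedure described in the abstract (PCA-based sample selection followed by batch gradient descent) might be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. The abstract does not say how rows are chosen, so the row-selection rule here (keep the rows with the largest energy in the reduced space until 90% of it is covered) is an assumption. The linear model, mean-squared-error loss, learning rate, synthetic data, and the function names `select_sample` and `batch_gradient_descent` are likewise hypothetical.

```python
import numpy as np


def select_sample(X, y, variance_threshold=0.90):
    """Reduce columns with PCA and keep a subset of rows.

    ASSUMPTION: the row-selection rule is illustrative only; the abstract
    does not specify how rows are chosen.
    """
    mean = X.mean(axis=0)
    Xc = X - mean
    # PCA via SVD: squared singular values are proportional to the
    # variance explained by each principal component.
    _, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    explained = S**2 / np.sum(S**2)
    # Smallest number of components whose cumulative share reaches the threshold.
    k = int(np.searchsorted(np.cumsum(explained), variance_threshold)) + 1
    k = min(k, len(S))
    scores = Xc @ Vt[:k].T  # data projected onto the first k components

    # Rows: rank by squared norm in the reduced space and keep the
    # smallest set that covers the same share of the total.
    contrib = np.sum(scores**2, axis=1)
    order = np.argsort(contrib)[::-1]
    cumulative = np.cumsum(contrib[order]) / contrib.sum()
    m = int(np.searchsorted(cumulative, variance_threshold)) + 1
    rows = order[:min(m, len(order))]
    return scores[rows], y[rows]


def batch_gradient_descent(X, y, lr=0.1, epochs=30):
    """Full-batch gradient descent on mean squared error for a linear model."""
    n, d = X.shape
    w = np.zeros(d)
    b = 0.0
    losses = []
    for _ in range(epochs):
        residual = X @ w + b - y
        losses.append(float(np.mean(residual**2)))
        # Gradient of mean((Xw + b - y)^2) with respect to w and b.
        w -= lr * (2.0 / n) * (X.T @ residual)
        b -= lr * (2.0 / n) * residual.sum()
    return w, b, losses


# Synthetic data (hypothetical): five features with very different variances.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 5)) * np.array([1.0, 0.8, 0.6, 0.1, 0.05])
y = X @ np.array([1.5, -2.0, 0.5, 0.0, 0.0]) + 0.01 * rng.normal(size=500)

X_sample, y_sample = select_sample(X, y)
w, b, losses = batch_gradient_descent(X_sample, y_sample)
print("full data:", X.shape, "sample:", X_sample.shape)
print("first loss:", losses[0], "last loss:", losses[-1])
```

Each epoch of batch gradient descent costs time proportional to rows times columns, so any speedup in this sketch comes from the smaller matrix. Whether the reduced sample preserves model quality depends on the data and on the selection rule, which the sketch does not evaluate.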