A parameter-level parallel optimization algorithm for large-scale spatio-temporal data mining

被引:0
|
作者
Zhiqiang Liu
Xuanhua Shi
Ligang He
Dongxiao Yu
Hai Jin
Chen Yu
Hulin Dai
Zezhao Feng
机构
[1] Huazhong University of Science and Technology,National Engineering Research Center for Big Data Technology and System, Service Computing Technology and System Lab, School of Computer Science and Technology
[2] University of Warwick,Department of Computer Science
来源
关键词
Spatio-temporal data mining; Stochastic gradient descent; Block; Convergent rate; Redundant update;
D O I
暂无
中图分类号
学科分类号
摘要
The goal of spatio-temporal data mining is to discover previously unknown but useful patterns from the spatial and temporal data. However, explosive growth of the spatiotemporal data emphasizes the need for developing novel computationally efficient methods for large-scale data mining applications. Since lots of spatiotemporal data mining problems can be converted to an optimization problem, in this paper, we propose an efficient parameter-level parallel optimization algorithm for large-scale spatiotemporal data mining. In detail, most of previous optimization methods are based on gradient descent methods, which iteratively update the model and provide model-level convergence control for all parameters. Namely, they treat all parameters equally and keep updating all parameters until every parameter has converged. However, we find that during the iterative process, the convergence rates of model parameters are different from each other. This may cause redundant computation and reduce the performance. To solve this problem, we propose a parameter-level stochastic gradient descent (plpSGD), in which the convergence of each parameter is considered independently and only unconvergent parameters are updated in each iteration. Moreover, the updating of model parameters are parallelized in plpSGD to further improve the performance of SGD. We have conducted extensive experiments to evaluate the performance of plpSGD. The experimental results show that compared to previous SGD methods, plpSGD can significantly accelerate the convergence of SGD and achieve the excellent scalability with little sacrifice of the solution accuracy.
引用
收藏
页码:739 / 765
页数:26
相关论文
共 50 条
  • [21] Obtaining scalable and accurate classification in large-scale spatio-temporal domains
    Vainer, Igor
    Kraus, Sarit
    Kaminka, Gal A.
    Slovin, Hamutal
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 29 (03) : 527 - 564
  • [22] Spatio-temporal performance of large-scale Gezira Irrigation Scheme, Sudan
    Al Zayed, Islam Sabry
    Elagib, Nadir Ahmed
    Ribbe, Lars
    Heinrich, Juergen
    AGRICULTURAL SYSTEMS, 2015, 133 : 131 - 142
  • [23] A spatio-temporal ensemble method for large-scale traffic state prediction
    Liu, Yang
    Liu, Zhiyuan
    Vu, Hai L.
    Lyu, Cheng
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2020, 35 (01) : 26 - 44
  • [24] Learning Spatio-Temporal Aggregations for Large-Scale Capacity Expansion Problems
    Brenner, Aron
    Khorramfar, Rahman
    Amin, Saurabh
    PROCEEDINGS OF THE 2023 ACM/IEEE 14TH INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS, WITH CPS-IOTWEEK 2023, 2023, : 68 - 77
  • [25] Large-Scale Distributed Linkage of Records Containing Spatio-Temporal Information
    Karapiperis, Dimitrios
    Gkoulalas-Divanis, Aris
    Verykios, Vassilios S.
    2020 IEEE INTERNATIONAL SMART CITIES CONFERENCE (ISC2), 2020,
  • [26] PRACTICAL LARGE-SCALE SPATIO-TEMPORAL MODELING OF PARTICULATE MATTER CONCENTRATIONS
    Paciorek, Christopher J.
    Yanosky, Jeff D.
    Puett, Robin C.
    Laden, Francine
    Suh, Helen H.
    ANNALS OF APPLIED STATISTICS, 2009, 3 (01): : 370 - 397
  • [27] Obtaining scalable and accurate classification in large-scale spatio-temporal domains
    Igor Vainer
    Sarit Kraus
    Gal A. Kaminka
    Hamutal Slovin
    Knowledge and Information Systems, 2011, 29 : 527 - 564
  • [28] PFNet: Large-Scale Traffic Forecasting With Progressive Spatio-Temporal Fusion
    Wang, Chen
    Zuo, Kaizhong
    Zhang, Shaokun
    Lei, Hanwen
    Hu, Peng
    Shen, Zhangyi
    Wang, Rui
    Zhao, Peize
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (12) : 14580 - 14597
  • [29] A Distributed Framework for Spatio-temporal Analysis on Large-scale Camera Networks
    Hong, Kirak
    Voelz, Marco
    Govindaraju, Venu
    Jayaraman, Bharat
    Ramachandran, Umakishore
    2013 33RD IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS (ICDCSW 2013), 2013, : 309 - 314
  • [30] Spatial effect detection regression for large-scale spatio-temporal covariates
    Zhang, Chenlin
    Zhou, Ling
    Guo, Bin
    Lin, Huazhen
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2025,