A parameter-level parallel optimization algorithm for large-scale spatio-temporal data mining

Cited by: 0
Authors
Zhiqiang Liu
Xuanhua Shi
Ligang He
Dongxiao Yu
Hai Jin
Chen Yu
Hulin Dai
Zezhao Feng
Affiliations
[1] Huazhong University of Science and Technology, National Engineering Research Center for Big Data Technology and System, Service Computing Technology and System Lab, School of Computer Science and Technology
[2] University of Warwick, Department of Computer Science
Keywords
Spatio-temporal data mining; Stochastic gradient descent; Block; Convergent rate; Redundant update;
DOI
Not available
Abstract
The goal of spatio-temporal data mining is to discover previously unknown but useful patterns in spatial and temporal data. However, the explosive growth of spatio-temporal data calls for novel, computationally efficient methods for large-scale data mining applications. Since many spatio-temporal data mining problems can be converted into optimization problems, in this paper we propose an efficient parameter-level parallel optimization algorithm for large-scale spatio-temporal data mining. Most previous optimization methods are based on gradient descent, which iteratively updates the model and provides model-level convergence control for all parameters. That is, they treat all parameters equally and keep updating all of them until every parameter has converged. However, we find that during the iterative process, the convergence rates of the model parameters differ from each other. This may cause redundant computation and reduce performance. To solve this problem, we propose a parameter-level stochastic gradient descent method (plpSGD), in which the convergence of each parameter is considered independently and only unconverged parameters are updated in each iteration. Moreover, the updating of model parameters is parallelized in plpSGD to further improve the performance of SGD. We have conducted extensive experiments to evaluate the performance of plpSGD. The experimental results show that, compared to previous SGD methods, plpSGD can significantly accelerate the convergence of SGD and achieve excellent scalability with little sacrifice of solution accuracy.
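The idea of parameter-level convergence control can be illustrated with a small sketch. This is not the authors' plpSGD implementation: the abstract does not state how convergence of a single parameter is detected, so the step-size threshold `tol` below is an assumption, as are the toy separable objective, the function names, and all constants. Parallel updating is omitted to keep the example short. The sketch only shows how freezing converged parameters removes redundant updates compared with updating every parameter until the whole model has converged.

```python
import random
from dataclasses import dataclass


@dataclass
class Result:
    weights: list     # final parameter values
    iterations: int   # SGD iterations actually run
    updates: int      # per-parameter updates actually applied


def parameter_level_sgd(samples, optimum, lr=0.5, tol=1e-6, max_iters=5000, seed=0):
    """SGD on the toy objective mean_k sum_j 0.5 * a[k][j] * (w[j] - optimum[j])**2.

    Each parameter is frozen as soon as its own update falls below `tol`
    (an assumed convergence test, not taken from the paper).
    """
    rng = random.Random(seed)
    n = len(optimum)
    w = [0.0] * n
    active = set(range(n))  # parameters not yet judged converged
    iterations = 0
    updates = 0
    while active and iterations < max_iters:
        a = rng.choice(samples)  # draw one random sample per iteration
        for j in list(active):
            step = lr * a[j] * (w[j] - optimum[j])  # stochastic gradient step
            w[j] -= step
            updates += 1
            if abs(step) < tol:  # this parameter has converged: stop updating it
                active.discard(j)
        iterations += 1
    return Result(w, iterations, updates)


def make_toy_problem(seed=1):
    """Build samples whose curvature differs strongly between parameters."""
    rng = random.Random(seed)
    scales = [1.0, 0.3, 0.1, 0.03]   # large scale = fast-converging parameter
    optimum = [2.0, -1.0, 0.5, -1.5]
    samples = [[s * rng.uniform(0.5, 1.0) for s in scales] for _ in range(200)]
    return samples, optimum


samples, optimum = make_toy_problem()
result = parameter_level_sgd(samples, optimum)
# Model-level control would update every parameter in every iteration.
model_level_updates = result.iterations * len(optimum)
print(result.iterations, result.updates, model_level_updates)
```

In this toy setting the parameters are decoupled, so freezing one does not disturb the others. In a real model the parameters interact, and a frozen parameter may need to be re-checked, which is one reason the abstract mentions a small sacrifice in solution accuracy.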
Pages: 739-765
Page count: 26