A parameter-level parallel optimization algorithm for large-scale spatio-temporal data mining

被引:0
|
作者
Zhiqiang Liu
Xuanhua Shi
Ligang He
Dongxiao Yu
Hai Jin
Chen Yu
Hulin Dai
Zezhao Feng
机构
[1] Huazhong University of Science and Technology,National Engineering Research Center for Big Data Technology and System, Service Computing Technology and System Lab, School of Computer Science and Technology
[2] University of Warwick,Department of Computer Science
来源
关键词
Spatio-temporal data mining; Stochastic gradient descent; Block; Convergent rate; Redundant update;
D O I
暂无
中图分类号
学科分类号
摘要
The goal of spatio-temporal data mining is to discover previously unknown but useful patterns from the spatial and temporal data. However, explosive growth of the spatiotemporal data emphasizes the need for developing novel computationally efficient methods for large-scale data mining applications. Since lots of spatiotemporal data mining problems can be converted to an optimization problem, in this paper, we propose an efficient parameter-level parallel optimization algorithm for large-scale spatiotemporal data mining. In detail, most of previous optimization methods are based on gradient descent methods, which iteratively update the model and provide model-level convergence control for all parameters. Namely, they treat all parameters equally and keep updating all parameters until every parameter has converged. However, we find that during the iterative process, the convergence rates of model parameters are different from each other. This may cause redundant computation and reduce the performance. To solve this problem, we propose a parameter-level stochastic gradient descent (plpSGD), in which the convergence of each parameter is considered independently and only unconvergent parameters are updated in each iteration. Moreover, the updating of model parameters are parallelized in plpSGD to further improve the performance of SGD. We have conducted extensive experiments to evaluate the performance of plpSGD. The experimental results show that compared to previous SGD methods, plpSGD can significantly accelerate the convergence of SGD and achieve the excellent scalability with little sacrifice of the solution accuracy.
引用
收藏
页码:739 / 765
页数:26
相关论文
共 50 条
  • [1] A parameter-level parallel optimization algorithm for large-scale spatio-temporal data mining
    Liu, Zhiqiang
    Shi, Xuanhua
    He, Ligang
    Yu, Dongxiao
    Jin, Hai
    Yu, Chen
    Dai, Hulin
    Feng, Zezhao
    DISTRIBUTED AND PARALLEL DATABASES, 2020, 38 (03) : 739 - 765
  • [2] Spatio-temporal correlation mining method for large-scale traffic networks
    Fan X.
    Peng Z.
    Zheng C.
    Wang C.
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2023, 63 (09): : 1317 - 1325
  • [3] The use of cultural algorithms with evolutionary programming to control the data mining of large-scale spatio-temporal databases
    Reynolds, R
    AlShehri, H
    SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 4098 - 4103
  • [4] Mining spatio-temporal data
    Gennady Andrienko
    Donato Malerba
    Michael May
    Maguelonne Teisseire
    Journal of Intelligent Information Systems, 2006, 27 : 187 - 190
  • [5] Mining spatio-temporal data
    Andrienko, Gennady
    Malerba, Donato
    May, Michael
    Teisseire, Maguelonne
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2006, 27 (03) : 187 - 190
  • [6] Large-Scale Spatio-Temporal Patterns of Mediterranean Cephalopod Diversity
    Keller, Stefanie
    Bartolino, Valerio
    Hidalgo, Manuel
    Bitetto, Isabella
    Casciaro, Loredana
    Cuccu, Danila
    Esteban, Antonio
    Garcia, Cristina
    Garofalo, Germana
    Josephides, Marios
    Jadaud, Angelique
    Lefkaditou, Evgenia
    Maiorano, Porzia
    Manfredi, Chiara
    Marceta, Bojan
    Massut, Enric
    Micallef, Reno
    Peristeraki, Panagiota
    Relini, Giulio
    Sartor, Paolo
    Spedicato, Maria Teresa
    Tserpes, George
    Quetglas, Antoni
    PLOS ONE, 2016, 11 (01):
  • [7] Spatio-temporal models for large-scale indicators of extreme weather
    Heaton, Matthew J.
    Katzfuss, Matthias
    Ramachandar, Shahla
    Pedings, Kathryn
    Gilleland, Eric
    Mannshardt-Shamseldin, Elizabeth
    Smith, Richard L.
    ENVIRONMETRICS, 2011, 22 (03) : 294 - 303
  • [8] Mobility Genome™- A Framework for Mobility Intelligence from Large-Scale Spatio-Temporal Data
    The Anh Dang
    Deepak, Jayakumaran
    Wang, Jingxuan
    Luo, Shixin
    Jin, Yunye
    Ng, Yibin
    Lim, Aloysius
    Li, Ying
    2017 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2017, : 449 - 458
  • [9] A multi-source spatio-temporal data cube for large-scale geospatial analysis
    Gao, Fan
    Yue, Peng
    Cao, Zhipeng
    Zhao, Shuaifeng
    Shangguan, Boyi
    Jiang, Liangcun
    Hu, Lei
    Fang, Zhe
    Liang, Zheheng
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2022, 36 (09) : 1853 - 1884
  • [10] A comparative study of urban mobility patterns using large-scale spatio-temporal data
    The Anh Dang
    Chiam, Jodi
    Li, Ying
    2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, : 572 - 579