Methodology of Data Popularity Forecasting in High-Energy Physics Experiments on Unbalanced and Irregular Time-series Data

Cited by: 0
Authors
Grigorieva, M. A. [1, 2]
Popova, N. N. [2 ]
Vartanov, D. A. [2 ]
Shubin, M. V. [2 ]
Affiliations
[1] Moscow Center of Fundamental and Applied Mathematics, Moscow 119234, Russia
[2] Lomonosov Moscow State University, Moscow 119991, Russia
Keywords
data popularity; high-energy physics; distributed computing; machine learning; predictive analytics; time series analysis
DOI
10.1134/S1995080224603771
Chinese Library Classification (CLC) number
O1 [Mathematics]
Subject classification code
0701; 070101
Abstract
This study introduces a method for forecasting data popularity in high-energy physics (HEP) experiments, focusing on unbalanced and irregular time-series data. The goal is to predict the popularity of specific datasets accurately over time, which is crucial for optimizing data replication and placement strategies and for improving the efficiency of distributed computing in HEP experiments. The methodology uses machine learning techniques and time-series analysis to address the challenges posed by the unbalanced nature of the data. The paper outlines the key components of the methodology: data preprocessing and balancing techniques, filtration, and model selection. To evaluate the approach, the authors conduct experiments on real-world HEP datasets and compare their predictions against actual data. The findings have implications for resource management and decision-making in distributed computing for large-scale scientific projects. With forecasts of data popularity, researchers and administrators can allocate resources more efficiently, optimize data storage and retrieval mechanisms, and improve overall data processing efficiency.
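The abstract names preprocessing, filtration, and model selection without giving details, so the following is only a minimal sketch of what such a pipeline could look like, not the authors' method. It assumes that popularity is measured as weekly access counts, that filtration means dropping datasets with too few active weeks, and that a trailing moving average stands in for the forecasting model. All function names, thresholds, and sample data are hypothetical.

```python
from collections import Counter
from datetime import date


def weekly_counts(access_dates, start, n_weeks):
    """Bin irregular access dates into n_weeks consecutive weekly counts.

    Weeks without any access are filled with zero, which turns an
    irregular event log into a regular time series. Dates outside the
    window are ignored.
    """
    counts = Counter((d - start).days // 7 for d in access_dates)
    return [counts.get(week, 0) for week in range(n_weeks)]


def is_forecastable(series, min_active_weeks):
    """Assumed filtration rule: keep series with enough non-zero weeks."""
    return sum(1 for value in series if value > 0) >= min_active_weeks


def moving_average_forecast(series, window):
    """Placeholder model: mean of the last `window` observations."""
    tail = series[-window:]
    return sum(tail) / len(tail)


# Hypothetical access logs for two datasets (illustrative data only).
start = date(2024, 1, 1)
accesses = {
    "dataset_a": [date(2024, 1, 2), date(2024, 1, 3), date(2024, 1, 17),
                  date(2024, 1, 24), date(2024, 1, 25)],
    "dataset_b": [date(2024, 1, 10)],
}

series = {name: weekly_counts(dates, start, 4)
          for name, dates in accesses.items()}
kept = {name: s for name, s in series.items() if is_forecastable(s, 2)}
forecasts = {name: moving_average_forecast(s, 2) for name, s in kept.items()}
```

In this toy example the rarely accessed dataset is filtered out before forecasting, which illustrates one simple way to cope with unbalanced data; a real study would replace the moving average with a trained model and validate it against held-out access counts.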
Pages: 3072-3084
Page count: 13