Methodology of Data Popularity Forecasting in High-Energy Physics Experiments on Unbalanced and Irregular Time-series Data

被引:0
|
作者
Grigorieva, M. A. [1 ,2 ]
Popova, N. N. [2 ]
Vartanov, D. A. [2 ]
Shubin, M. V. [2 ]
机构
[1] Moscow Ctr Fundamental & Appl Math, Moscow 119234, Russia
[2] Lomonosov Moscow State Univ, Moscow 119991, Russia
关键词
data popularity; high-energy physics; distributed computing; machine learning; predictive analytics; time series analysis;
D O I
10.1134/S1995080224603771
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
This study introduces a method to forecast data popularity in high energy physics (HEP) experiments, focusing on unbalanced and irregular time-series data. The goal is to predict the popularity of specific datasets accurately over time, which is crucial for optimizing data replication and placement strategies and enhancing distributed computing efficiency in HEP experiments. The methodology utilizes advanced machine learning techniques and time-series analysis to tackle the challenges posed by the unbalanced nature of the data. The paper outlines the key components of the methodology, including data preprocessing and balancing techniques, filtration, and model selection. To evaluate the effectiveness of the presented approach, the authors conduct experiments on real-world HEP datasets, comparing their predictions against actual data. The findings of this study have important implications for resource management and decision-making in distributed computing of various large-scale scientific projects. By providing forecasts of data popularity, researchers and administrators can efficiently allocate resources, optimize data storage and retrieval mechanisms, and improve overall data processing efficiency.
引用
收藏
页码:3072 / 3084
页数:13
相关论文
共 50 条
  • [41] Landslide data analysis using various time-series forecasting models
    Aggarwal, Akarsh
    Alshehri, Mohammed
    Kumar, Manoj
    Alfarraj, Osama
    Sharma, Purushottam
    Pardasani, Kamal Raj
    COMPUTERS & ELECTRICAL ENGINEERING, 2020, 88
  • [42] Irregular Trend Finder: Visualization tool for analyzing time-series big data
    Takeda, Shinnosuke
    Kobayashi, Aimi
    Kobayashi, Hiroaki
    Okubo, Saori
    Misue, Kazuo
    2012 IEEE CONFERENCE ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY (VAST), 2012, : 305 - 306
  • [43] A Network Architecture for Bidirectional Data Transfer in High-Energy Physics Experiments Using Electroabsorption Modulators
    Papadopoulos, Spyridon
    Papakonstantinou, Ioannis
    Vasey, Francois
    Troska, Jan
    Darwazeh, Izzat
    PROCEEDINGS OF THE 2011 16TH EUROPEAN CONFERENCE ON NETWORKS AND OPTICAL COMMUNICATIONS (NOC 2011), 2011, : 68 - 71
  • [44] OPTICAL-DATA PROCESSOR USING COMPUTER GENERATED HOLOGRAM FOR HIGH-ENERGY PHYSICS EXPERIMENTS
    GRESSER, J
    AMBS, P
    PROCEEDINGS OF THE SOCIETY OF PHOTO-OPTICAL INSTRUMENTATION ENGINEERS, 1983, 437 : 166 - 175
  • [45] FPGA-based, specialized trigger and data acquisition systems for high-energy physics experiments
    Pozniak, Krzysztof T.
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2010, 21 (06)
  • [46] Doubly Structured Data Synthesis for Time-Series Energy-Use Data
    Kim, Jiwoo
    Lee, Changhoon
    Jeon, Jehoon
    Choi, Jungwoong
    Kim, Joseph H. T.
    Sensors, 2024, 24 (24)
  • [47] Teaching Predictive Audit Data Analytic Techniques: Time-Series Forecasting with Transactional and Exogenous Data
    Yan, Zhaokai
    Appelbaum, Deniz
    Kogan, Alexander
    Vasarhelyi, Miklos A.
    JOURNAL OF EMERGING TECHNOLOGIES IN ACCOUNTING, 2023, 20 (01) : 169 - 194
  • [48] A novel pattern based clustering methodology for time-series microarray data
    Phan, Sieu
    Famili, Fazel
    Tang, Zoujian
    Pan, Youlian
    Liu, Ziying
    Ouyang, Junjun
    Lenferink, Anne
    O'Connor, Maureen Mc-Court
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2007, 84 (05) : 585 - 597
  • [49] A Generic Preprocessing Optimization Methodology when Predicting Time-Series Data
    Ioannis Kyriakidis
    Kostas Karatzas
    Andrew Ware
    George Papadourakis
    International Journal of Computational Intelligence Systems, 2016, 9 : 638 - 651
  • [50] A Generic Preprocessing Optimization Methodology when Predicting Time-Series Data
    Kyriakidis, Ioannis
    Karatzas, Kostas
    Ware, Andrew
    Papadourakis, George
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2016, 9 (04) : 638 - 651