Methodology of Data Popularity Forecasting in High-Energy Physics Experiments on Unbalanced and Irregular Time-series Data

被引:0
|
作者
Grigorieva, M. A. [1 ,2 ]
Popova, N. N. [2 ]
Vartanov, D. A. [2 ]
Shubin, M. V. [2 ]
机构
[1] Moscow Ctr Fundamental & Appl Math, Moscow 119234, Russia
[2] Lomonosov Moscow State Univ, Moscow 119991, Russia
关键词
data popularity; high-energy physics; distributed computing; machine learning; predictive analytics; time series analysis;
D O I
10.1134/S1995080224603771
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
This study introduces a method to forecast data popularity in high energy physics (HEP) experiments, focusing on unbalanced and irregular time-series data. The goal is to predict the popularity of specific datasets accurately over time, which is crucial for optimizing data replication and placement strategies and enhancing distributed computing efficiency in HEP experiments. The methodology utilizes advanced machine learning techniques and time-series analysis to tackle the challenges posed by the unbalanced nature of the data. The paper outlines the key components of the methodology, including data preprocessing and balancing techniques, filtration, and model selection. To evaluate the effectiveness of the presented approach, the authors conduct experiments on real-world HEP datasets, comparing their predictions against actual data. The findings of this study have important implications for resource management and decision-making in distributed computing of various large-scale scientific projects. By providing forecasts of data popularity, researchers and administrators can efficiently allocate resources, optimize data storage and retrieval mechanisms, and improve overall data processing efficiency.
引用
收藏
页码:3072 / 3084
页数:13
相关论文
共 50 条
  • [31] Topics in statistical data analysis for high-energy physics
    Cowan, G.
    2009 EUROPEAN SCHOOL OF HIGH-ENERGY PHYSICS, 2010, : 197 - 218
  • [32] SATELLITE DATA-TRANSMISSION IN HIGH-ENERGY PHYSICS
    HINE, MGN
    COMPUTER PHYSICS COMMUNICATIONS, 1981, 22 (2-3) : 139 - 148
  • [33] Arbitrated Dynamic Ensemble with Abstaining for Time-Series Forecasting on Data Streams
    Boulegane, Dihia
    Bifet, Albert
    Madhusudan, Giyyarpuram
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 1040 - 1045
  • [34] Time-Series Forecasting of Seasonal Data Using Machine Learning Methods
    Kramar, Vadim
    Alchakov, Vasiliy
    ALGORITHMS, 2023, 16 (05)
  • [35] New model for time-series forecasting using RBFs and exogenous data
    Gorriz, JM
    Puntonet, CG
    de la Rosa, JJG
    Salmerón, M
    INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2003, : 3 - 12
  • [36] AutoMixer for Improved Multivariate Time-Series Forecasting on Business and IT Observability Data
    Palaskar, Santosh
    Ekambaram, Vijay
    Jati, Arindam
    Gantayat, Neelamadhav
    Saha, Avirup
    Nagar, Seema
    Nguyen, Nam H.
    Dayama, Pankaj
    Sindhgatta, Renuka
    Mohapatra, Prateeti
    Kumar, Harshit
    Kalagnanam, Jayant
    Hemachandra, Nandyala
    Rangaraj, Narayan
    THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 22962 - 22968
  • [37] Modeling and forecasting the COVID-19 pandemic time-series data
    Doornik, Jurgen A.
    Castle, Jennifer L.
    Hendry, David F.
    SOCIAL SCIENCE QUARTERLY, 2021, 102 (05) : 2070 - 2087
  • [38] Privacy Preserving Time-Series Forecasting of User Health Data Streams
    Imtiaz, Sana
    Horchidan, Sonia-Florina
    Abbas, Zainab
    Arsalan, Muhammad
    Chaudhry, Hassan Nazeer
    Vlassov, Vladimir
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 3428 - 3437
  • [39] Modelling and optimisation of effective hybridisation model for time-series data forecasting
    Khairalla, Mergani
    Ning, Xu
    AL-Jallad, Nashat
    JOURNAL OF ENGINEERING-JOE, 2018, (02): : 117 - 122
  • [40] Forecasting Time-Series Energy Data in Buildings Using an Additive Artificial Intelligence Model for Improving Energy Efficiency
    Ngoc-Son Truong
    Ngoc-Tri Ngo
    Anh-Duc Pham
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021