Smart Caching in a Data Lake for High Energy Physics Analysis

被引:0
|
作者
Tommaso Tedeschi
Marco Baioletti
Diego Ciangottini
Valentina Poggioni
Daniele Spiga
Loriano Storchi
Mirco Tracolli
机构
[1] University of Perugia,Department of Physics and Geology
[2] INFN,Sezione di Perugia
[3] University of Perugia,Department of Mathematics and IT
[4] University “G. D’Annunzio” of Chieti-Pescara,Department of Pharmacy
来源
Journal of Grid Computing | 2023年 / 21卷
关键词
Reinforcement learning; Caching strategies; High energy physics; Data lake;
D O I
暂无
中图分类号
学科分类号
摘要
The continuous growth of data production in almost all scientific areas raises new problems in data access and management, especially in a scenario where the end-users, as well as the resources that they can access, are worldwide distributed. This work is focused on the data caching management in a Data Lake infrastructure in the context of the High Energy Physics field. We are proposing an autonomous method, based on Reinforcement Learning techniques, to improve the user experience and to contain the maintenance costs of the infrastructure.
引用
收藏
相关论文
共 50 条
  • [1] Smart Caching in a Data Lake for High Energy Physics Analysis
    Tedeschi, Tommaso
    Baioletti, Marco
    Ciangottini, Diego
    Poggioni, Valentina
    Spiga, Daniele
    Storchi, Loriano
    Tracolli, Mirco
    JOURNAL OF GRID COMPUTING, 2023, 21 (03)
  • [2] A caching mechanism to exploit object store speed in High Energy Physics analysis
    Eduardo Padulano, Vincenzo
    Saavedra, Enric Tejedor
    Alonso-Jorda, Pedro
    Gomez, Javier Lopez
    Blomer, Jakob
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2023, 26 (05): : 2757 - 2772
  • [3] A caching mechanism to exploit object store speed in High Energy Physics analysis
    Vincenzo Eduardo Padulano
    Enric Tejedor Saavedra
    Pedro Alonso-Jordá
    Javier López Gómez
    Jakob Blomer
    Cluster Computing, 2023, 26 : 2757 - 2772
  • [4] Data analysis in high energy physics, weird or wonderful
    Mount, Richard P.
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XVII, 2008, 394 : 57 - 66
  • [5] Using MapReduce for High Energy Physics Data Analysis
    Glaser, Fabian
    Neukirchen, Helmut
    Rings, Thomas
    Grabowski, Jens
    2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 1271 - 1278
  • [6] Using Hadoop for High Energy Physics Data Analysis
    Huang, Qiulan
    Wei, Zhanchen
    Sun, Gongxing
    Cheng, Yaodong
    Cheng, Zhenjing
    Hu, Qingbao
    BIG SCIENTIFIC DATA MANAGEMENT, 2019, 11473 : 146 - 153
  • [7] What is a data model?An anatomy of data analysis in high energy physics
    Antonis Antoniou
    European Journal for Philosophy of Science, 2021, 11
  • [8] What is a data model? An anatomy of data analysis in high energy physics
    Antoniou, Antonis
    EUROPEAN JOURNAL FOR PHILOSOPHY OF SCIENCE, 2021, 11 (04)
  • [9] Quasi interactive high throughput analysis of high energy physics data (*)
    Bartolini, M.
    Cagnotta, A.
    Diotalevi, T.
    D'onofrio, A.
    Gravili, F. giuseppe
    Simone, F. maria
    Mastrandrea, P.
    Anwar, M. numan
    Sabella, G.
    Spisso, B.
    Tarasio, A.
    Tedeschi, T.
    NUOVO CIMENTO C-COLLOQUIA AND COMMUNICATIONS IN PHYSICS, 2025, 48 (03):
  • [10] DATA-ANALYSIS TECHNIQUES IN HIGH-ENERGY PHYSICS
    JOBES, M
    SHAYLOR, HR
    REPORTS ON PROGRESS IN PHYSICS, 1972, 35 (10) : 1077 - &