Data Lake Architecture for Distribution System Operator

被引:6
作者
Cardoso, Beatriz Batista [1 ]
Righetto, Sophia Boing [1 ]
Martins, Eduardo Luiz [1 ]
Izumida Martins, Marcos Aurelio [1 ]
Pereira, Andre Luiz [2 ]
de Francisci, Silvia [2 ]
机构
[1] CERTI Fdn, Sustainable Energy Ctr, Florianopolis, SC, Brazil
[2] New Technol Enel Distribuicao Sao Paulo, Barneri, Brazil
来源
2021 IEEE POWER & ENERGY SOCIETY INNOVATIVE SMART GRID TECHNOLOGIES CONFERENCE (ISGT) | 2021年
关键词
Data Lake; Network Digital Twin (R); Grid Digitalization;
D O I
10.1109/ISGT49243.2021.9372181
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Ignited by the advent of digital technologies, power distribution utilities are generating more and more data about their own assets and their environment. To handle this amount of data, some solutions emerge to help distribution system operators in understanding their own data and turning this Big Data into actionable insights. One of the solutions is a Data Lake. This article illustrates the architecture of a cloud-based Data Lake developed by Enel Distribuicao Sao Paulo to manage big data from systems such as GIS, SCADA O&M systems and other data generated in a Network Digital Twin (R) model in the city of Sao Paulo This Data Lake has a combination of data sources. It stores data in raw, processed, and refined format using structured, unstructured and semi-structured data. It uses tools to execute queries, searches, processing streams and to visualize data. This paper presents the design and implementation details, as well as usage scenarios of the data lake in a smart grid project.
引用
收藏
页数:5
相关论文
共 8 条
[1]  
Alley D, WHAT IS DATA INGESTI
[2]  
Lock M., 2017, Angling for Insight in Today's Data Lake.
[3]  
Lombardi M., 2019, 25 INT C EL DISTR
[4]  
Makarov V. V., 2019, MANAGEMENT LARGE SCA
[5]  
Mitsui P., 2020, DATA LAKE ARCHITECTU
[6]  
Raiu R., 2018, IEEE AIAA 37 DIG AV
[7]  
Wankowski L, 2019, DATA VIRTUALIZATION
[8]  
Yew J., 2019, State of Design Systems 2019