Smart Meter Data Analytics: Systems, Algorithms, and Benchmarking

被引:37
作者
Liu, Xiufeng [1 ]
Golab, Lukasz [2 ]
Golab, Wojciech [2 ]
Ilyas, Ihab F. [2 ]
Jin, Shichao [2 ]
机构
[1] Tech Univ Denmark, DK-2800 Lyngby, Denmark
[2] Univ Waterloo, 200 Univ Ave West, Waterloo, ON N2L 3G1, Canada
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2017年 / 42卷 / 01期
关键词
Smart meters; data analytics; performance benchmarking; Hadoop; Spark; HIVE;
D O I
10.1145/3004295
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Smart electricity meters have been replacing conventional meters worldwide, enabling automated collection of fine-grained (e.g.,every 15 minutes or hourly) consumption data. A variety of smart meter analytics algorithms and applications have been proposed, mainly in the smart grid literature. However, the focus has been on what can be done with the data rather than how to do it efficiently. In this article, we examine smart meter analytics from a software performance perspective. First, we design a performance benchmark that includes common smart meter analytics tasks. These include offline feature extraction and model building as well as a framework for online anomaly detection that we propose. Second, since obtaining real smart meter data is difficult due to privacy issues, we present an algorithm for generating large realistic datasets from a small seed of real data. Third, we implement the proposed benchmark using five representative platforms: a traditional numeric computing platform (Matlab), a relational DBMS with a built-in machine learning toolkit (PostgreSQL/MADlib), a main-memory column store ('' System C ''), and two distributed data processing platforms (Hive and Spark/Spark Streaming). We compare the five platforms in terms of application development effort and performance on a multicore machine as well as a cluster of 16 commodity servers.
引用
收藏
页数:39
相关论文
共 36 条
[1]   Using pattern recognition to identify habitual behavior in residential electricity consumption [J].
Abreu, Joana M. ;
Pereira, Francisco Camara ;
Ferrao, Paulo .
ENERGY AND BUILDINGS, 2012, 49 :479-487
[2]  
Acs Gergely, 2011, Information Hiding. 13th International Conference, IH 2011. Revised Selected Papers, P118, DOI 10.1007/978-3-642-24178-9_9
[3]  
Albert Adrian, 2013, 2013 IEEE International Conference on Big Data, P723, DOI 10.1109/BigData.2013.6691644
[4]   Smart Meter Driven Segmentation: What Your Consumption Says About You [J].
Albert, Adrian ;
Rajagopal, Ram .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2013, 28 (04) :4019-4030
[5]  
Anderson Eric, 2010, Operating Systems Review, V44, P40, DOI 10.1145/1740390.1740400
[6]  
Anil C, 2013, THESIS
[7]  
[Anonymous], 2010, Phoenix dactylifera, date palm, DOI DOI 10.1145/1791314.1791316
[8]  
[Anonymous], 2005, P INT C VER LARG DAT
[9]  
Ardakanian Omid., 2014, EDBT/ICDT Workshops, V14, P140
[10]  
Arlitt M.F., 2015, INT C PERFORMANCE EN, P133