Ensemble machine learning models for the detection of energy theft

被引:120
作者
Gunturi, Sravan Kumar [1 ]
Sarkar, Dipu [2 ]
机构
[1] Natl Inst Technol Nagaland, Dept Elect & Instrumentat Engn, Chumukedima 797103, Nagaland, India
[2] Natl Inst Technol Nagaland, Dept Elect & Elect Engn, Chumukedima 797103, Nagaland, India
关键词
Ensemble machine learning; Theft detection; Smart grid; NONTECHNICAL LOSS DETECTION; ELECTRICITY THEFT; SECURITY;
D O I
10.1016/j.epsr.2020.106904
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Advanced metering infrastructure allows the two-way sharing of information between smart meters and utilities. However, it makes smart grids more vulnerable to cyber-security threats such as energy theft. This study suggests ensemble machine learning (ML) models for the detection of energy theft in smart grids using customers' consumption patterns. Ensemble ML models are meta-algorithms that create a pool of several ML approaches and combine them smartly into one predictive model to reduce variance and bias. A number of algorithms, including adaptive boosting, categorical boosting, extreme-boosting, light boosting, random forest, and extra trees, were tested to find their false positive and detection rates. A data pre-processing method was employed to improve detection performance. The statistical approach of minority over-sampling was also employed to tackle overfitting. An extensive analysis based on a practical dataset of 5000 customers reveals that bagging models outperform other algorithms. The random forest and extra trees models achieve the highest area under the curve score of 0.90. The precision analysis shows that the proposed bagging methods perform better.
引用
收藏
页数:14
相关论文
共 44 条
[41]   A Tunable Fraud Detection System for Advanced Metering Infrastructure Using Short-Lived Patterns [J].
Zanetti, Marcelo ;
Jamhour, Edgard ;
Pellenz, Marcelo ;
Penna, Manoel ;
Zambenedetti, Voldi ;
Chueiri, Ivan .
IEEE TRANSACTIONS ON SMART GRID, 2019, 10 (01) :830-840
[42]   A Data-Driven Design for Fault Detection of Wind Turbines Using Random Forests and XGboost [J].
Zhang, Dahai ;
Qian, Liyang ;
Mao, Baijin ;
Huang, Can ;
Huang, Bin ;
Si, Yulin .
IEEE ACCESS, 2018, 6 :21020-21031
[43]   Electricity Theft Detection Using Generative Models [J].
Zhang, Qianru ;
Zhang, Meng ;
Chen, Tinghuan ;
Fan, Jinan ;
Yang, Zhou ;
Li, Guoqing .
2018 IEEE 30TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2018, :270-274
[44]   Vehicle Accident Risk Prediction Based on AdaBoost-SO in VANETs [J].
Zhao, Haitao ;
Yu, Hongsu ;
Li, Dapeng ;
Mao, Tianqi ;
Zhu, Hongbo .
IEEE ACCESS, 2019, 7 :14549-14557