Electricity Theft Detection Using Supervised Learning Techniques on Smart Meter Data

Cited by: 63
Authors
Khan, Zahoor Ali [1]
Adil, Muhammad [2]
Javaid, Nadeem [2]
Saqib, Malik Najmus [3]
Shafiq, Muhammad [4]
Choi, Jin-Ghoo [4]
Affiliations
[1] Higher Coll Technol, Comp Informat Sci, Fujairah 4114, U Arab Emirates
[2] COMSATS Univ Islamabad, Dept Elect & Comp Engn, Islamabad 44000, Pakistan
[3] Univ Jeddah, Coll Comp Sci & Engn, Dept Cybersecur, Jeddah 21959, Saudi Arabia
[4] Yeungnam Univ, Dept Informat & Commun Engn, Gyongsan 38541, Gyeongbuk, South Korea
Funding
National Research Foundation, Singapore;
Keywords
data pre-processing; electricity theft; imbalanced data; parameter tuning; smart grid; FRAMEWORK;
DOI
10.3390/su12198023
Chinese Library Classification number
X [Environmental Science, Safety Science];
Discipline classification code
08 ; 0830 ;
Abstract
As the number of electricity thieves grows, electric utilities struggle to supply electricity to their consumers efficiently. Accurate Electricity Theft Detection (ETD) is challenging because existing techniques classify imbalanced electricity consumption data poorly, overfit, and suffer from a high False Positive Rate (FPR). Intensified research is therefore needed to detect electricity thieves accurately and to recover the large revenue losses of utility companies. To address these limitations, this paper presents a new model based on supervised machine learning techniques and real electricity consumption data. Initially, the electricity data are pre-processed using interpolation, the three-sigma rule, and normalization. Since the distribution of labels in the electricity consumption data is imbalanced, the Adaptive Synthetic (ADASYN) sampling algorithm is used to address the class imbalance problem. It serves two objectives. Firstly, it intelligently increases the number of minority class samples in the data. Secondly, it prevents the model from being biased towards the majority class samples. Afterwards, the balanced data are fed into a Visual Geometry Group (VGG-16) module to detect abnormal patterns in electricity consumption. Finally, a Firefly Algorithm based Extreme Gradient Boosting (FA-XGBoost) technique is used for classification. Simulations are conducted to evaluate the performance of the proposed model. Moreover, state-of-the-art methods are implemented for comparative analysis, i.e., Support Vector Machine (SVM), Convolutional Neural Network (CNN), and Logistic Regression (LR). For validation, precision, recall, F1-score, Matthews Correlation Coefficient (MCC), Receiver Operating Characteristic Area Under Curve (ROC-AUC), and Precision-Recall Area Under Curve (PR-AUC) metrics are used.
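The three pre-processing steps named in the abstract (interpolation, three-sigma rule, normalization) can be illustrated with a minimal sketch. The abstract does not give the exact formulas, so the choices below are assumptions: missing readings are filled by linear interpolation between the nearest known neighbours, values above mean + 3 standard deviations are capped at that threshold, and the result is min-max scaled to [0, 1]. The function name `preprocess` is hypothetical and is not taken from the paper.

```python
import math
from statistics import fmean, pstdev


def preprocess(readings):
    """Illustrative pre-processing of one consumer's consumption series.

    Assumed steps (not confirmed by the abstract): linear interpolation of
    NaN readings, capping at mean + 3*std, then min-max scaling to [0, 1].
    """
    x = [float(v) for v in readings]
    known = [i for i, v in enumerate(x) if not math.isnan(v)]
    if not known:
        raise ValueError("series has no valid readings")

    # 1) Interpolation: fill each missing reading from its nearest known neighbours.
    for i, v in enumerate(x):
        if math.isnan(v):
            left = max((k for k in known if k < i), default=None)
            right = min((k for k in known if k > i), default=None)
            if left is None:            # gap at the start: copy the first known value
                x[i] = x[right]
            elif right is None:         # gap at the end: copy the last known value
                x[i] = x[left]
            else:                       # interior gap: linear interpolation
                w = (i - left) / (right - left)
                x[i] = x[left] + w * (x[right] - x[left])

    # 2) Three-sigma rule: cap readings that exceed mean + 3 * standard deviation.
    upper = fmean(x) + 3.0 * pstdev(x)
    x = [min(v, upper) for v in x]

    # 3) Min-max normalization to the range [0, 1].
    lo, hi = min(x), max(x)
    if hi == lo:
        return [0.0] * len(x)
    return [(v - lo) / (hi - lo) for v in x]


if __name__ == "__main__":
    print(preprocess([1.0, float("nan"), 3.0, 2.0, 4.0]))
```

Applying the steps per consumer, as here, keeps one customer's scale from dominating another's; whether the paper normalizes per consumer or over the whole dataset is not stated in the abstract.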
Firstly, the simulation results show that the ADASYN method improves the performance of the FA-XGBoost classifier, which achieves an F1-score, precision, and recall of 93.7%, 92.6%, and 97%, respectively. Secondly, the VGG-16 module generalizes well, reaching accuracies of 87.2% and 83.5% on the training and testing data, respectively. Thirdly, the proposed FA-XGBoost correctly identifies actual electricity thieves, with a recall of 97%. Moreover, the model outperforms the other state-of-the-art models in handling large time series data and in classification accuracy. Utility companies can apply these models to real electricity consumption data to identify electricity thieves and reduce major revenue losses in the power sector.
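For reference, the threshold-based metrics named in the abstract (precision, recall, F1-score, FPR, and MCC) all follow from the four confusion-matrix counts. The sketch below uses their standard definitions; the function name `classification_metrics` and the example counts are hypothetical and are not results from the paper.

```python
import math


def classification_metrics(tp, fp, fn, tn):
    """Standard binary-classification metrics from confusion-matrix counts.

    tp/fp/fn/tn: true positives, false positives, false negatives,
    true negatives, with "positive" meaning an electricity thief.
    """
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    fpr = fp / (fp + tn) if fp + tn else 0.0
    # Matthews Correlation Coefficient; defined as 0 when the denominator is 0.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"precision": precision, "recall": recall,
            "f1": f1, "fpr": fpr, "mcc": mcc}


if __name__ == "__main__":
    # Hypothetical counts chosen only to illustrate the formulas.
    print(classification_metrics(tp=90, fp=10, fn=10, tn=90))
```

MCC uses all four counts, which is why it is often reported alongside F1 on imbalanced data, where accuracy alone can look good for a classifier that ignores the minority class.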
Pages: 1 / 25
Number of pages: 25