Adaptive Data Balancing Method Using Stacking Ensemble Model and Its Application to Non-Technical Loss Detection in Smart Grids

被引:8
作者
Ullah, Ashraf [1 ]
Javaid, Nadeem [1 ]
Javed, Muhammad Umar [1 ]
Kim, Byung-Seo [2 ]
Bahaj, Saeed Ali [3 ]
机构
[1] COMSATS Univ Islamabad, Dept Comp Sci, Islamabad 44000, Pakistan
[2] Hongik Univ, Dept Comp & Informat Commun Engn, Sejong 30016, South Korea
[3] Prince Sattam bin Abdulaziz Univ, Coll Business Adm, Dept Management Informat Syst, Al Kharj 11942, Saudi Arabia
基金
新加坡国家研究基金会;
关键词
ADASYN; deep learning; non-technical losses; SGCC; stacking model; smart grids; TomekLink; ELECTRICITY THEFT DETECTION; RIDGE-REGRESSION; ENERGY THEFT; FRAMEWORK;
D O I
10.1109/ACCESS.2022.3230952
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A stacking ensemble model (SEM) is proposed in this paper to identify non-technical losses. Three layers make up the proposed model. Data pre-processing is performed at the first layer, where issues of data imbalance, missing values, and data normalization are dealt with. Min-max and a simple imputer are used to handle data normalization and missing values, respectively. Besides, ADASYN and TomekLink are used in a combined form to address the problem of data imbalance. The second layer employs three different machine learning models. The models, also referred to as base classifiers, used at the second layer in the proposed SEM include the following classifiers: random forest (RF), extra tree (ET), and extreme gradient boosting (XGBoost). To accomplish the final classification using the ridge classifier, the output of the basic classifiers is ensembled at the third layer. The ridge classifier is also regarded as the meta classifier. Furthermore, the training and testing of the suggested model is aided by real-time data from the smart grid corporation of China (SGCC). The proposed model's performance is validated by multiple simulations using various performance indicators and is found to surpass the standalone classifiers in terms of ETD.
引用
收藏
页码:133244 / 133255
页数:12
相关论文
共 36 条
[1]   LSTM and Bat-Based RUSBoost Approach for Electricity Theft Detection [J].
Adil, Muhammad ;
Javaid, Nadeem ;
Qasim, Umar ;
Ullah, Ibrar ;
Shafiq, Muhammad ;
Choi, Jin-Ghoo .
APPLIED SCIENCES-BASEL, 2020, 10 (12)
[2]   GAME-THEORETIC MODELS OF ELECTRICITY THEFT DETECTION IN SMART UTILITY NETWORKS PROVIDING NEW CAPABILITIES WITH ADVANCED METERING INFRASTRUCTURE [J].
Amin, Saurabh ;
Schwartz, Galina A. ;
Cardenas, Alvaro A. ;
Sastry, S. Shankar .
IEEE CONTROL SYSTEMS MAGAZINE, 2015, 35 (01) :66-81
[3]   Towards Efficient Energy Utilization Using Big Data Analytics in Smart Cities for Electricity Theft Detection [J].
Arif, Arooj ;
Alghamdi, Turki Ali ;
Khan, Zahoor Ali ;
Javaid, Nadeem .
BIG DATA RESEARCH, 2022, 27
[4]   Big data analytics for identifying electricity theft using machine learning approaches in microgrids for smart communities [J].
Arif, Arooj ;
Javaid, Nadeem ;
Aldegheishem, Abdulaziz ;
Alrajeh, Nabil .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (17)
[5]   NTL Detection in Electric Distribution Systems Using the Maximal Overlap Discrete Wavelet-Packet Transform and Random Undersampling Boosting [J].
Avila, Nelson Fabian ;
Figueroa, Gerardo ;
Chu, Chia-Chi .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2018, 33 (06) :7171-7180
[6]  
Browniee J., 2020, COMBINE OVERSAMPLING
[7]  
Brownlee, 2020, STACKING ENSEMBLE DE
[8]  
Ceballos F., 2019, INTUITIVE EXPLANATIO
[9]   Real-time anomaly detection based on long short-Term memory and Gaussian Mixture Model [J].
Ding, Nan ;
Ma, HaoXuan ;
Gao, Huanbo ;
Ma, YanHua ;
Tan, GuoZhen .
COMPUTERS & ELECTRICAL ENGINEERING, 2019, 79
[10]  
Dzeroski S, 2004, MACH LEARN, V54, P255, DOI 10.1023/B.MAC.0000015881.36452.6e