Energy Theft Detection in Smart Grids with Genetic Algorithm-Based Feature Selection

被引:3
作者
Umair, Muhammad [1 ]
Saeed, Zafar [1 ]
Saeed, Faisal [2 ]
Ishtiaq, Hiba [1 ]
Zubair, Muhammad [1 ]
Hameed, Hala Abdel [3 ,4 ]
机构
[1] Univ Cent Punjab, Fac Informat Technol, Lahore 54590, Pakistan
[2] Birmingham City Univ, Sch Comp & Digital Technol, Dept Comp & Data Sci, DAAI Res Grp, Birmingham B4 7XG, England
[3] Fayoum Univ, Fac Comp & Informat Syst, Faiyum 63514, Egypt
[4] Taibah Univ, Khaybar Appl Coll, Medina, Saudi Arabia
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2023年 / 74卷 / 03期
关键词
Big data; data analysis; feature engineering; genetic algorithm; machine learning; ELECTRICITY THEFT; SECURITY; PRIVACY; NETWORKS;
D O I
10.32604/cmc.2023.033884
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As big data, its technologies, and application continue to advance, the Smart Grid (SG) has become one of the most successful pervasive and fixed computing platforms that efficiently uses a data-driven approach and employs efficient information and communication technology (ICT) and cloud computing. As a result of the complicated architecture of cloud com-puting, the distinctive working of advanced metering infrastructures (AMI), and the use of sensitive data, it has become challenging to make the SG secure. Faults of the SG are categorized into two main categories, Technical Losses (TLs) and Non-Technical Losses (NTLs). Hardware failure, communication issues, ohmic losses, and energy burnout during transmission and propagation of energy are TLs. NTL's are human-induced errors for malicious purposes such as attacking sensitive data and electricity theft, along with tampering with AMI for bill reduction by fraudulent customers. This research proposes a data-driven methodology based on principles of computational intelligence as well as big data analysis to identify fraudulent customers based on their load profile. In our proposed methodology, a hybrid Genetic Algorithm and Support Vector Machine (GA-SVM) model has been used to extract the relevant subset of feature data from a large and unsupervised public smart grid project dataset in London, UK, for theft detection. A subset of 26 out of 71 features is obtained with a classification accuracy of 96.6%, compared to studies conducted on small and limited datasets.
引用
收藏
页码:5431 / 5446
页数:16
相关论文
共 43 条
[1]   Feature Selection Using Particle Swarm Optimization in Intrusion Detection [J].
Ahmad, Iftikhar .
INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2015,
[2]  
Ahmad J., 2020, PROC INT C ADV EMERG, P1
[3]  
Ahmad J, 2018, PROCEEDINGS OF 2018 IEEE 17TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC 2018), P524, DOI 10.1109/ICCI-CC.2018.8482047
[4]   Energy Theft in Smart Grids: A Survey on Data-Driven Attack Strategies and Detection Methods [J].
Althobaiti, Ahlam ;
Jindal, Anish ;
Marnerides, Angelos K. ;
Roedig, Utz .
IEEE ACCESS, 2021, 9 :159291-159312
[5]  
Antmann P., 2009, REDUCING TECHNICAL N
[6]   Mobility Prediction-Based Optimisation and Encryption of Passenger Traffic-Flows Using Machine Learning [J].
Asad, Syed Muhammad ;
Ahmad, Jawad ;
Hussain, Sajjad ;
Zoha, Ahmed ;
Abbasi, Qammer Hussain ;
Imran, Muhammad Ali .
SENSORS, 2020, 20 (09)
[7]   Incentive Mechanisms for Smart Grid: State of the Art, Challenges, Open Issues, Future Directions [J].
Bhattacharya, Sweta ;
Chengoden, Rajeswari ;
Srivastava, Gautam ;
Alazab, Mamoun ;
Javed, Abdul Rehman ;
Victor, Nancy ;
Maddikunta, Praveen Kumar Reddy ;
Gadekallu, Thippa Reddy .
BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (02)
[8]  
Blazakis K., 2019, arXiv
[9]  
Cabral JE, 2008, TRANS DISTRIB CONF, P1175
[10]   A Parallel Genetic Algorithm Based Feature Selection and Parameter Optimization for Support Vector Machine [J].
Chen, Zhi ;
Lin, Tao ;
Tang, Ningjiu ;
Xia, Xin .
SCIENTIFIC PROGRAMMING, 2016, 2016