Simple Data Augmentation Tricks for Boosting Performance on Electricity Theft Detection Tasks

被引:16
作者
Liao, Wenlong [1 ,5 ]
Yang, Zhe [1 ]
Bak-Jensen, Birgitte [1 ]
Pillai, Jayakrishnan Radhakrishna [1 ]
Von Krannichfeldt, Leandro [2 ]
Wang, Yusen [3 ]
Yang, Dechang [4 ]
机构
[1] Aalborg Univ, AAU Energy, DK-9220 Aalborg, Denmark
[2] Univ Hong Kong, Dept Elect & Elect Engn, Hong Kong 999077, Peoples R China
[3] KTH Royal Inst Technol, Sch Elect Engn & Comp Sci, S-10044 Stockholm, Sweden
[4] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[5] China Agr Univ, Sch Informat & Elect Engn, Beijing 10083, Peoples R China
基金
中国国家自然科学基金;
关键词
Data augmentation; electricity consumption reading; electricity theft detection; smart grid; smart meter; SMART; NETWORKS;
D O I
10.1109/TIA.2023.3262232
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In practical engineering, electricity theft detection is usually performed on highly imbalanced datasets (i.e., the number of fraudulent samples is much smaller than the benign ones), which limits the accuracy of the classifier. To alleviate the data imbalance problem, this article proposes simple data augmentation tricks (SDAT) to boost performance on electricity theft detection tasks. SDAT includes five simple but powerful operations: adding noises to electricity consumption readings, drifting values of electricity consumption readings, quantizing electricity consumption readings to a level set, adding a fixed value to electricity consumption readings, and adding changeable values to electricity consumption readings. In addition, eight potential tricks are also mentioned. Numerical simulations are conducted on a real-world dataset. The simulation results show that SDAT can significantly boost the performance of different classifiers, especially for small datasets. Besides, specific suggestions on how to select parameters of SDAT are provided for its migration use to other datasets.
引用
收藏
页码:4846 / 4858
页数:13
相关论文
共 26 条
[1]  
Adil M., 2022, APPL SCI, V71, P1
[2]   GAME-THEORETIC MODELS OF ELECTRICITY THEFT DETECTION IN SMART UTILITY NETWORKS PROVIDING NEW CAPABILITIES WITH ADVANCED METERING INFRASTRUCTURE [J].
Amin, Saurabh ;
Schwartz, Galina A. ;
Cardenas, Alvaro A. ;
Sastry, S. Shankar .
IEEE CONTROL SYSTEMS MAGAZINE, 2015, 35 (01) :66-81
[3]   Mitigating Drift in Time Series Data With Noise Augmentation [J].
Fields, Tonya ;
Hsieh, George ;
Chenou, Jules .
2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, :227-230
[4]  
Fons E, 2020, Arxiv, DOI arXiv:2010.15111
[5]   A Physically Inspired Data-Driven Model for Electricity Theft Detection With Smart Meter Data [J].
Gao, Yuanqi ;
Foggo, Brandon ;
Yu, Nanpeng .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (09) :5076-5088
[6]  
Gong X., 2020, ENERGIES, V13, P1
[7]   Ensemble machine learning models for the detection of energy theft [J].
Gunturi, Sravan Kumar ;
Sarkar, Dipu .
ELECTRIC POWER SYSTEMS RESEARCH, 2021, 192
[8]   An empirical survey of data augmentation for time series classification with neural networks [J].
Iwana, Brian Kenji ;
Uchida, Seiichi .
PLOS ONE, 2021, 16 (07)
[9]  
Jingge Li, 2021, 2021 IEEE 5th Conference on Energy Internet and Energy System Integration (EI2), P3132, DOI 10.1109/EI252483.2021.9712854
[10]   A Stacked Machine and Deep Learning-Based Approach for Analysing Electricity Theft in Smart Grids [J].
Khan, Inam Ullah ;
Javeid, Nadeem ;
Taylor, C. James ;
Gamage, Kelum A. A. ;
Ma, Xiandong .
IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (02) :1633-1644