Hydraulic Data Preprocessing for Machine Learning-Based Intrusion Detection in Cyber-Physical Systems

被引:6
作者
Mboweni, Ignitious V. [1 ]
Ramotsoela, Daniel T. [1 ]
Abu-Mahfouz, Adnan M. [2 ]
机构
[1] Univ Cape Town, Dept Elect Engn, ZA-7700 Cape Town, South Africa
[2] CSIR, ZA-0184 Pretoria, South Africa
关键词
critical infrastructure; critical water system infrastructure; cyber-physical systems; data preprocessing; industrial control; intrusion detection systems; machine learning; water treatment system; NOISE;
D O I
10.3390/math11081846
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The protection of critical infrastructure such as water treatment and water distribution systems is crucial for a functioning economy. The use of cyber-physical systems in these systems presents numerous vulnerabilities to attackers. To enhance security, intrusion detection systems play a crucial role in limiting damage from successful attacks. Machine learning can enhance security by analysing data patterns, but several attributes of the data can negatively impact the performance of the machine learning model. Data in critical water system infrastructure can be difficult to work with due to their complexity, variability, irregularities, and sensitivity. The data involve various measurements and can vary over time due to changes in environmental conditions and operational changes. Irregular patterns and small changes can have significant impacts on analysis and decision making, requiring effective data preprocessing techniques to handle the complexities and ensure accurate analysis. This paper explores data preprocessing techniques using a water treatment system dataset as a case study and provides preprocessing techniques specific to processing data in industrial control to yield a more informative dataset. The results showed significant improvement in accuracy, F1 score, and time to detection when using the preprocessed dataset.
引用
收藏
页数:21
相关论文
共 39 条
[1]  
Anish Halimaa A., 2019, 2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI). Proceedings, P916, DOI 10.1109/ICOEI.2019.8862784
[2]  
[Anonymous], 2018, Secure Water Treatment (SWaT) Testbed
[3]  
[Anonymous], MATHWORKS FILTER
[4]   Systematic approach to provide building occupants with feedback to reduce energy consumption [J].
Ashouri, Milad ;
Fung, Benjamin C. M. ;
Haghighat, Fariborz ;
Yoshino, Hiroshi .
ENERGY, 2020, 194
[5]   Large-scale human metabolomics studies: A strategy for data (pre-) processing and validation [J].
Bijlsma, S ;
Bobeldijk, L ;
Verheij, ER ;
Ramaker, R ;
Kochhar, S ;
Macdonald, IA ;
van Ommen, B ;
Smilde, AK .
ANALYTICAL CHEMISTRY, 2006, 78 (02) :567-574
[6]  
Breiman L, 1984, Classification and Regression Trees, DOI DOI 10.1201/9781315139470
[7]   Breaking with trends in pre-processing? [J].
Engel, Jasper ;
Gerretzen, Jan ;
Szymanska, Ewa ;
Jansen, Jeroen J. ;
Downey, Gerard ;
Blanchet, Lionel ;
Buydens, Lutgarde M. C. .
TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2013, 50 :96-106
[8]   Cyber Security and Power System Communication-Essential Parts of a Smart Grid Infrastructure [J].
Ericsson, Goran N. .
IEEE TRANSACTIONS ON POWER DELIVERY, 2010, 25 (03) :1501-1507
[9]   A Review on Data Preprocessing Techniques Toward Efficient and Reliable Knowledge Discovery From Building Operational Data [J].
Fan, Cheng ;
Chen, Meiling ;
Wang, Xinghua ;
Wang, Jiayuan ;
Huang, Bufu .
FRONTIERS IN ENERGY RESEARCH, 2021, 9
[10]   A decision-theoretic generalization of on-line learning and an application to boosting [J].
Freund, Y ;
Schapire, RE .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) :119-139