Data preprocessing for anomaly based network intrusion detection: A review

被引：173

作者：

Davis, Jonathan J. ^{[1
]}

Clark, Andrew J. ^{[2
]}

机构：

[1] DSTO, Div C3I, Edinburgh, SA 5111, Australia

[2] Queensland Univ Technol, Informat Secur Inst, Brisbane, Qld 4001, Australia

来源：

COMPUTERS & SECURITY | 2011年 / 30卷 / 6-7期

关键词：

Data preprocessing; Network intrusion; Anomaly detection; Data mining; Feature construction; Feature selection; SYSTEM;

D O I：

10.1016/j.cose.2011.05.008

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Data preprocessing is widely recognized as an important stage in anomaly detection. This paper reviews the data preprocessing techniques used by anomaly-based network intrusion detection systems (NIDS), concentrating on which aspects of the network traffic are analyzed, and what feature construction and selection methods have been used. Motivation for the paper comes from the large impact data preprocessing has on the accuracy and capability of anomaly-based NIDS. The review finds that many NIDS limit their view of network traffic to the TCP/IP packet headers. Time-based statistics can be derived from these headers to detect network scans, network worm behavior, and denial of service attacks. A number of other NIDS perform deeper inspection of request packets to detect attacks against network services and network applications. More recent approaches analyze full service responses to detect attacks targeting clients. The review covers a wide range of NIDS, highlighting which classes of attack are detectable by each of these approaches. Data preprocessing is found to predominantly rely on expert domain knowledge for identifying the most relevant parts of network traffic and for constructing the initial candidate set of traffic features. On the other hand, automated methods have been widely used for feature extraction to reduce data dimensionality, and feature selection to find the most relevant subset of features from this candidate set. The review shows a trend toward deeper packet inspection to construct more relevant features through targeted content parsing. These context sensitive features are required to detect current attacks. Crown Copyright (C) 2011 Published by Elsevier Ltd. All rights reserved.

引用

页码：353 / 375

页数：23

共 74 条

[1]

[Anonymous], 2004, EFFICIENT INTRUSION

[2]

[Anonymous], 1999, KDD cup 1999 data

[3]

[Anonymous], 2002, P 9 ACM C COMP COMM

[4]

[Anonymous], 2001, CS200104

[5]

[Anonymous], 2003, P IEEE FDN NEW DIR D

[6]

AXELSSON S, 2000, ACM T INFORM SYSTEM, V3

[7]

Bace R., 2001, NIST Special Publication on Intrusion Detection Systems

[8]

Barbara D., 2001, 1 SIAM C DAT MIN

[9]

Bloedorn EE, 2006, ADV INFO KNOW PROC, P65

[10] POSEIDON: a 2-tier anomaly-based network intrusion detection system [J].

Bolzoni, Damiano ;

Etalle, Sandro ;

Hartel, Pieter ;

Zambon, Emmanuele .

FOURTH IEEE INTERNATIONAL WORKSHOP ON INFORMATION ASSURANCE, PROCEEDINGS, 2006, :144-+

← 1 2 3 4 5 6 7 8 →