The importance of generalizability for anomaly detection

被引:10
作者
Peterson, Gilbert L. [1 ]
McBride, Brent T. [1 ]
机构
[1] USAF, Inst Technol, Dept Elect & Comp Engn, Wright Patterson AFB, OH 45433 USA
关键词
clustering; anomaly detection; convex polytope; ellipsoid;
D O I
10.1007/s10115-007-0072-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In security-related areas there is concern over novel "zero-day" attacks that penetrate system defenses and wreak havoc. The best methods for countering these threats are recognizing "nonself" as in an Artificial Immune System or recognizing "self" through clustering. For either case, the concern remains that something that appears similar to self could be missed. Given this situation, one could incorrectly assume that a preference for a tighter fit to self over generalizability is important for false positive reduction in this type of learning problem. This article confirms that in anomaly detection as in other forms of classification a tight fit, although important, does not supersede model generality. This is shown using three systems each with a different geometric bias in the decision space. The first two use spherical and ellipsoid clusters with a k-means algorithm modified to work on the one-class/blind classification problem. The third is based on wrapping the self points with a multidimensional convex hull (polytope) algorithm capable of learning disjunctive concepts via a thresholding constant. All three of these algorithms are tested using the Voting dataset from the UCI Machine Learning Repository, the MIT Lincoln Labs intrusion detection dataset, and the lossy-compressed steganalysis domain.
引用
收藏
页码:377 / 392
页数:16
相关论文
共 48 条
[1]  
[Anonymous], SPIE S EL IM SAN JOS
[2]  
AVCIBAS I, 2002, INT C IM PROC ROCH N
[3]  
BARBER CB, 2002, QHULL VERSION 2002 1
[4]   Hydroxy-functionalized liquid crystalline polyazomethines .2. Study of new central cores and synthesis of coordination polymers [J].
Barbera, J ;
Cerrada, P ;
Oriol, L ;
Pinol, M ;
Serrano, JL ;
Alonso, PJ .
LIQUID CRYSTALS, 1997, 22 (04) :483-495
[5]  
BARRON AR, 1991, PROCEEDINGS OF THE FOURTH ANNUAL WORKSHOP ON COMPUTATIONAL LEARNING THEORY, P243
[6]  
BAUM EB, 1988, P NEUR INF PROC SYST, P81
[7]  
Blake C.L., 1998, UCI repository of machine learning databases
[8]  
BROTHERTON T, 2001, IEEE AER C BIG SK MT
[9]   Anomaly detection and classification for hyperspectral imagery [J].
Chang, CI ;
Chiang, SS .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2002, 40 (06) :1314-1325
[10]   Efficient anomaly detection by modeling privilege flows using hidden Markov model [J].
Cho, SB ;
Park, HJ .
COMPUTERS & SECURITY, 2003, 22 (01) :45-55