A Statistical Technique for Online Anomaly Detection for Big Data Streams in Cloud Collaborative Environment

被引:4
作者
Smrithy, G. S. [1 ]
Balakrishnan, Ramadoss [1 ]
机构
[1] Natl Inst Technol, Dept Comp Applicat, Tiruchirappalli, Tamil Nadu, India
来源
2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT) | 2016年
关键词
Big data; cloud collaboration; online anomaly detection; non-parametric statistical technique; KolmogorovSmirnov goodness of fit test;
D O I
10.1109/CIT.2016.80
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Big data and cloud computing are the two top IT initiatives that are in the mind for industries across the globe. Both innovations keep on evolving. As a delivery model for IT services, cloud computing has the potential to enhance agility and productivity while enabling greater efficiencies and reducing costs. As a result a number of enterprises are building efficient and agile cloud environments, and cloud providers continue to expand service offerings. Many cloud providers offer online collaboration service which is basically loosely-coupled in nature. Online anomaly detection aims to detect anomalies in data flowing in a streaming fashion. Such stream data is commonplace in today's cloud centric collaborations which enables participating domains to dynamically interoperate through sharing and accessing of information. Accordingly to forestall unauthorized disclosure of the shared resources and conceivable misappropriation, there is a need to identify anomalous access requests. To the best of our knowledge, the detection of anomalous access requests in cloud-based collaborations through non-parametric statistical technique has not been studied in earlier works. This paper proposes an online anomaly detection algorithm based on Kolmogorov-Smirnov goodness of fit test to detect anomalous access requests in cloud environment at runtime.
引用
收藏
页码:108 / 111
页数:4
相关论文
共 16 条
[1]   A comprehensive survey of numeric and symbolic outlier mining techniques [J].
Agyemang, Malik ;
Barker, Ken ;
Alhajj, Rada .
INTELLIGENT DATA ANALYSIS, 2006, 10 (06) :521-538
[2]  
Bamnett V., 1994, OUTLIERS STAT DATA, V3
[3]  
BECKMAN RJ, 1983, TECHNOMETRICS, V25, P119, DOI 10.2307/1268541
[4]   Maintaining stream statistics over sliding windows [J].
Datar, M ;
Gionis, A ;
Indyk, P ;
Motwani, R .
SIAM JOURNAL ON COMPUTING, 2002, 31 (06) :1794-1813
[5]  
Farid D. M., 2010, INT J NETW SECUR ITS, DOI DOI 10.5121/IJNSA.2010.2202
[6]   Big Data with Cloud Computing: an insight on the computing environment, MapReduce, and programming frameworks [J].
Fernandez, Alberto ;
del Rio, Sara ;
Lopez, Victoria ;
Bawakid, Abdullah ;
del Jesus, Maria J. ;
Benitez, Jose M. ;
Herrera, Francisco .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 4 (05) :380-409
[7]  
Gupta J., 2014, SYNTH LECT DATA MINI, V5, P1, DOI DOI 10.2200/S00573ED1V01Y201403DMK008
[8]  
Ha Duong Nguyen, 2016, 2016 18 INT C ADV CO
[9]  
Kim Hyunjoo, 2015, 13 AUSTR INF SEC MAN, V13, P73
[10]  
Lehmann EL, 2006, TESTING STAT HYPOTHE