Big Data Network Flow Processing Using Apache Spark

Cited by: 0
Authors
Jerabek, Kamil [1]
Rysavy, Ondrej [1]
Affiliations
[1] Brno Univ Technol, Brno, Czech Republic
Source
PROCEEDINGS OF THE 6TH CONFERENCE ON THE ENGINEERING OF COMPUTER BASED SYSTEMS (ECBS 2019) | 2020
Keywords
Big Data; Network flows; Apache Spark; Cassandra; Apache Ignite
DOI
10.1145/3352700.3352709
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The growing number of traffic flows captured during network monitoring makes their analysis more complicated. One goal of network traffic analysis is to identify malicious communication. In this paper, we present a new system for classifying and clustering network flows at big data scale. The proposed system is built on popular big data engines such as Apache Spark and Apache Ignite. The experiments conducted demonstrate the feasibility of the proposed approach and indicate its potential scalability.
Pages: 9
Related papers
13 in total
[1]   Big Data Analytics for Security [J].
Cardenas, Alvaro A. ;
Manadhata, Pratyusa K. ;
Rajan, Sreeranga P. .
IEEE SECURITY & PRIVACY, 2013, 11 (06) :74-76
[2]  
Carpenter Jeff., 2016, CASSANDRA DEFINITIVE
[3]  
Chambers B., 2018, Spark: the definitive guide
[4]  
Hendawi AM, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), P2590, DOI 10.1109/BigData.2016.7840901
[5]  
Kadam Yogesh V, 2013, INT J ENG COMPUTER S
[6]  
Lee Y, 2013, ACM SIGCOMM COMP COM, V43, P6
[7]  
Lukashin Alexey, 2014, DISTRIBUTED PACKET T, P535, DOI 10.1007/978-3-319-10353-2_49
[8]  
Rychly Marek, 2018, J CYBER SECURITY MOB, V8, P165, DOI 10.13052/jcsm2245-1439.822
[9]   Towards Large Scale Packet Capture and Network Flow Analysis on Hadoop [J].
Saavedra, Miguel Zenon Nicanor L. ;
Yu, William Emmanuel S. .
2018 SIXTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING WORKSHOPS (CANDARW 2018), 2018, :186-189
[10]  
Wullink M, 2016, IEEE IFIP NETW OPER, P913, DOI 10.1109/NOMS.2016.7502925