Effect of Feature Selection on Performance of Internet Traffic Classification on NIMS Multi-Class dataset

被引:3
作者
Oluranti, Jonathan
Omoregbe, Nicholas
Misra, Sanjay
机构
来源
3RD INTERNATIONAL CONFERENCE ON SCIENCE AND SUSTAINABLE DEVELOPMENT (ICSSD 2019): SCIENCE, TECHNOLOGY AND RESEARCH: KEYS TO SUSTAINABLE DEVELOPMENT | 2019年 / 1299卷
关键词
Traffic Classification; Network Management; Feature Selection; Multi-class dataset;
D O I
10.1088/1742-6596/1299/1/012035
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The challenges faced by networks nowadays can be solved to a great extent by the application of accurate network traffic classification. Internet network traffic classification is responsible for associating network traffic with the application generating them and helps in the area of network monitoring, Quality of Service management, among other. Traditional methods of traffic classification including port-based, payload-load based, host-based, behavior-based exhibit a number of limitations that range from high computational cost to inability to access encrypted packets for the purpose of classification. Machine learning techniques based on statistical properties are now being employed to overcome the limitations of existing techniques. However, the high number of features of flows that serve as input to the learning machine poses a great challenge that requires the application of a pre-processing stage known as feature selection. Too many irrelevant and redundant features affect predictive accuracy and performance of the learning machine. This work analyses experimentally, the effect of a collection of ranking-basedfilter feature selection methods on a multi-class dataset for traffic classification. In the first stage, the proposed Top-N criterionis applied to the feature sets obtained, while in the second stage we generate for each Top-N set of features a new dataset which is applied as input to a set of four machine learning algorithms (classifiers).Experimental results show the viability of our model as a tool for selecting the optimal subset of features which when applied, lead to improvement of accuracy and performance of the traffic classification process.
引用
收藏
页数:10
相关论文
共 19 条
[1]   Mutual information-based feature selection for intrusion detection systems [J].
Amiri, Fatemeh ;
Yousefi, MohammadMahdi Rezaei ;
Lucas, Caro ;
Shakery, Azadeh ;
Yazdani, Nasser .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2011, 34 (04) :1184-1199
[2]   A comprehensive survey on machine learning for networking: evolution, applications and research opportunities [J].
Boutaba, Raouf ;
Salahuddin, Mohammad A. ;
Limam, Noura ;
Ayoubi, Sara ;
Shahriar, Nashid ;
Estrada-Solano, Felipe ;
Caicedo, Oscar M. .
JOURNAL OF INTERNET SERVICES AND APPLICATIONS, 2018, 9 (09)
[3]  
Cai Jie, 2018, NEUROCOMPUTING
[4]  
Chen ZX, 2014, LECT NOTES COMPUT SC, V8631, P631, DOI 10.1007/978-3-319-11194-0_56
[5]  
Ding C., 2006, 23 INT C MACH LEARN
[6]  
En-Najjary T., 2010, 22 INT TEL C
[7]   Toward an efficient and scalable feature selection approach for internet traffic classification [J].
Fahad, Adil ;
Tari, Zahir ;
Khalil, Ibrahim ;
Habib, Ibrahim ;
Alnuweiri, Hussein .
COMPUTER NETWORKS, 2013, 57 (09) :2040-2057
[8]  
Ferri C., 2009, PATTERN RECOGNITION
[9]  
Hasani Seyed Reza, 2014, Journal of Computer Science, V10, P1015, DOI 10.3844/jcssp.2014.1015.1025
[10]  
Kulin M., 2016, SENSORS