Feature selection for optimizing traffic classification

被引:96
|
作者
Zhang, Hongli [1 ]
Lu, Gang [1 ]
Qassrawi, Mahmoud T. [1 ]
Zhang, Yu [1 ]
Yu, Xiangzhan [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Traffic classification; Class imbalance; Robust features; IDENTIFICATION;
D O I
10.1016/j.comcom.2012.04.012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning (ML) algorithms have been widely applied in recent traffic classification. However, due to the imbalance in the number of traffic flows, ML based classifiers are prone to misclassify flows as the traffic type that occupies the majority of flows on the Internet. To address the problem, a novel feature selection metric named Weighted Symmetrical Uncertainty (WSU) is proposed. We design a hybrid feature selection algorithm named WSU_AUC, which prefilters most of features with WSU metric and further uses a wrapper method to select features for a specific classifier with Area Under roc Curve (AUC) metric. Additionally, to overcome the impacts of dynamic traffic flows on feature selection, we propose an algorithm named SRSF that Selects the Robust and Stable Features from the results achieved by WSU_AUC. We evaluate our approaches using three classifiers on the traces captured from entirely different networks. Experimental results obtained by our algorithms are promising in terms of true positive rate (TPR) and false positive rate (FPR). Moreover, our algorithms can achieve >94% flow accuracy and >80% byte accuracy on average. (c) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:1457 / 1471
页数:15
相关论文
共 50 条
  • [31] Optimizing Urban Traffic Incident Prediction With Vertical Federated Learning: A Feature Selection Based Approach
    Hussain, Basharat
    Afzal, Muhammad Khalil
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2025, 12 (01): : 145 - 155
  • [32] Optimizing Neural Networks for Academic Performance Classification Using Feature Selection and Resampling Approach
    Supriyadi D.
    Purwanto P.
    Warsito B.
    Mendel, 2023, 29 (02) : 261 - 272
  • [33] Optimal Supervised Feature Extraction in Internet Traffic Classification
    Aliakbarian, M. Sadegh
    Fanian, Ali
    Saleh, Fatemeh Sadat
    Gulliver, T. Aaron
    2013 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING (PACRIM), 2013, : 102 - 107
  • [34] Comprehensive feature selection for appliance classification in NILM
    Sadeghianpourhamami, N.
    Ruyssinck, J.
    Deschrijver, D.
    Dhaene, T.
    Develder, C.
    ENERGY AND BUILDINGS, 2017, 151 : 98 - 106
  • [35] A Feature Selection Method for Large-Scale Network Traffic Classification Based on Spark
    Wang, Yong
    Ke, Wenlong
    Tao, Xiaoling
    INFORMATION, 2016, 7 (01)
  • [36] On feature selection for traffic congestion prediction
    Yang, Su
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2013, 26 : 160 - 169
  • [37] Feature Selection to Traffic Flow Forecasting
    Sun, Zhanquan
    Wang, Yinglong
    Pan, Jingshan
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 2864 - 2869
  • [38] Feature Selection for Monotonic Classification
    Hu, Qinghua
    Pan, Weiwei
    Zhang, Lei
    Zhang, David
    Song, Yanping
    Guo, Maozu
    Yu, Daren
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2012, 20 (01) : 69 - 81
  • [39] Feature Selection for Classification with QAOA
    Turati, Gloria
    Dacrema, Maurizio Ferrari
    Cremonesi, Paolo
    2022 IEEE INTERNATIONAL CONFERENCE ON QUANTUM COMPUTING AND ENGINEERING (QCE 2022), 2022, : 782 - 785
  • [40] Feature Selection for Collective Classification
    Senliol, Baris
    Aral, Atakan
    Cataltepe, Zehra
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 285 - 290