Adaptive learning on mobile network traffic data

被引:23
作者
Liu, Zhen [1 ,2 ,3 ]
Japkowicz, Nathalie [2 ]
Wang, Ruoyu [4 ,5 ]
Tang, Deyu [1 ,3 ]
机构
[1] Guangdong Pharmaceut Univ, Sch Med Informat Engn, Guangzhou, Guangdong, Peoples R China
[2] Amer Univ, Dept Comp Sci, Washington, DC 20016 USA
[3] Guangdong Chinese Med Big Data Engn Res Ctr, Guangzhou, Guangdong, Peoples R China
[4] Commun & Comp Network Lab Guangdong, Guangzhou 510041, Guangdong, Peoples R China
[5] South China Univ Technol, Informat & Network Engn & Res Ctr, Guangzhou 510041, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Mobile traffic classification; concept drift; data distributions; ensemble learning;
D O I
10.1080/09540091.2018.1512557
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning based mobile traffic classification has become a popular topic in recent years. As mobile traffic data is dynamic in nature, the static model has become ineffective for the task of classifying future traffic. This is known as the concept drift problem in data streams. To this end, this paper presents an adaptive mobile traffic classification method. Specifically, a method based on the fuzzy competence model is devised to detect concept drift, and a dynamic learning method is presented to update the classification model, so as to adapt to an ever-changing environment at an appropriate time. The concept drift detection method relies on the data distribution instead of the classification error rate. Furthermore, the weights of flow samples are dynamically updated and flow samples are resampled for training a new model when a concept drift is detected. Moreover, recently trained models are saved and used for classification in weighted voting. The weight of each model is updated according to the performance it obtains on the most recent flow samples. On mobile traffic data, experimental results show that our proposed method obtains lower classification error rate with less time consumption on updating models as compared to related methods designed for handling concept drift problems.
引用
收藏
页码:185 / 214
页数:30
相关论文
共 38 条
[1]   Multi-classification approaches for classifying mobile app traffic [J].
Aceto, Giuseppe ;
Ciuonzo, Domenico ;
Montieri, Antonio ;
Pescape, Antonio .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2018, 103 :131-145
[2]   Can Android Applications Be Identified Using Only TCP/IP Headers of Their Launch Time Traffic? [J].
Alan, Hasan Faik ;
Kaur, Jasleen .
PROCEEDINGS OF THE 9TH ACM CONFERENCE ON SECURITY & PRIVACY IN WIRELESS AND MOBILE NETWORKS (WISEC'16), 2016, :61-66
[3]   Energy-Efficient Dynamic Traffic Offloading and Reconfiguration of Networked Data Centers for Big Data Stream Mobile Computing: Review, Challenges, and a Case Study [J].
Baccarelli, Enzo ;
Cordeschi, Nicola ;
Mei, Alessandro ;
Panella, Massimo ;
Shojafar, Mohammad ;
Stefa, Julinda .
IEEE NETWORK, 2016, 30 (02) :54-61
[4]  
Baena-Garcia M., 2006, P 4 INT WORKSH KNOWL, P1
[5]   Analyzing Android Encrypted Network Traffic to Identify User Actions [J].
Conti, Mauro ;
Mancini, Luigi Vincenzo ;
Spolaor, Riccardo ;
Verde, Nino Vincenzo .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2016, 11 (01) :114-125
[6]  
Dai SF, 2013, IEEE INFOCOM SER, P809
[7]  
Dasu T., 2006, P S INT STAT COMP SC, P1
[8]   SLIC: Self-Learning Intelligent Classifier for network traffic [J].
Divakaran, Dinil Mon ;
Su, Le ;
Liau, Yung Siang ;
Thing, Vrizlynn L. L. .
COMPUTER NETWORKS, 2015, 91 :283-297
[9]  
Domingos P., 2000, Proceedings. KDD-2000. Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P71, DOI 10.1145/347090.347107
[10]   Fuzzy competence model drift detection for data-driven decision support systems [J].
Dong, Fan ;
Zhang, Guangquan ;
Lu, Jie ;
Li, Kan .
KNOWLEDGE-BASED SYSTEMS, 2018, 143 :284-294