Using support vector machine in traffic analysis for website recognition

被引:2
作者
Shi, JQ [1 ]
Fang, BX [1 ]
Bin, L [1 ]
Wang, FL [1 ]
机构
[1] Harbin Inst Technol, Res Ctr Comp Network & Informat Secur Technol, Harbin 150010, Peoples R China
来源
PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7 | 2004年
关键词
traffic analysis; support vector machine; session describing vector;
D O I
10.1109/ICMLC.2004.1378294
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Website Recognition is the process of identifying specific websites from analyzing the traffic flow. Encryption invalidates content analysis techniques, while traffic analysis can solve the problem by concentrating on the nature and behavior of traffic. Based on the structural-stable but content-mutable properties of website, a method combining machine learning algorithm and traffic analysis technique is proposed for encrypted website recognition. Session Describing Vector, composed of connection count and data volumes transferred in each connection, is introduced to characterize a web surfing flow. And through vector normalization, generalization and ranking, the sequence, length and dimension weight are adjusted to improve the recognition effect The recognition process can be considered as a binary classification problem, thus SVM (support vector machine) algorithm is adopted because of its excellent performance in pattern classification problems. Experiments show that the proposed method can discern the vectors of a specific website from others clearly, and the process of generalization and ranking are of great help to classification.
引用
收藏
页码:2680 / 2684
页数:5
相关论文
共 8 条
  • [1] [Anonymous], 2000, DESIGNING PRIVACY EN
  • [2] CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
  • [3] Gunn S. R., 1998, SUPPORT VECTOR MACHI
  • [4] PFITZMANN B, 1990, P EUROCRYPT 1989 HOU
  • [5] RFC, 1945, HYP TRANSF PROT HTTP
  • [6] SCHNEIER B, 1996, APPL CRYPTOGRAPHY, P219
  • [7] Syverson Paul F., 2002, INT WORKSH PRIV ENH, P171
  • [8] Vapnik V, 1999, NATURE STAT LEARNING