Detection of DoH Traffic Tunnels Using Deep Learning for Encrypted Traffic Classification

被引:5
作者
Alzighaibi, Ahmad Reda [1 ]
机构
[1] Taibah Univ, Coll Comp Sci & Engn, Yanbu 42353, Saudi Arabia
关键词
DNS over HTTPS (DoH); CIRA-CIC-DoHBrw-2020; deep Learning; encrypted traffic classification;
D O I
10.3390/computers12030047
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Currently, the primary concerns on the Internet are security and privacy, particularly in encrypted communications to prevent snooping and modification of Domain Name System (DNS) data by hackers who may attack using the HTTP protocol to gain illegal access to the information. DNS over HTTPS (DoH) is the new protocol that has made remarkable progress in encrypting Domain Name System traffic to prevent modifying DNS traffic and spying. To alleviate these challenges, this study explored the detection of DoH traffic tunnels of encrypted traffic, with the aim to determine the gained information through the use of HTTP. To implement the proposed work, state-of-the-art machine learning algorithms were used including Random Forest (RF), Gaussian Naive Bayes (GNB), Logistic Regression (LR), k-Nearest Neighbor (KNN), the Support Vector Classifier (SVC), Linear Discriminant Analysis (LDA), Decision Tree (DT), Adaboost, Gradient Boost (SGD), and LSTM neural networks. Moreover, ensemble models consisting of multiple base classifiers were utilized to carry out a series of experiments and conduct a comparative study. The CIRA-CIC-DoHBrw2020 dataset was used for experimentation. The experimental findings showed that the detection accuracy of the stacking model for binary classification was 99.99%. In the multiclass classification, the gradient boosting model scored maximum values of 90.71%, 90.71%, 90.87%, and 91.18% in Accuracy, Recall, Precision, and AUC. Moreover, the micro average ROC curve for the LSTM model scored 98%.
引用
收藏
页数:17
相关论文
共 41 条
  • [1] Al Majzoub Hisham, 2020, International Journal of Machine Learning and Computing, V10, P39, DOI 10.18178/ijmlc.2020.10.1.894
  • [2] Amaratunga T., 2020, DEEP LEARNING WINDOW, P67
  • [3] Banadaki YM, 2020, Journal of Computer Sciences and Applications, V8, P46, DOI [10.12691/jcsa-8-2-2, DOI 10.12691/JCSA-8-2-2]
  • [4] Bhukya D.P., 2010, Int. J. Electr. Comput. Eng, V2, P660, DOI DOI 10.7763/IJCEE.2010.V2.208
  • [5] Borgolte Kevin, 2019, TPRC47
  • [6] An Empirical Study of the Cost of DNS-over-HTTPS
    Bottger, Timm
    Cuadrado, Felix
    Antichi, Gianni
    Fernandes, Eder Leao
    Tyson, Gareth
    Castro, Ignacio
    Uhlig, Steve
    [J]. IMC'19: PROCEEDINGS OF THE 2019 ACM INTERNET MEASUREMENT CONFERENCE, 2019, : 15 - 21
  • [7] Bramer M, 2020, PRINCIPLES DATA MINI, P79
  • [8] On the Impact of DNS over HTTPS Paradigm on Cyber Systems
    Bumanglag, Kimo
    Kettani, Houssain
    [J]. 2020 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMPUTER TECHNOLOGIES (ICICT 2020), 2020, : 494 - 499
  • [9] Bushart J., 2020, 10 USENIX WORKSH FRE
  • [10] de Vries L., 2021, THESIS U TWENTE ENSC