Text-based Malicious Domain Names Detection Based on Variational Autoencoder And Supervised Learning

被引:0
作者
Sun, Yuwei [1 ]
Chong, Ng S. T. [2 ]
Ochiai, Hideya [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
[2] United Nations Univ, Campus Comp Ctr, Tokyo, Japan
来源
2020 54TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS) | 2020年
关键词
malicious domain names detection; VAE; cybersecurity; machine learning;
D O I
10.1109/CISS48834.2020.1570601577
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid development of information technology, adaptation of an information system in industries and institutes has become more and more common. However, attacks like using zombie networks to access a host thus causing it to shut down are frequent in recent years. Domain names play a significant role in the connection with a server, considered as a key for detecting these attacks. In this paper, we propose a text-based method to convert domain names into numeric features, based on the term frequency and inverse document frequency (TF-IDF). Then we adopt the variational autoencoder (VAE) consisting of an encoder and a decoder, extracting hidden information from features. Moreover, through collapsing the Gaussian distribution of these features at the hidden layer to its mean, the distribution of domain names is visualized. After that, we adopt a supervised learning called Convolutional Neural Network (CNN) for the classification between the malicious and benign. We train the model using feature vectors from the VAE. At last, the scheme achieves a validation accuracy of 0.868 for the malicious domain names detection.
引用
收藏
页码:192 / 196
页数:5
相关论文
共 50 条
  • [21] Improved Detection of Malicious Domain Names Using Gradient Boosted Machines and Feature Engineering
    Alhogail, Areej
    Al-Turaiki, Isra
    [J]. INFORMATION TECHNOLOGY AND CONTROL, 2022, 51 (02): : 313 - 331
  • [22] A Semi-Supervised Learning Scheme to Detect Unknown DGA Domain Names based on Graph Analysis
    Yan, Fan
    Liu, Jia
    Gu, Liang
    Chen, Zelong
    [J]. 2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 1578 - 1583
  • [23] Detecting malicious domain names using deep learning approaches at scale
    Vinayakumar, R.
    Soman, K. P.
    Poornachandran, Prabaharan
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (03) : 1355 - 1367
  • [24] TEXT CLASSIFICATION BASED ON SEMI-SUPERVISED LEARNING
    Vo Duy Thanh
    Vo Trung Hung
    Pham Minh Tuan
    Doan Van Ban
    [J]. 2013 INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2013, : 232 - 236
  • [25] Convolutional One-Dimensional Variational Autoencoder Based Intrusion detection
    Wu, Kailin
    Cao, MengTing
    Wang, Pan
    Wang, ZiXuan
    [J]. 2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 488 - 493
  • [26] Semisupervised anomaly detection of multivariate time series based on a variational autoencoder
    Ningjiang Chen
    Huan Tu
    Xiaoyan Duan
    Liangqing Hu
    Chengxiang Guo
    [J]. Applied Intelligence, 2023, 53 : 6074 - 6098
  • [27] Semisupervised anomaly detection of multivariate time series based on a variational autoencoder
    Chen, Ningjiang
    Tu, Huan
    Duan, Xiaoyan
    Hu, Liangqing
    Guo, Chengxiang
    [J]. APPLIED INTELLIGENCE, 2023, 53 (05) : 6074 - 6098
  • [28] DOLPHIN: Phonics based Detection of DGA Domain Names
    Zhao, Dan
    Li, Hao
    Sun, Xiuwen
    Tang, Yazhe
    [J]. 2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [29] A Novel Self-Supervised Learning-Based Anomalous Node Detection Method Based on an Autoencoder for Wireless Sensor Networks
    Ye, Miao
    Zhang, Qinghao
    Xue, Xingsi
    Wang, Yong
    Jiang, Qiuxiang
    Qiu, Hongbing
    [J]. IEEE SYSTEMS JOURNAL, 2024, 18 (01): : 256 - 267
  • [30] Character Level based Detection of DGA Domain Names
    Yu, Bin
    Pan, Jie
    Hu, Jiaming
    Nascimento, Anderson
    De Cock, Martine
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,