An IoT Device Identification Method based on Semi-supervised Learning

被引:17
作者
Fan, Linna [1 ,2 ,3 ]
Zhang, Shize [1 ,2 ]
Wu, Yichao [1 ,2 ]
Wang, Zhiliang [1 ,2 ]
Duan, Chenxin [1 ,2 ]
Li, Jia [4 ]
Yang, Jiahai [1 ,2 ]
机构
[1] Tsinghua Univ, Inst Network Sci & Cyberspace, Beijing, Peoples R China
[2] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
[3] Natl Univ Def Technol, Sch Informat & Commun, Xian, Peoples R China
[4] Coordinat Ctr China, Natl Comp Network Emergency Response Tech Team, Beijing, Peoples R China
来源
2020 16TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM) | 2020年
关键词
IoT; identification; semi-supervised learning; PHYSICAL DEVICE; INTERNET;
D O I
10.23919/cnsm50824.2020.9269044
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid proliferation of IoT devices, device management and network security are becoming significant challenges. Knowing how many IoT devices are in the network and whether they are behaving normally is significant. IoT device identification is the first step to achieve these goals. Previous IoT identification works mainly use supervised learning and need lots of labeled data. Considering collecting labeled data is time-consuming and cannot be scaled, in this paper, we propose an IoT identification model based on semi-supervised learning. The model can differentiate IoT and non-IoT and classify specific IoT devices based on time interval features, traffic volume features, protocol features and TLS related features. The evaluation in a public dataset shows that our model only needs 5% labeled data and gets accuracy over 99%.
引用
收藏
页数:7
相关论文
共 40 条
  • [1] Aksoy A, 2019, IEEE ICC
  • [2] Identifying Encrypted Malware Traffic with Contextual Flow Data
    Anderson, Blake
    McGrew, David
    [J]. AISEC'16: PROCEEDINGS OF THE 2016 ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, 2016, : 35 - 46
  • [3] [Anonymous], 2018, ARXIV180602679
  • [4] Antonakakis M, 2017, PROCEEDINGS OF THE 26TH USENIX SECURITY SYMPOSIUM (USENIX SECURITY '17), P1093
  • [5] KURTOSIS - A CRITICAL-REVIEW
    BALANDA, KP
    MACGILLIVRAY, HL
    [J]. AMERICAN STATISTICIAN, 1988, 42 (02) : 111 - 119
  • [6] Bao JQ, 2020, INT WIREL COMMUN, P565, DOI 10.1109/IWCMC48107.2020.9148110
  • [7] Bezawada B., 2018, ARXIV180403852
  • [8] Revealing Skype traffic: When randomness plays with you
    Bonfiglio, Dario
    Mellia, Marco
    Meo, Michela
    Rossi, Dario
    Tofanelli, Paolo
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2007, 37 (04) : 37 - 48
  • [9] IoT or NoT: Identifying IoT Devices in a Short Time Scale
    Bremler-Barr, Anat
    Levy, Haim
    Yakhini, Zohar
    [J]. NOMS 2020 - PROCEEDINGS OF THE 2020 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM 2020: MANAGEMENT IN THE AGE OF SOFTWARIZATION AND ARTIFICIAL INTELLIGENCE, 2020,
  • [10] Measuring Skewness: A Forgotten Statistic?
    Doane, David P.
    Seward, Lori E.
    [J]. JOURNAL OF STATISTICS EDUCATION, 2011, 19 (02):