Parallelized Jaccard-Based Learning Method and MapReduce Implementation for Mobile Devices Recognition from Massive Network Data

被引:7
|
作者
Liu Jun [1 ]
Li Yinzhou [1 ]
Cuadrado, Felix [2 ]
Uhlig, Steve [2 ]
Lei Zhenming [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing Key Lab Network Syst Architecture & Conve, Beijing 100876, Peoples R China
[2] Univ London, Dept Elect Engn & Comp Sci, London E1 4NS, England
基金
中国国家自然科学基金;
关键词
mobile device recognition; data mining; Jaccard coefficient measurement; distributed computing; MapReduce;
D O I
10.1109/CC.2013.6571290
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
The ability of accurate and scalable mobile device recognition is critically important for mobile network operators and ISPs to understand their customers' behaviours and enhance their user experience. In this paper, we propose a novel method for mobile device model recognition by using statistical information derived from large amounts of mobile network traffic data. Specifically, we create a Jaccard-based coefficient measure method to identify a proper keyword representing each mobile device model from massive unstructured textual HTTP access logs. To handle the large amount of traffic data generated from large mobile networks, this method is designed as a set of parallel algorithms, and is implemented through the MapReduce framework which is a distributed parallel programming model with proven low-cost and high-efficiency features. Evaluations using real data sets show that our method can accurately recognise mobile client models while meeting the scalability and producer-independency requirements of large mobile network operators. Results show that a 91.5% accuracy rate is achieved for recognising mobile client models from 2 billion records, which is dramatically higher than existing solutions.
引用
收藏
页码:71 / 84
页数:14
相关论文
共 15 条
  • [1] A Method For Hybrid Bayesian Network Structure Learning from Massive Data Using MapReduce
    Li, Shun
    Wang, Biao
    2017 IEEE 3RD INTERNATIONAL CONFERENCE ON BIG DATA SECURITY ON CLOUD (BIGDATASECURITY, IEEE 3RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, (HPSC) AND 2ND IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA AND SECURITY (IDS), 2017, : 272 - 276
  • [2] Hybrid Parrallel Bayesian Network Structure Learning from Massive Data Using MapReduce
    Shun Li
    Biao Wang
    Journal of Signal Processing Systems, 2018, 90 : 1115 - 1121
  • [3] Hybrid Parrallel Bayesian Network Structure Learning from Massive Data Using MapReduce
    Li, Shun
    Wang, Biao
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 90 (8-9): : 1115 - 1121
  • [4] Parallelized User Clicks Recognition from Massive HTTP Data Based on Dependency Graph Model
    Fang Cheng
    Liu Jun
    Lei Zhenming
    CHINA COMMUNICATIONS, 2014, 11 (12) : 13 - 25
  • [5] Mobile Big Data Analytics for Human Behavior Recognition in Wireless Sensor Network Based on Transfer Learning
    Cui, Zhexiong
    Ren, Jie
    JOURNAL OF INTERCONNECTION NETWORKS, 2024, 24 (SUPP01)
  • [6] Bayesian Network Structure Learning from Big Data: A Reservoir Sampling Based Ensemble Method
    Tang, Yan
    Xu, Zhuoming
    Zhuang, Yuanhang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2016, 2016, 9645 : 209 - 222
  • [7] A personalized federated learning-based fault diagnosis method for data suffering from network attacks
    Zhiqiang Zhang
    Funa Zhou
    Chongsheng Zhang
    Chenglin Wen
    Xiong Hu
    Tianzhen Wang
    Applied Intelligence, 2023, 53 : 22834 - 22849
  • [8] A personalized federated learning-based fault diagnosis method for data suffering from network attacks
    Zhang, Zhiqiang
    Zhou, Funa
    Zhang, Chongsheng
    Wen, Chenglin
    Hu, Xiong
    Wang, Tianzhen
    APPLIED INTELLIGENCE, 2023, 53 (19) : 22834 - 22849
  • [9] Sensor-based Complex Human Activity Recognition from Smartwatch Data using Hybrid Deep Learning Network
    Mekruksavanich, Sakorn
    Jitpattanakul, Anuchit
    2021 36TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC), 2021,
  • [10] Reinforcement Learning Based Active Attack Detection and Blockchain Technique to Protect the Data from the Passive Attack in the Autonomous Mobile Network
    Sivasankar, C.
    Kumanan, T.
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 131 (04) : 2697 - 2714