DEEP LEARNING BASED SENSITIVE DATA DETECTION

被引:2
|
作者
Chong, Peng [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 610000, Peoples R China
关键词
Sensitive data detection; Data anonymization; Deep Learning; Cyber intelligence; PRIVACY;
D O I
10.1109/ICCWAMTIP56608.2022.10016592
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The growing popularity of edge techniques, such as IoT, 5G, blockchain, make it increasingly challenging to protect sensitive data due to the amount of data increases and the growing volume of regulatory policies. To properly protect sensitive data, it is very important to identify sensitive data and implement data anonymization to ensure the quality and proper use of data anonymization techniques. This work focuses on proactively sensitive data identification, classification and anonymization using machine learning techniques. We first investigated the sensitive data extraction from both structured data and unstructured data, in which Bert models and Regular expressions were used to achieve the identification of sensitive data in real-time. Meanwhile, we propose a comprehensive sensitive detection framework combining the Bert model with regular expressions that can achieve high precision and good generalization capability with not so large corpus. The experimental results demonstrate the effectiveness of proposed solution.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Sensitive Information Detection Based on Deep Learning Models
    Zhang, Ruotong
    Zhu, Dingju
    Wu, Chao
    Xu, Jianyu
    Wu, Chun Ho
    APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [2] IDSDL: a sensitive intrusion detection system based on deep learning
    Yanjun Hu
    Fan Bai
    Xuemiao Yang
    Yafeng Liu
    EURASIP Journal on Wireless Communications and Networking, 2021
  • [3] IDSDL: a sensitive intrusion detection system based on deep learning
    Hu, Yanjun
    Bai, Fan
    Yang, Xuemiao
    Liu, Yafeng
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2021, 2021 (01)
  • [4] Deep Learning Based Data Race Detection Approach
    Zhang Y.
    Qiao L.
    Dong C.
    Gao H.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (09): : 1914 - 1928
  • [5] Anomaly Detection in Health Data Based on Deep Learning
    Han, Ning
    Gao, Sheng
    Li, Jin
    Zhang, Xinming
    Guo, Jun
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 188 - 192
  • [6] DeepFlow: Deep Learning-Based Malware Detection by Mining Android Application for Abnormal Usage of Sensitive Data
    Zhu, Dali
    Jin, Hao
    Yang, Ying
    Wu, Di
    Chen, Weiyi
    2017 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2017, : 438 - 443
  • [7] Pistachio Visual Detection Based on Data Balance and Deep Learning
    Gao J.
    Ni J.
    Yang H.
    Han Z.
    Han, Zhongzhi (hanzhongzhi@qau.edu.cn), 1600, Chinese Society of Agricultural Machinery (52): : 367 - 372
  • [8] Deep learning based epileptic seizure detection with EEG data
    Poorani, S.
    Balasubramanie, P.
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2023,
  • [9] Molecular communication data augmentation and deep learning based detection
    Scazzoli, Davide
    Vakilipoor, Fardad
    Magarini, Maurizio
    NANO COMMUNICATION NETWORKS, 2024, 40
  • [10] Android malware detection framework based on sensitive opcodes and deep reinforcement learning
    Yang J.
    Gui C.
    Journal of Intelligent and Fuzzy Systems, 2024, 46 (04): : 8933 - 8942