Log Layering Based on Natural Language Processing

被引:0
|
作者
Shen, Hanji [1 ,2 ]
Long, Chun [1 ]
Wan, Wei [1 ]
Li, Jun [1 ]
Qin, Yakui [1 ]
Fu, Yuhao [1 ]
Song, Xiaofan [1 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
来源
2019 21ST INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): ICT FOR 4TH INDUSTRIAL REVOLUTION | 2019年
关键词
Real-time Log Data; Natural Language Processing; Data Compression;
D O I
10.23919/icact.2019.8702019
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
With the increasing number and variety of logs, the requirement of storage space is growing rapidly. Meantime, the speed and accuracy of querying in massive logs are becoming increasingly important. Although the well-built distributed storage technique solves the problem of mass storage and fast query, the cost is too high. As logs are created as the method to trace the historical operation, the requirement for query rate is not high. To balance the storage cost and query rate, this paper proposes a real-time log layering storage technique based on natural language processing. According to the characteristics of the log data, this technique is combined with the text language processing technique. It compresses the real-time log data effectively while considering the query efficiency. Firstly, the method extracts the feature of each log that flows in, which will be the type name of the log. Then, the method performs word segmentation on the log and encodes each word to store the key value pairs. Finally, the key value pairs of the log are stored in the memory, and the code of each log is stored in the database. Experiments show that this method can ensure the integrity of the data effectively, decompression time dropped to 40%, compression rate down to 35%.
引用
收藏
页码:660 / 663
页数:4
相关论文
共 50 条
  • [1] Natural Language Processing-based Model for Log Anomaly Detection
    Li, Zezhou
    Zhang, Jing
    Zhang, Xianbo
    Lin, Feng
    Wang, Chao
    Cai, Xingye
    2022 2ND IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE (SEAI 2022), 2022, : 129 - 134
  • [2] Leveraging Code Clones and Natural Language Processing for Log Statement Prediction
    Gholamian, Sina
    2021 36TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING ASE 2021, 2021, : 1043 - 1047
  • [3] Leveraging Clustering and Natural Language Processing to Overcome Variety Issues in Log Management
    Eljasik-Swoboda, Tobias
    Demuth, Wilhelm
    ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2020, : 281 - 288
  • [4] Anomaly Detection in Log Files Using Selected Natural Language Processing Methods
    Ryciak, Piotr
    Wasielewska, Katarzyna
    Janicki, Artur
    APPLIED SCIENCES-BASEL, 2022, 12 (10):
  • [5] PEPO: Petition Executing Processing Optimizer Based on Natural Language Processing
    Chiu, Yin-Wei
    Huang, Hsiao-Ching
    Lee, Cheng-Ju
    Hsieh, Hsun-Ping
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3150 - 3154
  • [6] Professional Chat Application based on Natural Language Processing
    Karthick, S.
    Victor, R. John
    Manikandan, S.
    Goswami, Bhargavi
    2018 IEEE INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN ADVANCED COMPUTING (ICCTAC), 2018,
  • [7] Vulnerability Detection Methods Based on Natural Language Processing
    Yang Y.
    Li Y.
    Chen K.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (12): : 2649 - 2666
  • [8] Natural Language Processing Based Interpretation of Skewed Graphs
    Mahmood, Aqsa
    Qazi, Kiran
    Bajwa, Imran Sarwar
    Naeem, M. Asif
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2014, : 2700 - 2704
  • [9] Text Encryption Algorithm Based on Natural Language Processing
    Jing, Xianghe
    Hao, Yu
    Fei, Huaping
    Li, Zhijun
    2012 FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION NETWORKING AND SECURITY (MINES 2012), 2012, : 670 - 672
  • [10] Development and Optimization of Language Reading Comprehension Aids Based on Natural Language Processing
    Zhang, Chuqing
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (06) : 393 - 398