An Efficient PDF Malware Detection Method Using Highly Compact Features

被引:0
作者
Liu, Ran [1 ]
Matuszek, Cynthia [1 ]
Nicholas, Charles [1 ]
机构
[1] Univ Maryland, Baltimore, MD 21201 USA
来源
PROCEEDINGS OF THE 2024 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2024 | 2024年
关键词
Malware Detection; Machine Learning; Feature Engineering;
D O I
10.1145/3685650.3685668
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The growing use of PDFs has made them a prime target for malware attacks. Machine learning-based approaches for detecting PDF malware are increasingly popular due to their high accuracy and efficiency. However, the effectiveness of these systems largely depends on the quality of the dataset and the features used. Additionally, they face challenges from sophisticated evasion attacks. This paper introduces a compact yet highly effective feature set, consisting of just five features, designed to improve training efficiency and enhance the robustness of PDF malware detection models. Through experiments, including tests on the real-world detection system PDFRATE, we demonstrate that our proposed feature set not only trains highly accurate models but also increases the system's robustness against a specific evasive attack known as the Benign Random Noise (BRN) attack.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] PDF Malware Detection: Toward Machine Learning Modeling With Explainability Analysis
    Hossain, G. M. Sakhawat
    Deb, Kaushik
    Janicke, Helge
    Sarker, Iqbal H.
    IEEE ACCESS, 2024, 12 : 13833 - 13859
  • [42] Malware Detection Method using Tree-based Machine Learning Algorithms
    Okada, Satoshi
    Matsuda, Wataru
    Fujimoto, Mariko
    Mitsunaga, Takuho
    2021 IEEE INTERNATIONAL CONFERENCE ON COMPUTING (ICOCO), 2021, : 103 - 108
  • [43] Malware Detection Using Automated Generation of Yara Rules on Dynamic Features
    Si, Qin
    Xu, Hui
    Tong, Ying
    Zhou, Yu
    Liang, Jian
    Cui, Lei
    Hao, Zhiyu
    SCIENCE OF CYBER SECURITY, SCISEC 2022, 2022, 13580 : 315 - 330
  • [44] Malware Detection Using Semantic Features and Improved Chi-square
    Ha, Seung-Tae
    Hong, Sung-Sam
    Han, Myung-Mook
    JOURNAL OF INTERNET TECHNOLOGY, 2018, 19 (03): : 879 - 887
  • [45] OMD: Orthogonal Malware Detection using Audio, Image, and Static Features
    Nataraj, Lakshmanan
    Mohammed, Tajuddin Manhar
    Nanjundaswamy, Tejaswi
    Chikkagoudar, Satish
    Chandrasekaran, Shivkumar
    Manjunath, B. S.
    2021 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM 2021), 2021,
  • [46] Automatic malware classification and new malware detection using machine learning
    Liu Liu
    Bao-sheng Wang
    Bo Yu
    Qiu-xi Zhong
    Frontiers of Information Technology & Electronic Engineering, 2017, 18 : 1336 - 1347
  • [47] HEMD: a highly efficient random forest-based malware detection framework for Android
    Zhu, Hui-Juan
    Jiang, Tong-Hai
    Ma, Bo
    You, Zhu-Hong
    Shi, Wei-Lei
    Cheng, Li
    NEURAL COMPUTING & APPLICATIONS, 2018, 30 (11) : 3353 - 3361
  • [48] Fast and Efficient Malware Detection with Joint Static and Dynamic Features Through Transfer Learning
    Ngo, Mao, V
    Tram Truong-Huu
    Rabadi, Dima
    Loo, Jia Yi
    Teo, Sin G.
    APPLIED CRYPTOGRAPHY AND NETWORK SECURITY, PT I, ACNS 2023, 2023, 13905 : 503 - 531
  • [49] Machine-Learning Classifiers for Malware Detection Using Data Features
    Habtor, Saleh Abdulaziz
    Dahah, Ahmed Haidarah Hasan
    JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2021, 15 (03) : 265 - 290
  • [50] HEMD: a highly efficient random forest-based malware detection framework for Android
    Hui-Juan Zhu
    Tong-Hai Jiang
    Bo Ma
    Zhu-Hong You
    Wei-Lei Shi
    Li Cheng
    Neural Computing and Applications, 2018, 30 : 3353 - 3361