FindMal: A file-to-file social network based malware detection framework

被引:15
|
作者
Ni, Ming [1 ]
Li, Tao [2 ,3 ]
Li, Qianmu [1 ]
Zhang, Hong [1 ]
Ye, Yanfang [4 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China
[2] Florida Int Univ, Sch Comp & Informat Sci, Miami, FL 33199 USA
[3] Nanjing Univ Posts & Telecommun, Sch Comp Sci & Technol, Nanjing 210023, Jiangsu, Peoples R China
[4] West Virginia Univ, Dept Comp Sci & Elect Engn, Morgantown, WV 26506 USA
关键词
Malware detection; File relation graph; Graph feature; Label propagation; Active learning;
D O I
10.1016/j.knosys.2016.09.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapid development of malicious software programs has posed severe threats to Computer and Internet security. Therefore, it motivates anti-malware vendors and researchers to develop novel methods which are capable of protecting users against new threats. Existing malware detectors mostly treat the file samples separately using supervised learning algorithms. However, ignoring the relationship among file samples limits the capability of malware detectors. In this paper, based on the file-to-file social network, we present a new malware detection framework, FindMal(File-to-File Social Network based Malware Detection Framework), including graph-based features extraction, Label Propagation algorithm, and active learning strategy. Nearest neighbors are first chosen as adjacent nodes for each file node to construct kNN file relation graph. Three file relation graph features are proposed to sample the representative file samples for labeling. Then, Label Propagation algorithm, which propagates the label information from labeled file samples to unlabeled files, is applied to learn the probability that one unknown file is classified as malicious or benign. A batch mode active learning method is employed to reduce the labeling cost and improve the performance of Label Propagation. Comprehensive experiments on real and large scale dataset obtained from an anti-malware company are performed. The results demonstrate that our proposed FindMal outperforms other existing detection models in classifying file samples. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:142 / 151
页数:10
相关论文
共 50 条
  • [1] AN ANDROID MALWARE DETECTION METHOD BASED ON ANDROIDMANIFEST FILE
    Li, Xiang
    Liu, Jianyi
    Huo, Yanyu
    Zhang, Ru
    Yao, Yuangang
    PROCEEDINGS OF 2016 4TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (IEEE CCIS 2016), 2016, : 239 - 243
  • [2] GooseBt: A programmable malware detection framework based on process, file, registry, and COM monitoring
    Yang, Yuer
    Lin, Yifeng
    Li, Zhiying
    Zhao, Liangtian
    Yao, Mengting
    Lai, Yixi
    Li, Peiya
    COMPUTER COMMUNICATIONS, 2023, 204 : 24 - 32
  • [3] N-GRAMS-BASED FILE SIGNATURES FOR MALWARE DETECTION
    Santos, Igor
    penya, Yoseba K.
    Devesa, Jaime
    Bringas, Pablo G.
    ICEIS 2009 : PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL AIDSS, 2009, : 317 - 320
  • [4] A topic modeling-based approach to executable file malware detection
    Hilal, Waleed
    Wilkinson, Connor
    Alsadi, Naseem
    Surucu, Onur
    Giuliano, Alessandro
    Gadsden, Stephen A.
    Yawney, John
    DISRUPTIVE TECHNOLOGIES IN INFORMATION SCIENCES VI, 2022, 12117
  • [5] Distributed Malware Detection based on Binary File Features in Cloud Computing Environment
    Han, Xiaoguang
    Sun, Jigang
    Qu, Wu
    Yao, Xuanxia
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 4083 - 4088
  • [6] A Neural Network Approach to a Grayscale Image-Based Multi-File Type Malware Detection System
    Copiaco, Abigail
    El Neel, Leena
    Nazzal, Tasnim
    Mukhtar, Husameldin
    Obaid, Walid
    APPLIED SCIENCES-BASEL, 2023, 13 (23):
  • [7] Detection of Adversarial PE File Malware via Model Interpretation
    Tian Z.-C.
    Zhang W.-Z.
    Qiao Y.-C.
    Liu Y.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (04): : 1926 - 1943
  • [8] Malware Detection Using Byte Streams of Different File Formats
    Jeong, Young-Seob
    Lee, Sang-Min
    Kim, Jong-Hyun
    Woo, Jiyoung
    Kang, Ah Reum
    IEEE ACCESS, 2022, 10 : 51041 - 51047
  • [9] Intelligent File Scoring System for Malware Detection from the Gray List
    Ye, Yanfang
    Li, Tao
    Jiang, Qingshan
    Han, Zhixue
    Wan, Li
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 1385 - 1393
  • [10] Learning edge weights in file co-occurrence graphs for malware detection
    Mao, Weixuan
    Cai, Zhongmin
    Zeng, Bo
    Guan, Xiaohong
    DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (01) : 168 - 203