Windows Malware Binaries in C/C plus plus GitHub Repositories: Prevalence and Lessons Learned

被引:1
|
作者
La Cholter, William [1 ]
Elder, Matthew [1 ]
Stalick, Antonius [1 ]
机构
[1] Johns Hopkins Univ, Appl Phys Lab, Baltimore, MD 21218 USA
来源
ICISSP: PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS SECURITY AND PRIVACY | 2021年
关键词
Malware; GitHub; Open Source Software; Windows;
D O I
10.5220/0010237904750484
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Does malware lurking in GitHub pose a threat? GitHub is the most popular open source software website, having 188 million repositories. GitHub hosts malware-related projects for research and educational purposes and has also been used by malware to attack users. In this paper, we explore the prevalence of unencrypted, uncompressed binary code malware in Microsoft Windows compatible C and C++ GitHub repositories and characterize the threat. We mined 1,835 repositories for already-compiled malicious files and data suggesting whether the repository is malware-related. We focused on these repositories because Windows is frequently targeted by malware written in C or C++. These repositories are good resources for attackers and could target Windows users. We extracted all Portable Executable (PE) files from all commits and queried the malware resource VirusTotal for analysis from its 76 anti-virus engines. Of the 24,395 files, 4,335 are suspicious, with at least one detection; 440 could be considered malicious, with at least seven detections. We identify topic tags suggesting malware or offensive security content, to differentiate from seemingly benign repositories. 197 of 440 malicious executables were in 27 ostensibly benign repositories. This work illustrates risks in source code repositories and lessons learned in relating GitHub and VirusTotal data.
引用
收藏
页码:475 / 484
页数:10
相关论文
共 50 条
  • [1] Interactive C plus plus code development using C plus plus Explorer and GitHub classroom for educational purposes
    Diehl, Patrick
    Brandt, Steven R.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (18):
  • [2] Scalable data structure detection and classification for C/C plus plus binaries
    Haller, Istvan
    Slowinska, Asia
    Bos, Herbert
    EMPIRICAL SOFTWARE ENGINEERING, 2016, 21 (03) : 778 - 810
  • [3] DSIbin: Identifying Dynamic Data Structures in C/C plus plus Binaries
    Rupprecht, Thomas
    Chen, Xi
    White, David H.
    Boockmann, Jan H.
    Luettgen, Gerald
    Bos, Herbert
    PROCEEDINGS OF THE 2017 32ND IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE'17), 2017, : 331 - 341
  • [4] Recovering Container Class Types in C plus plus Binaries
    Wang, Xudong
    Xu, Xuezheng
    Li, Qingan
    Yuan, Mengting
    Xue, Jingling
    CGO '22: PROCEEDINGS OF THE 2022 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO), 2022, : 131 - 143
  • [5] Devil is Virtual: Reversing Virtual Inheritance in C plus plus Binaries
    Erinfolami, Rukayat Ayomide
    Prakash, Aravind
    CCS '20: PROCEEDINGS OF THE 2020 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2020, : 133 - 148
  • [6] Strict Virtual Call Integrity Checking for C plus plus Binaries
    Elsabagh, Mohamed
    Fleck, Dan
    Stavrou, Angelos
    PROCEEDINGS OF THE 2017 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (ASIA CCS'17), 2017, : 140 - 154
  • [7] MemPick: High-Level Data Structure Detection in C/C plus plus Binaries
    Haller, Istvan
    Slowinska, Asia
    Bos, Herbert
    2013 20TH WORKING CONFERENCE ON REVERSE ENGINEERING (WCRE), 2013, : 32 - 41
  • [8] On Design Inference from Binaries Compiled using Modern C plus plus Defenses
    Erinfolami, Rukayat Ayomide
    Anh Quach
    Prakash, Aravind
    PROCEEDINGS OF THE 22ND INTERNATIONAL SYMPOSIUM ON RESEARCH IN ATTACKS, INTRUSIONS AND DEFENSES, 2019, : 17 - 30
  • [9] DeClassifier: Class-Inheritance Inference Engine for Optimized C plus plus Binaries
    Erinfolami, Rukayat Ayomide
    Prakash, Aravind
    PROCEEDINGS OF THE 2019 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (ASIACCS '19), 2019, : 28 - 40
  • [10] vfGuard: Strict Protection for Virtual Function Calls in COTS C plus plus Binaries
    Prakash, Aravind
    Hu, Xunchao
    Yin, Heng
    22ND ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2015), 2015,