Binary Code Similarity Detection: State and Future

被引:0
|
作者
Li, Zhenshan [1 ]
Liu, Hao [1 ]
Shan, Ruijie [2 ]
Sun, Yanbin [1 ]
Jiang, Yu [1 ]
Hu, Ning [1 ]
机构
[1] Guangzhou Univ, Cyberspace Initiate Adv Technol, Guangzhou, Peoples R China
[2] Beijing Normal Univ, Hong Kong Baptist Univ United Int Coll, Zhuhai, Peoples R China
基金
中国国家自然科学基金;
关键词
network security; feature extraction; binary code similarity; malware analysis;
D O I
10.1109/CloudNet59005.2023.10490019
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Binary code similarity detection (BCSD) is an important research direction in the field of computer network security. The application scenarios for BCSD are widespread and many related methods have been proposed to solve the problems of eliminating code redundancy, vulnerability mining, malware analysis, and code copyright protection. Based on the absence of a comprehensive overview of the BCSD method, the present article provides a review of the state of art on BCSD research. Firstly, this paper introduces the relevant concepts, general process, and challenges faced by BCSD, such as the compiler optimization diversity challenge. Secondly, based on different feature information and method principles, this paper classifies the current BCSD method into three categories: text-based, logic-based, and semantics-based methods. Then, the implementation procedures of multiple BCSD methods are described. Finally, this paper proposes prospects for the technology and application of BCSD.
引用
收藏
页码:408 / 412
页数:5
相关论文
共 50 条
  • [1] Binary Code Similarity Detection
    Liu, Zian
    2021 36TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING ASE 2021, 2021, : 1056 - 1060
  • [2] A Survey of Binary Code Similarity Detection Techniques
    Ruan, Liting
    Xu, Qizhen
    Zhu, Shunzhi
    Huang, Xujing
    Lin, Xinyang
    ELECTRONICS, 2024, 13 (09)
  • [3] CodeExtract: Enhancing Binary Code Similarity Detection with Code Extraction Techniques
    Jia, Lichen
    Wu, Chenggang
    Zhang, Peihua
    Wang, Zhe
    PROCEEDINGS OF THE 25TH ACM SIGPLAN/SIGBED INTERNATIONAL CONFERENCE ON LANGUAGES, COMPILERS, AND TOOLS FOR EMBEDDED SYSTEMS, LCTES 2024, 2024, : 143 - 154
  • [4] Unsupervised Binary Code Translation with Application to Code Similarity Detection and Vulnerability Discovery
    Ahmad, Iftakhar
    Luo, Lannan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 14581 - 14592
  • [5] BinDeep: A deep learning approach to binary code similarity detection
    Tian, Donghai
    Jia, Xiaoqi
    Ma, Rui
    Liu, Shuke
    Liu, Wenjing
    Hu, Changzhen
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 168
  • [6] Practical Binary Code Similarity Detection with BERT-based Transferable Similarity Learning
    Ahn, Sunwoo
    Ahn, Seonggwan
    Koo, Hyungjoon
    Paek, Yunheung
    PROCEEDINGS OF THE 38TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, ACSAC 2022, 2022, : 361 - 374
  • [7] A Survey of Binary Code Similarity
    Ul Haq, Irfan
    Caballero, Juan
    ACM COMPUTING SURVEYS, 2022, 54 (03)
  • [8] UniASM: Binary code similarity detection without fine-tuning
    Gu, Yeming
    Shu, Hui
    Kang, Fei
    Hu, Fan
    NEUROCOMPUTING, 2025, 630
  • [9] jTrans: Jump-Aware Transformer for Binary Code Similarity Detection
    Wang, Hao
    Qu, Wenjie
    Katz, Gilad
    Zhu, Wenyu
    Gao, Zeyu
    Qiu, Han
    Zhuge, Jianwei
    Zhang, Chao
    PROCEEDINGS OF THE 31ST ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2022, 2022, : 1 - 13
  • [10] αDiff: Cross-Version Binary Code Similarity Detection with DNN
    Liu, Bingchang
    Huo, Wei
    Zhang, Chao
    Li, Wenchao
    Li, Feng
    Piao, Aihua
    Zou, Wei
    PROCEEDINGS OF THE 2018 33RD IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMTED SOFTWARE ENGINEERING (ASE' 18), 2018, : 667 - 678