Binary Code Similarity Detection: State and Future

被引:0
|
作者
Li, Zhenshan [1 ]
Liu, Hao [1 ]
Shan, Ruijie [2 ]
Sun, Yanbin [1 ]
Jiang, Yu [1 ]
Hu, Ning [1 ]
机构
[1] Guangzhou Univ, Cyberspace Initiate Adv Technol, Guangzhou, Peoples R China
[2] Beijing Normal Univ, Hong Kong Baptist Univ United Int Coll, Zhuhai, Peoples R China
基金
中国国家自然科学基金;
关键词
network security; feature extraction; binary code similarity; malware analysis;
D O I
10.1109/CloudNet59005.2023.10490019
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Binary code similarity detection (BCSD) is an important research direction in the field of computer network security. The application scenarios for BCSD are widespread and many related methods have been proposed to solve the problems of eliminating code redundancy, vulnerability mining, malware analysis, and code copyright protection. Based on the absence of a comprehensive overview of the BCSD method, the present article provides a review of the state of art on BCSD research. Firstly, this paper introduces the relevant concepts, general process, and challenges faced by BCSD, such as the compiler optimization diversity challenge. Secondly, based on different feature information and method principles, this paper classifies the current BCSD method into three categories: text-based, logic-based, and semantics-based methods. Then, the implementation procedures of multiple BCSD methods are described. Finally, this paper proposes prospects for the technology and application of BCSD.
引用
收藏
页码:408 / 412
页数:5
相关论文
共 50 条
  • [21] CBSDI: Cross-Architecture Binary Code Similarity Detection based on Index Table
    Deng, Longmin
    Zhao, Dongdong
    Zhou, Junwei
    Xia, Zhe
    Xiang, Jianwen
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2022, : 527 - 536
  • [22] Cross-platform binary code similarity detection based on NMT and graph embedding
    Zhu, Xiaodong
    Jiang, Liehui
    Chen, Zeng
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2021, 18 (04) : 4528 - 4551
  • [23] Unleashing the power of pseudo-code for binary code similarity analysis
    Weiwei Zhang
    Zhengzi Xu
    Yang Xiao
    Yinxing Xue
    Cybersecurity, 5
  • [24] Unleashing the power of pseudo-code for binary code similarity analysis
    Zhang, Weiwei
    Xu, Zhengzi
    Xiao, Yang
    Xue, Yinxing
    CYBERSECURITY, 2022, 5 (01)
  • [25] Towards Practical Binary Code Similarity Detection: Vulnerability Verification via Patch Semantic Analysis
    Yang, Shouguo
    Xu, Zhengzi
    Xiao, Yang
    Lang, Zhe
    Tang, Wei
    Liu, Yang
    Shi, Zhiqiang
    Li, Hong
    Sun, Limin
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (06)
  • [26] GenTAL: Generative Denoising Skip-gram Transformer for Unsupervised Binary Code Similarity Detection
    Li, Li Tao
    Ding, Steven H. H.
    Charland, Philippe
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [27] IoTSim: Internet of Things-Oriented Binary Code Similarity Detection with Multiple Block Relations
    Luo, Zhenhao
    Wang, Pengfei
    Xie, Wei
    Zhou, Xu
    Wang, Baosheng
    SENSORS, 2023, 23 (18)
  • [28] CEBin: A Cost-Effective Framework for Large-Scale Binary Code Similarity Detection
    Wang, Hao
    Gao, Zeyu
    Zhang, Chao
    Sun, Mingyang
    Zhou, Yuchen
    Qiu, Han
    Xiao, Xi
    PROCEEDINGS OF THE 33RD ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2024, 2024, : 149 - 161
  • [29] Feature Extraction Methods for Binary Code Similarity Detection Using Neural Machine Translation Models
    Ito, Norimitsu
    Hashimoto, Masaki
    Otsuka, Akira
    IEEE ACCESS, 2023, 11 : 102796 - 102805
  • [30] BINCODEX: A comprehensive and multi-level dataset for evaluating binary code similarity detection techniques
    Zhang P.
    Wu C.
    Wang Z.
    BenchCouncil Transactions on Benchmarks, Standards and Evaluations, 2024, 4 (02):