Empirical Evaluation of Smart Contract Testing: What Is the Best Choice?

被引:42
|
作者
Ren, Meng [1 ]
Yin, Zijing [1 ]
Ma, Fuchen [1 ]
Xu, Zhenyang [2 ]
Jiang, Yu [1 ]
Sun, Chengnian [2 ]
Li, Huizhong [3 ]
Cai, Yan [4 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Univ Waterloo, Waterloo, ON, Canada
[3] WeBank, Shenzhen, Peoples R China
[4] Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing, Peoples R China
来源
ISSTA '21: PROCEEDINGS OF THE 30TH ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS | 2021年
基金
欧洲研究理事会;
关键词
smart contract testing; evaluation; observations and solutions;
D O I
10.1145/3460319.3464837
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Security of smart contracts has attracted increasing attention in recent years. Many researchers have devoted themselves to devising testing tools for vulnerability detection. Each published tool has demonstrated its effectiveness through a series of evaluations on their own experimental scenarios. However, the inconsistency of evaluation settings such as different data sets or performance metrics, may result in biased conclusion. In this paper, based on an empirical evaluation of widely used smart contract testing tools, we propose a unified standard to eliminate the bias in the assessment process. First, we collect 46,186 source-available smart contracts from four influential organizations. This comprehensive dataset is open to the public and involves different code characteristics, vulnerability patterns and application scenarios. Then we propose a 4-step evaluation process and summarize the difference among relevant work in these steps. We use nine representative tools to carry out extensive experiments. The results demonstrate that different choices of experimental settings could significantly affect tool performance and lead to misleading or even opposite conclusions. Finally, we generalize some problems of existing testing tools, and propose some possible directions for further improvement.
引用
收藏
页码:566 / 579
页数:14
相关论文
共 9 条
  • [1] Evaluation of Contemporary Smart Contract Analysis Tools
    Wang, Baocheng
    Chen, Shiping
    Wang, Qin
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING, ENASE 2023, 2023, : 322 - 329
  • [2] GethReplayer: a smart contract testing method based on transaction replay
    Shi, Xiaohong
    Wang, Ziyan
    Zhong, Runlu
    Ma, Liangliang
    Chen, Xiangping
    Yang, Peng
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2024, 20 (04) : 358 - 373
  • [3] Design and empirical testing of a checklist for the evaluation of multimedia software for children
    Duda, S
    DESIGNING EFFECTIVE AND USABLE MULTIMEDIA SYSTEMS, 1998, : 191 - 204
  • [4] Gap between academia and industry: a case of empirical evaluation of three software testing methods
    Farooq, Sheikh Umar
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2019, 10 (06) : 1487 - 1504
  • [5] Gap between academia and industry: a case of empirical evaluation of three software testing methods
    Sheikh Umar Farooq
    International Journal of System Assurance Engineering and Management, 2019, 10 : 1487 - 1504
  • [6] Best practices for heuristic evaluation, usability testing and redesign for health-related web sites
    Davis, W
    Whitaker, S
    Kinzie, M
    Cohn, W
    Knaus, W
    MEDINFO 2001: PROCEEDINGS OF THE 10TH WORLD CONGRESS ON MEDICAL INFORMATICS, PTS 1 AND 2, 2001, 84 : 1218 - 1218
  • [7] Treatment of female stress urinary incontinence: what women find acceptable and the impact of clinical and urodynamic evaluation on their final choice
    Elmissiry, Mostafa
    Mahdy, Ayman
    Ghoniem, Gamal
    SCANDINAVIAN JOURNAL OF UROLOGY AND NEPHROLOGY, 2011, 45 (05): : 326 - 331
  • [8] To what extent do the characteristics of the object of evaluation influence the choice of epistemological framework? - The case of universal newborn hearing screening
    Young, A
    Tattersall, H
    Uus, K
    Bamford, J
    McCracken, W
    QUALITATIVE HEALTH RESEARCH, 2004, 14 (06) : 866 - 874
  • [9] Quantitative evaluation of empirical models of vortex-induced vibration of bridge decks through sectional model wind tunnel testing
    Xu, Kun
    Ge, Yaojun
    Zhao, Lin
    ENGINEERING STRUCTURES, 2020, 219