Data Mining in Programs: Clustering Programs Based on Structure Metrics and Execution Values

Cited by: 2
Authors
Wang, TianTian [1]
Wang, KeChao [2]
Su, XiaoHong [1]
Liu, Lin [2]
Affiliations
[1] Harbin Institute of Technology, School of Computer Science and Technology, Harbin, People's Republic of China
[2] Harbin University, School of Information Engineering, Harbin, People's Republic of China
Funding
National Natural Science Foundation of China;
Keywords
Clustering; Data Mining; Program Repair; Structural Metrics; Value Sequence;
DOI
10.4018/IJDWM.2020040104
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
Software is embedded in many control systems, including security-critical ones. Existing program clustering methods are limited in their ability to identify functionally equivalent programs that have different syntactic representations. To address this problem, the paper first proposes a clustering method based on structural metric vectors, which quickly identifies structurally similar programs among a large number of existing programs. It then proposes a clustering method based on similar execution value sequences, which accurately identifies functionally equivalent programs despite code variations. The approach has been applied to automatic program repair, where it identifies sample programs from a large pool of template programs. The average purity is 0.95576 and the average entropy is 0.15497, indicating that the clustering partition is consistent with the expected partition.
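The abstract reports purity and entropy without stating how they were computed. As a rough illustration only, the sketch below uses one common textbook definition of both measures: purity is the fraction of items that belong to the majority class of their cluster, and entropy is the size-weighted average of each cluster's class entropy. The function name `purity_and_entropy`, the base-2 logarithm, and the absence of normalization are assumptions made here, not details taken from the paper, so the values it produces may not be directly comparable to the figures reported above.

```python
from collections import Counter
from math import log2


def purity_and_entropy(cluster_ids, true_labels):
    """Return (purity, entropy) of a clustering against expected labels.

    Assumed definitions (not confirmed by the source):
      purity  = sum over clusters of (majority class count) / n
      entropy = sum over clusters of (cluster size / n) * H(cluster),
                where H is the base-2 Shannon entropy of the class
                distribution inside the cluster.
    """
    if len(cluster_ids) != len(true_labels) or not cluster_ids:
        raise ValueError("inputs must be non-empty and of equal length")

    n = len(cluster_ids)

    # Count how many items of each expected class fall into each cluster.
    clusters = {}
    for cid, label in zip(cluster_ids, true_labels):
        clusters.setdefault(cid, Counter())[label] += 1

    purity = 0.0
    entropy = 0.0
    for counts in clusters.values():
        size = sum(counts.values())
        purity += max(counts.values()) / n
        cluster_entropy = -sum(
            (c / size) * log2(c / size) for c in counts.values()
        )
        entropy += (size / n) * cluster_entropy
    return purity, entropy


if __name__ == "__main__":
    # Hypothetical toy data: six programs, two clusters, two expected groups.
    found = [0, 0, 0, 1, 1, 1]
    expected = ["sort", "sort", "search", "search", "search", "search"]
    print(purity_and_entropy(found, expected))
```

Under these definitions, a purity close to 1 together with an entropy close to 0 means that each cluster is dominated by a single expected group, which matches the abstract's reading of its results.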
Pages: 48-63
Page count: 16