Data Mining in Programs: Clustering Programs Based on Structure Metrics and Execution Values

被引:2
作者
Wang, TianTian [1 ]
Wang, KeChao [2 ]
Su, XiaoHong [1 ]
Liu, Lin [2 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Peoples R China
[2] Harbin Univ, Sch Informat Engn, Harbin, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering; Data Mining; Program Repair; Structural Metrics; Value Sequence;
D O I
10.4018/IJDWM.2020040104
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Software exists in various control systems, such as security-critical systems and so on. Existing program clustering methods are limited in identifying functional equivalent programs with different syntactic representations. To solve this problem, firstly, a clustering method based on structured metric vectors was proposed to quickly identify structurally similar programs from a large number of existing programs. Next, a clustering method based on similar execution value sequences was proposed, to accurately identify the functional equivalent programs with code variations. This approach has been applied in automatic program repair, to identify sample programs from a large pool of template programs. The average purity value is 0.95576 and the average entropy is 0.15497. This means that the clustering partition is consistent with the expected partition.
引用
收藏
页码:48 / 63
页数:16
相关论文
共 50 条
[21]   Data Mining Methods for Educational Effectiveness in Higher Education Aesthetic Education Programs [J].
Wei, Qidan .
Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
[22]   Density Based Clustering Technique For Efficient Data Mining [J].
Rahman, Md Asikur ;
Chowdhury, A. K. M. Rasheduzzaman ;
Rahman, Daud Md Jamilur ;
Kamal, Abu Raihan Mostofa .
2008 11TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY: ICCIT 2008, VOLS 1 AND 2, 2008, :706-710
[23]   Research on spatial clustering algorithm based on data mining [J].
Lv R. ;
Zhao J.K. ;
Li Y. .
International Journal of Database Theory and Application, 2016, 9 (12) :217-230
[24]   A clustering algorithm for data mining based on swarm intelligence [J].
Jin, Peng ;
Zhu, Vun-Long ;
Hu, Kun-Yuan .
PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, :803-807
[25]   A Fuzzy Clustering Algorithm of Data Mining Based on IWO [J].
Zhao Xiao-qiang ;
Zhou Jin-Hu ;
Yang Jia-Min .
2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, :7988-7993
[26]   Clustering algorithm in data mining based on web log [J].
Jing L. .
International Journal of Simulation: Systems, Science and Technology, 2016, 17 (36) :31.1-31.6
[27]   Data mining approach based on clustering and association rules applicable to different fields [J].
Belabed, Imane ;
Talibi Alaoui, Mohammed ;
Talibi Alaoui, Youssef ;
Belabed, Abdelmajid .
2018 INTERNATIONAL CONFERENCE ON ELECTRONICS, CONTROL, OPTIMIZATION AND COMPUTER SCIENCE (ICECOCS), 2018,
[28]   Patterns Discovery of Load Curves Characteristics Using Clustering Based Data Mining [J].
Neagu, Bogdan ;
Grigoras, Gheorghe ;
Scarlatache, Florina ;
Schreiner, Cristina ;
Ciobanu, Romeo .
2017 11TH IEEE INTERNATIONAL CONFERENCE ON COMPATIBILITY, POWER ELECTRONICS AND POWER ENGINEERING (CPE-POWERENG), 2017, :83-87
[29]   Understanding time use via data mining: A clustering-based framework [J].
Rosales-Salas, Jorge ;
Maldonado, Sebastian ;
Seret, Alex .
INTELLIGENT DATA ANALYSIS, 2018, 22 (03) :597-616
[30]   Target values of combustion optimization in coal fire boiler based on data mining [J].
Wang, Wenhuan ;
Pan, Weiguo ;
He, Mingfu ;
Pan, Bingchao ;
Pan, Yiqiong ;
Hu, Guoxin .
PROGRESS IN RENEWABLE AND SUSTAINABLE ENERGY, PTS 1 AND 2, 2013, 608-609 :1143-+