A Block Cipher Algorithm Identification Scheme Based on Hybrid Random Forest and Logistic Regression Model

被引:5
作者
Yuan, Ke [1 ,2 ]
Huang, Yabing [1 ]
Li, Jiabao [1 ]
Jia, Chunfu [3 ]
Yu, Daoming [1 ]
机构
[1] Henan Univ, Sch Comp & Informat Engn, Kaifeng 475004, Peoples R China
[2] Henan Univ, Henan Key Lab Big Data Anal & Proc, Kaifeng 475004, Peoples R China
[3] Nankai Univ, Coll Cybersecur, Tianjin 300350, Peoples R China
基金
中国国家自然科学基金;
关键词
Cryptographic algorithm identification; Ensemble learning; Random forest; Logistic regression; CLASSIFICATION;
D O I
10.1007/s11063-022-11005-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cryptographic algorithm identification is aimed to analyze the potential feature information in ciphertext data when the ciphertext is known, which belongs to the category of cryptanalysis. This paper takes block cipher algorithm as the research object, and proposes a block cipher algorithm identification scheme based on hybrid random forest and logistic regression (HRFLR) model with the idea of ensemble learning. Based on the NIST randomness test feature extraction method, five block ciphers, AES, 3DES, Blowfish, CAST and RC2, are selected as the research object of cryptographic algorithm identification to carry out the ciphertext classification tasks. The experimental results show that, compared with the existing methods, the cryptographic algorithm identification scheme based on HRFLR proposed in this paper has higher accuracy and stability on binary classification and multi-class classification tasks. In the binary classification tasks of AES and 3DES, the identification accuracy of our proposed cryptographic algorithm identification scheme based on HRFLR can reach up to 74%, and the highest identification accuracy of the five classification tasks is 38%. Compared with the 54% and 28.8% accuracies of random forest-based identification scheme, the accuracy is increased by 37.04% and 18.06%, respectively. This result is significantly better than the 50% and 20% accuracies of random guessing scheme.
引用
收藏
页码:3185 / 3203
页数:19
相关论文
共 26 条
[1]   Identification of significant features and data mining techniques in predicting heart disease [J].
Amin, Mohammad Shafenoor ;
Chiam, Yin Kia ;
Varathan, Kasturi Dewi .
TELEMATICS AND INFORMATICS, 2019, 36 :82-93
[2]   Identifying traffic of same keys in cryptographic communications using fuzzy decision criteria and bit-plane measures [J].
Arvind ;
Ratan, Ram .
INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2020, 11 (02) :466-480
[3]   Amended fused TOPSIS-VIKOR for classification (ATOVIC) applied to some UCI data sets [J].
Baccour, Leila .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 99 :115-125
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   Multiconstrained Real-Time Entry Guidance Using Deep Neural Networks [J].
Cheng, Lin ;
Jiang, Fanghua ;
Wang, Zhenbo ;
Li, Junfeng .
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2021, 57 (01) :325-340
[6]  
Chou J-W, 2012, P ACM C COMP COMM SE, P105, DOI [10.1145/2381896.2381912, DOI 10.1145/2381896.2381912]
[7]   Random forests for classification in ecology [J].
Cutler, D. Richard ;
Edwards, Thomas C., Jr. ;
Beard, Karen H. ;
Cutler, Adele ;
Hess, Kyle T. .
ECOLOGY, 2007, 88 (11) :2783-2792
[8]   A distinguishing attack with a neural network [J].
de Souza, William A. R. ;
Tomlinson, Allan .
2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013, :154-161
[9]  
Dileep AD, 2006, IEEE IJCNN, P2696
[10]  
Esfahani HA, 2017, 2017 IEEE 4TH INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED ENGINEERING AND INNOVATION (KBEI), P1011, DOI 10.1109/KBEI.2017.8324946