An Effective Imbalanced JPEG Steganalysis Scheme Based on Adaptive Cost-Sensitive Feature Learning

Cited by: 13
Authors
Jia, Ju [1]
Zhai, Liming [1]
Ren, Weixiang [1]
Wang, Lina [1]
Ren, Yanzhen [1]
Affiliations
[1] Wuhan Univ, Sch Cyber Sci & Engn, Key Lab Aerosp Informat Secur & Trusted Comp, Minist Educ, Wuhan 430072, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Feature extraction; Optimization; Transform coding; Training; Measurement; Learning systems; Estimation; Steganalysis; imbalanced data; adaptive cost-sensitive; feature learning; F-measure maximization; FEATURE-SELECTION; IMAGE STEGANALYSIS; CLASSIFICATION;
DOI
10.1109/TKDE.2020.2995070
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Steganalysis in real-world applications often faces a skewed sample distribution, which poses a major challenge for steganography detection. Conventional steganalysis algorithms are not effective when the training data distribution is imbalanced and may fail outright in such scenarios. To address the imbalanced data distribution issue in steganalysis, a novel framework termed adaptive cost-sensitive feature learning via F-measure maximization is proposed, inspired by the fact that F-measure is a more suitable performance metric than accuracy for imbalanced data. We investigate the adaptive cost-sensitive strategy by generating and assigning a different weight to each misclassified instance. This scheme adaptively determines the weights according to the intra-class and inter-class costs derived from the imbalanced distribution. The features corresponding to the largest F-measure are obtained by solving a series of adaptive cost-sensitive feature learning problems with optimization theory. In this way, the learned features are the most representative features for distinguishing cover images from stego images, so that the imbalanced steganalysis problem is significantly alleviated. Extensive experiments on various imbalanced steganalysis tasks show the superiority of the proposed method over state-of-the-art methods: it recognizes more minority samples and achieves excellent classification performance.
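The abstract's claim that F-measure suits imbalanced data better than accuracy can be illustrated with a small sketch. This is not the paper's method or data: the 950/50 cover-to-stego split, the two toy detectors, and the helper names `accuracy` and `f_measure` are assumptions chosen only for illustration.

```python
# Illustrative sketch only: the class ratio and both detectors are hypothetical.

def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def f_measure(y_true, y_pred, positive=1, beta=1.0):
    """F-beta score for the positive (minority) class; 0.0 if undefined."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    b2 = beta ** 2
    denom = (1 + b2) * tp + b2 * fn + fp
    return (1 + b2) * tp / denom if denom else 0.0

# Hypothetical imbalanced set: 950 cover images (label 0), 50 stego images (label 1).
y_true = [0] * 950 + [1] * 50

# Detector A labels everything as cover and never finds a stego image.
always_cover = [0] * 1000

# Detector B raises 30 false alarms on covers and finds 40 of the 50 stego images.
detector = [1] * 30 + [0] * 920 + [1] * 40 + [0] * 10

acc_trivial = accuracy(y_true, always_cover)
f1_trivial = f_measure(y_true, always_cover)
acc_detector = accuracy(y_true, detector)
f1_detector = f_measure(y_true, detector)

print(f"always-cover: accuracy={acc_trivial:.3f}, F1={f1_trivial:.3f}")
print(f"detector:     accuracy={acc_detector:.3f}, F1={f1_detector:.3f}")
```

Under these assumed numbers the two detectors differ by only one percentage point in accuracy, while F1 separates them clearly, because F1 is computed from the minority-class true positives, false positives, and false negatives alone.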
Pages: 1038-1052
Number of pages: 15