Detecting AI-Generated Images Using a Hybrid ResNet-SE Attention Model

被引：0

作者：

Gunukula, Abhilash Reddy ^{[1
]}

Das Gupta, Himel ^{[2
]}

Sheng, Victor S. ^{[1
]}

机构：

[1] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA

[2] Louisiana State Univ Alexandria, Dept Math & Comp Sci, Alexandria, LA 71302 USA

来源：

APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 13期

关键词：

deep learning; AI-generated images; SE-ResNet34; squeeze-and-excitation; image classification; CIFAKE dataset;

D O I：

10.3390/app15137421

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

The rapid advancements in generative artificial intelligence (AI), particularly through models like Generative Adversarial Networks (GANs) and diffusion-based architectures, have made it increasingly difficult to distinguish between real and synthetically generated images. While these technologies offer benefits in creative domains, they also pose serious risks in terms of misinformation, digital forgery, and identity manipulation. This paper presents a novel hybrid deep learning model for detecting AI-generated images by integrating the ResNet-50 architecture with Squeeze-and-Excitation (SE) attention blocks. The proposed SE-ResNet50 model enhances channel-wise feature recalibration and interpretability by integrating Squeeze-and-Excitation (SE) blocks into the ResNet-50 backbone, enabling dynamic emphasis on subtle generative artifacts such as unnatural textures and semantic inconsistencies, thereby improving classification fidelity. Experimental evaluation on the CIFAKE dataset demonstrates the model's effectiveness, achieving a test accuracy of 96.12%, precision of 97.04%, recall of 88.94%, F1-score of 92.82%, and an AUC score of 0.9862. The model shows strong generalization, minimal overfitting, and superior performance compared with transformer-based models and standard architectures like ResNet-50, VGGNet, and DenseNet. These results confirm the hybrid model's suitability for real-time and resource-constrained applications in media forensics, content authentication, and ethical AI governance.

引用

页数：20

共 32 条

[1]

Ait El Asri O., 2024, Int. J. Electr. Comput. Eng, V14, P1113, DOI [10.11591/ijece.v14i4.pp4531-4541, DOI 10.11591/IJECE.V14I4.PP4531-4541]

[2] Deepfake Video Detection through Optical Flow based CNN [J].

Amerini, Irene ;

Galteri, Leonardo ;

Caldelli, Roberto ;

Del Bimbo, Alberto .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :1205-1207

[3] CIFAKE: Image Classification and Explainable Identification of AI-Generated Synthetic Images [J].

Bird, Jordan J. ;

Lotfi, Ahmad .

IEEE ACCESS, 2024, 12 :15642-15650

[4] Writer-independent signature verification; Evaluation of robotic and generative adversarial attacks [J].

Bird, Jordan J. ;

Naser, Abdallah ;

Lotfi, Ahmad .

INFORMATION SCIENCES, 2023, 633 :170-181

[5] On the use of Benford's law to detect GAN-generated images [J].

Bonettini, Nicolo ;

Bestagini, Paolo ;

Milani, Simone ;

Tubaro, Stefano .

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :5495-5502

[6]

Chambon P, 2022, Arxiv, DOI arXiv:2210.04133

[7]

Da Yi, 2021, 2021 IEEE 1st International Conference on Digital Twins and Parallel Intelligence (DTPI), P332, DOI 10.1109/DTPI52967.2021.9540115

[8] AdvFaces: Adversarial Face Synthesis [J].

Deb, Debayan ;

Zhang, Jianbang ;

Jain, Anil K. .

IEEE/IAPR INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2020), 2020,

[9]

Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171

[10] Automated pulmonary nodule detection in CT images using 3D deep squeeze-and-excitation networks [J].

Gong, Li ;

Jiang, Shan ;

Yang, Zhiyong ;

Zhang, Guobin ;

Wang, Lu .

INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2019, 14 (11) :1969-1979

← 1 2 3 4 →