Robustness of Generative Adversarial CLIPs Against Single-Character Adversarial Attacks in Text-to-Image Generation

被引:0
作者
Chanakya, Patibandla [1 ]
Harsha, Putla [1 ]
Pratap Singh, Krishna [1 ]
机构
[1] Indian Inst Informat Technol, Dept Informat Technol, Prayagraj 211012, India
关键词
Text to image; Robustness; Perturbation methods; Degradation; Diffusion models; Image synthesis; Image quality; Training; Generators; Generative adversarial networks; Single-character attack; GALIP; GAN; CLIP text encoder; Text-to-image generation; MODELS;
D O I
10.1109/ACCESS.2024.3491017
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Generative Adversarial Networks (GANs) have emerged as a powerful type of generative model, particularly effective at creating images from textual descriptions. Similar to diffusion models, GANs rely on text encoders to extract embeddings from these descriptions. However, this reliance introduces specific vulnerabilities to adversarial attacks. A notable example is a single-character adversarial attack, where altering a single character in the text description can lead to significant performance degradation in the generated image quality and model's performance. In this study, we systematically evaluate the susceptibility of GANs to such attacks using Generative Adversarial CLIP (GALIP), a single-stage architecture that leverages a pre-trained Contrastive Language-Image Pre-training (CLIP) text encoder for text embeddings. We meticulously selected captions with single-character modifications that exhibit maximum and median-distance embeddings for the attack. Experimental results show up to 310.5% degradation in Fr & eacute;chet Inception Distance (FID) scores, underscoring the importance of developing improved defenses in text-to-image synthesis.
引用
收藏
页码:162551 / 162563
页数:13
相关论文
共 82 条
[1]  
Abad Rocamora E., 2024, P 41 INT C MACH LEAR, P1
[2]   An Auto-Encoder based Membership Inference Attack against Generative Adversarial Network [J].
Azadmanesh, Maryam ;
Ghahfarokhi, Behrouz Shahgholi ;
Talouki, Maede Ashouri .
ISECURE-ISC INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2023, 15 (02) :240-253
[3]  
Beal Josh, 2020, arXiv
[4]  
Bird S., 2009, Natural Language Processing with Python
[5]   A full data augmentation pipeline for small object detection based on generative adversarial networks [J].
Bosquet, Brais ;
Cores, Daniel ;
Seidenari, Lorenzo ;
Brea, Victor M. ;
Mucientes, Manuel ;
Del Bimbo, Alberto .
PATTERN RECOGNITION, 2023, 133
[6]   A Novel Data Augmentation Method for Improved Visual Crack Detection Using Generative Adversarial Networks [J].
Branikas, Efstathios ;
Murray, Paul ;
West, Graeme .
IEEE ACCESS, 2023, 11 :22051-22059
[7]  
Cao YH, 2023, Arxiv, DOI [arXiv:2303.04226, DOI 10.48550/ARXIV.2303.04226, 10.48550/arXiv.2303.04226]
[8]  
Cha S.-H., 2007, INT J MATH MODELS ME, V1, P300, DOI [DOI 10.1007/S00167-009-0884-Z, 10.1.1.154.8446]
[9]   TextGuise: Adaptive adversarial example attacks on text classification model [J].
Chang, Guoqin ;
Gao, Haichang ;
Yao, Zhou ;
Xiong, Haoquan .
NEUROCOMPUTING, 2023, 529 :190-203
[10]   CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification [J].
Chen, Chun-Fu ;
Fan, Quanfu ;
Panda, Rameswar .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :347-356