Text-Guided Image Manipulation via Generative Adversarial Network With Referring Image Segmentation-Based Guidance

Cited by: 2

Authors:

Watanabe, Yuto [1]

Togo, Ren [2]

Maeda, Keisuke [2]

Ogawa, Takahiro [2]

Haseyama, Miki [2]

Affiliations:

[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo 0600814, Japan

[2] Hokkaido Univ, Fac Informat Sci & Technol, Sapporo 0600814, Japan

Source:

IEEE Access | 2023 / Vol. 11

Funding:

Japan Society for the Promotion of Science

Keywords:

Image segmentation; Text recognition; Generative adversarial networks; Image color analysis; Visualization; Image reconstruction; Text processing; Text-guided image manipulation; text-to-image synthesis; generative adversarial network; referring image segmentation;

DOI:

10.1109/ACCESS.2023.3269847

Chinese Library Classification number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

This study proposes a novel text-guided image manipulation method that introduces referring image segmentation into a generative adversarial network. The method aims to manipulate images containing multiple objects while preserving text-unrelated regions. It assigns the task of distinguishing text-related from text-unrelated regions in an image to segmentation guidance based on referring image segmentation. With this architecture, the generative adversarial network can focus on generating new attributes according to the text description and reconstructing text-unrelated regions. For challenging input images with multiple objects, the experimental results demonstrate that the proposed method outperforms conventional methods in terms of image manipulation precision.
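The abstract does not say how the segmentation guidance is combined with the generator output. As a rough illustration of the general idea only, the sketch below assumes a per-pixel mask (1 for text-related pixels, 0 for text-unrelated ones) and blends a generated image with the original so that unrelated regions are kept unchanged. The function name `blend_with_mask`, the grayscale nested-list image representation, and the blending rule itself are assumptions made for this example, not the authors' implementation.

```python
from typing import List

Image = List[List[float]]  # grayscale image as rows of pixel intensities
Mask = List[List[float]]   # 1.0 = text-related pixel, 0.0 = text-unrelated pixel


def blend_with_mask(original: Image, generated: Image, mask: Mask) -> Image:
    """Hypothetical helper: take `generated` where mask is 1, keep `original` where mask is 0."""
    if not (len(original) == len(generated) == len(mask)):
        raise ValueError("inputs must have the same number of rows")
    blended = []
    for orig_row, gen_row, mask_row in zip(original, generated, mask):
        if not (len(orig_row) == len(gen_row) == len(mask_row)):
            raise ValueError("inputs must have the same number of columns")
        # Per-pixel convex combination: mask weights the generated pixel,
        # (1 - mask) weights the original pixel.
        blended.append([
            m * g + (1.0 - m) * o
            for o, g, m in zip(orig_row, gen_row, mask_row)
        ])
    return blended


# Toy 2x2 example (all values are made up for illustration).
original = [[0.1, 0.2], [0.3, 0.4]]
generated = [[0.9, 0.8], [0.7, 0.6]]
mask = [[1.0, 0.0], [0.0, 1.0]]
result = blend_with_mask(original, generated, mask)
```

In this toy case the two masked pixels take the generated values and the other two keep the original values. In a real system the mask would come from a referring image segmentation model conditioned on the text, and the paper may use the guidance differently (for example, inside the network or in the loss).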


Pages: 42534-42545

Page count: 12
