Mask-guided image person removal with data synthesis

被引：2

作者：

Jiang, Yunliang ^{[1
]}

Gu, Chenyang ^{[1
,2
]}

Xue, Zhenfeng ^{[3
,4
]}

Zhang, Xiongtao ^{[1
,2
]}

Liu, Yong ^{[3
]}

机构：

[1] Huzhou Univ, Sch Informat Engn, Huzhou, Peoples R China

[2] Zhejiang Univ, Intelligent Percept & Control Ctr, Huzhou Inst, Huzhou, Peoples R China

[3] Zhejiang Univ, Inst Cyber Syst & Control, Hangzhou, Peoples R China

[4] Zhejiang Univ, Intelligent Percept & Control Ctr, Huzhou Inst, 819 Xisaishan Rd, Huzhou 313098, Peoples R China

来源：

IET IMAGE PROCESSING | 2023年 / 17卷 / 07期

关键词：

convolutional neural nets; data analysis; image restoration;

D O I：

10.1049/ipr2.12786

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As a special case of common object removal, image person removal is playing an increasingly important role in social media and criminal investigation domains. Due to the integrity of person area and the complexity of human posture, person removal has its own dilemmas. In this paper, a novel idea is proposed to tackle these problems from the perspective of data synthesis. Concerning the lack of a dedicated dataset for image person removal, two dataset production methods are proposed to automatically generate images, masks and ground truths, respectively. Then, a learning framework similar to local image degradation is proposed so that the masks can be used to guide the feature extraction process and more texture information can be gathered for final prediction. A coarse-to-fine training strategy is further applied to refine the details. The data synthesis and learning framework combine well with each other. Experimental results verify the effectiveness of the method quantitatively and qualitatively, and the trained network proves to have good generalization ability either on real or synthetic images.

引用

页码：2214 / 2224

页数：11

共 40 条

[1] Dynamic Object Removal and Spatio-Temporal RGB-D Inpainting via Geometry-Aware Adversarial Learning
Besic, Borna
Valada, Abhinav
[J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2022, 7 (02): : 170 - 185
[2] Bharathi Kannan B., 2021, P INT C MACHINE INTE, P295, DOI DOI 10.1007/978-981-33-4087-9_26
[3] Blind inpainting using the fully convolutional neural network
Cai, Nian
Su, Zhenghang
Lin, Zhineng
Wang, Han
Yang, Zhijing
Ling, Bingo Wing-Kuen
[J]. VISUAL COMPUTER, 2017, 33 (02) : 249 - 261
[4] REFIT: A UnifiedWatermark Removal Framework For Deep Learning Systems With Limited Data
Chen, Xinyun
Wang, Wenxiao
Bender, Chris
Ding, Yiming
Jia, Ruoxi
Li, Bo
Song, Dawn
[J]. ASIA CCS'21: PROCEEDINGS OF THE 2021 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2021, : 321 - 335
[5] The Cityscapes Dataset for Semantic Urban Scene Understanding
Cordts, Marius
Omran, Mohamed
Ramos, Sebastian
Rehfeld, Timo
Enzweiler, Markus
Benenson, Rodrigo
Franke, Uwe
Roth, Stefan
Schiele, Bernt
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
[6] What Makes Paris Look like Paris?
Doersch, Carl
Singh, Saurabh
Gupta, Abhinav
Sivic, Josef
Efros, Alexei A.
[J]. ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04):
[7] Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection
Dwibedi, Debidatta
Misra, Ishan
Hebert, Martial
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1310 - 1319
[8] PedCut: an iterative framework for pedestrian segmentation combining shape models and multiple data cues
Flohr, Fabian
Gavrila, Dariu M.
[J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
[9] MCPA: A Fast Single Image Haze Removal Method Based on the Minimum Channel and Patchless Approach
Fuh, Chiou-Shann
Tung, Tzu-Chia
[J]. IEEE ACCESS, 2022, 10 : 73033 - 73045
[10] Gao J., 2022, P IEEE CVF C COMP VI, P599

← 1 2 3 4 →