Mask-guided image person removal with data synthesis

被引:2
作者
Jiang, Yunliang [1 ]
Gu, Chenyang [1 ,2 ]
Xue, Zhenfeng [3 ,4 ]
Zhang, Xiongtao [1 ,2 ]
Liu, Yong [3 ]
机构
[1] Huzhou Univ, Sch Informat Engn, Huzhou, Peoples R China
[2] Zhejiang Univ, Intelligent Percept & Control Ctr, Huzhou Inst, Huzhou, Peoples R China
[3] Zhejiang Univ, Inst Cyber Syst & Control, Hangzhou, Peoples R China
[4] Zhejiang Univ, Intelligent Percept & Control Ctr, Huzhou Inst, 819 Xisaishan Rd, Huzhou 313098, Peoples R China
关键词
convolutional neural nets; data analysis; image restoration;
D O I
10.1049/ipr2.12786
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a special case of common object removal, image person removal is playing an increasingly important role in social media and criminal investigation domains. Due to the integrity of person area and the complexity of human posture, person removal has its own dilemmas. In this paper, a novel idea is proposed to tackle these problems from the perspective of data synthesis. Concerning the lack of a dedicated dataset for image person removal, two dataset production methods are proposed to automatically generate images, masks and ground truths, respectively. Then, a learning framework similar to local image degradation is proposed so that the masks can be used to guide the feature extraction process and more texture information can be gathered for final prediction. A coarse-to-fine training strategy is further applied to refine the details. The data synthesis and learning framework combine well with each other. Experimental results verify the effectiveness of the method quantitatively and qualitatively, and the trained network proves to have good generalization ability either on real or synthetic images.
引用
收藏
页码:2214 / 2224
页数:11
相关论文
共 40 条
  • [1] Dynamic Object Removal and Spatio-Temporal RGB-D Inpainting via Geometry-Aware Adversarial Learning
    Besic, Borna
    Valada, Abhinav
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2022, 7 (02): : 170 - 185
  • [2] Bharathi Kannan B., 2021, P INT C MACHINE INTE, P295, DOI DOI 10.1007/978-981-33-4087-9_26
  • [3] Blind inpainting using the fully convolutional neural network
    Cai, Nian
    Su, Zhenghang
    Lin, Zhineng
    Wang, Han
    Yang, Zhijing
    Ling, Bingo Wing-Kuen
    [J]. VISUAL COMPUTER, 2017, 33 (02) : 249 - 261
  • [4] REFIT: A UnifiedWatermark Removal Framework For Deep Learning Systems With Limited Data
    Chen, Xinyun
    Wang, Wenxiao
    Bender, Chris
    Ding, Yiming
    Jia, Ruoxi
    Li, Bo
    Song, Dawn
    [J]. ASIA CCS'21: PROCEEDINGS OF THE 2021 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2021, : 321 - 335
  • [5] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [6] What Makes Paris Look like Paris?
    Doersch, Carl
    Singh, Saurabh
    Gupta, Abhinav
    Sivic, Josef
    Efros, Alexei A.
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04):
  • [7] Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection
    Dwibedi, Debidatta
    Misra, Ishan
    Hebert, Martial
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1310 - 1319
  • [8] PedCut: an iterative framework for pedestrian segmentation combining shape models and multiple data cues
    Flohr, Fabian
    Gavrila, Dariu M.
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
  • [9] MCPA: A Fast Single Image Haze Removal Method Based on the Minimum Channel and Patchless Approach
    Fuh, Chiou-Shann
    Tung, Tzu-Chia
    [J]. IEEE ACCESS, 2022, 10 : 73033 - 73045
  • [10] Gao J., 2022, P IEEE CVF C COMP VI, P599