High-Resolution Image Inpainting Based on Multi-Scale Neural Network

被引:13
作者
Sun, Tingzhu [1 ,2 ]
Fang, Weidong [3 ]
Chen, Wei [1 ,2 ,4 ]
Yao, Yanxin [5 ]
Bi, Fangming [1 ,2 ]
Wu, Baolei [1 ,2 ]
机构
[1] China Univ Min Technol, Sch Comp Sci & Technol, Xuzhou 221000, Jiangsu, Peoples R China
[2] China Univ Min & Technol, Mine Digitizat Engn Res Ctr, Minist Educ, Xuzhou 221116, Jiangsu, Peoples R China
[3] Chinese Acad Sci, Shanghai Inst Microsyst & Informat Technol, Key Lab Wireless Sensor Network & Commun, Shanghai 201800, Peoples R China
[4] Peking Univ, Sch Earth & Space Sci, Beijing 100871, Peoples R China
[5] Beijing Informat Sci & Technol Univ, Sch Commun & Informat Engn, Beijing 100101, Peoples R China
基金
中国国家自然科学基金;
关键词
image inpainting; content reconstruction; instance segmentation;
D O I
10.3390/electronics8111370
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although image inpainting based on the generated adversarial network (GAN) has made great breakthroughs in accuracy and speed in recent years, they can only process low-resolution images because of memory limitations and difficulty in training. For high-resolution images, the inpainted regions become blurred and the unpleasant boundaries become visible. Based on the current advanced image generation network, we proposed a novel high-resolution image inpainting method based on multi-scale neural network. This method is a two-stage network including content reconstruction and texture detail restoration. After holding the visually believable fuzzy texture, we further restore the finer details to produce a smoother, clearer, and more coherent inpainting result. Then we propose a special application scene of image inpainting, that is, to delete the redundant pedestrians in the image and ensure the reality of background restoration. It involves pedestrian detection, identifying redundant pedestrians and filling in them with the seemingly correct content. To improve the accuracy of image inpainting in the application scene, we proposed a new mask dataset, which collected the characters in COCO dataset as a mask. Finally, we evaluated our method on COCO and VOC dataset. the experimental results show that our method can produce clearer and more coherent inpainting results, especially for high-resolution images, and the proposed mask dataset can produce better inpainting results in the special application scene.
引用
收藏
页数:14
相关论文
共 38 条
[1]  
[Anonymous], P IEEE C COMP VIS PA
[2]  
[Anonymous], PROC CVPR IEEE
[3]  
[Anonymous], ADV NEURAL INFORM PR, DOI DOI 10.1109/TPAMI.2016.2577031
[4]  
[Anonymous], ARXIV170400028
[5]  
[Anonymous], PROC CVPR IEEE
[6]  
[Anonymous], ARXIV160903126
[7]  
[Anonymous], 2017, ADV NEURAL INFORM PR
[8]  
[Anonymous], 2016, NIPS
[9]   PatchMatch: A Randomized Correspondence Algorithm for Structural Image Editing [J].
Barnes, Connelly ;
Shechtman, Eli ;
Finkelstein, Adam ;
Goldman, Dan B. .
ACM TRANSACTIONS ON GRAPHICS, 2009, 28 (03)
[10]   Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].
Cao, Zhe ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310