Adaptive Visual Field Multi-scale Generative Adversarial Networks Image Inpainting Base on Coordinate-Attention

被引:2
作者
Chen, Gang [1 ,2 ]
Kang, Peipei [2 ]
Wu, Xingcai [2 ]
Yang, Zhenguo [2 ]
Liu, Wenyin [2 ,3 ]
机构
[1] Guangdong Open Univ, Sch Artificial Intelligence, Guangzhou, Peoples R China
[2] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou, Peoples R China
[3] Cyberspace Secur Res Ctr, Peng Cheng Lab, Shenzhen, Peoples R China
关键词
Image inpainting; Deformable convolutional networks; Coordinate-attention; Multi-Scale GANs; OBJECT REMOVAL; RECONSTRUCTION; ALGORITHM;
D O I
10.1007/s11063-023-11233-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image inpainting with the large missing blocks is tremendous challenging to achieve visual consistency and realistic effect. In this paper, an Adaptive Visual field Multi-scale Generative Adversarial Networks (denoted as GANs) Image Inpainting based on Coordinate-attention (denoted as AVMGC) is proposed. Firstly, an encoder with deformable convolutional networks in the generator of multi-scale generative adversarial networks is designed to expand the local vision field of network sampling adaptively in the image inpainting, which improves the local visual consistency of the image inpainting. Secondly, in order to expand the receptive field of the deep network and the global visual field, AVMGC combines the coordinate-attention mechanism with the convolutional layers, aiming to capture the direction-aware and position-sensitive information by cross-channel, which helps models to more accurately locate and recognize the objects of interest and generate globally consistent geometric contour in the image inpainting. In particular, instance normalization is introduced to the mutil-scale discriminator for transferring the statistic information of the feature maps and aims to keep the style of the original images. Extensive experiments conducted on public datasets prove that the proposal algorithms have the qualitative performance and outperform the baselines.
引用
收藏
页码:9949 / 9967
页数:19
相关论文
共 33 条
[21]   Multi-Scale Patch Partitioning for Image Inpainting Based on Visual Transformers [J].
Campana, Jose Luis Flores ;
Decker, Luis Gustavo Lorgus ;
Roberto e Souza, Marcos ;
Maia, Helena de Almeida ;
Pedrini, Helio .
2022 35TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2022), 2022, :180-185
[22]   MFMAM: Image inpainting via multi-scale feature module with attention module [J].
Chen, Yuantao ;
Xia, Runlong ;
Yang, Kai ;
Zou, Ke .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 238
[23]   A novel generative adversarial networks based multi-scale reconstruction method for porous rocks [J].
Xiao, Nan ;
Peng, Yu ;
Zhou, Xiaoping .
COMPUTERS & STRUCTURES, 2025, 313
[24]   Contextual Information Aggregation and Multi-Scale Feature Fusion for Single Image De-Raining in Generative Adversarial Networks [J].
Zhao, Jia ;
Chen, Ming ;
Pan, Jeng-Shyang ;
Han, Longzhe ;
Qiu, Shenyu ;
Nie, Zhaoxiu .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2025, 37 (03)
[25]   MSMC-NET: IMAGE INPAINTING USING DEEP MULTI-SCALE AND MULTI-CONNECTION NETWORKS [J].
Wang, Miaohui ;
Chen, Xiaoming ;
Chen, Weiqian ;
Yuan, Yuan .
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, :683-686
[26]   Reconstruction of three-dimensional porous media using multi-scale generative adversarial networks [J].
Zhang, Ting ;
Ni, Mengling ;
Guan, Qijie ;
Li, Deya ;
Zhou, Shaojing ;
Du, Yi .
JOURNAL OF APPLIED GEOPHYSICS, 2023, 213
[27]   Multi-scale information fusion generative adversarial network for real-world noisy image denoising [J].
Hu, Xuegang ;
Zhao, Wei .
MACHINE VISION AND APPLICATIONS, 2024, 35 (04)
[28]   MCAD-Net: Multi-scale Coordinate Attention Dense Network for Single Image Deraining [J].
Li, Pengpeng ;
Jin, Jiyu ;
Jin, Guiyue ;
Shi, Jiaqi ;
Fan, Lei .
COMMUNICATIONS AND NETWORKING (CHINACOM 2021), 2022, :405-421
[29]   A generative adversarial network with multi-scale convolution and dilated convolution res-network for OCT retinal image despeckling [J].
Yu, Xiaojun ;
Li, Mingshuai ;
Ge, Chenkun ;
Shum, Perry Ping ;
Chen, Jinna ;
Liu, Linbo .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 80
[30]   A temporal attention-based multi-scale generative adversarial network to fill gaps in time series of MODIS data for land surface phenology extraction [J].
Wang, Yidan ;
Wu, Wei ;
Zhang, Zhicheng ;
Li, Ziming ;
Zhang, Fan ;
Xin, Qinchuan .
REMOTE SENSING OF ENVIRONMENT, 2025, 318