Faces;
Face recognition;
Feature extraction;
Visualization;
Transformers;
Task analysis;
Semantics;
Face inpainting;
irregular hole;
multimodality;
text description;
D O I:
10.1109/TCSVT.2024.3370578
中图分类号:
TM [电工技术];
TN [电子技术、通信技术];
学科分类号:
0808 ;
0809 ;
摘要:
Irregular hole face inpainting is a challenging task, since the appearance of faces varies greatly (e.g., different expressions and poses) and the human vision is more sensitive to subtle blemishes in the inpainted face images. Without external information, most existing methods struggle to generate new content containing semantic information for face components in the absence of sufficient contextual information. As it is known that text can be used to describe the content of an image in most cases, and is flexible and user-friendly. In this work, a concise and effective Multimodal Face Inpainting Network (MuFIN) is proposed, which simultaneously utilizes the information of the known regions and the descriptive text of the input image to address the problem of irregular hole face inpainting. To fully exploit the rest parts of the corrupted face images, a plug-and-play Multi-scale Multi-level Skip Fusion Module (MMSFM), which extracts multi-scale features and fuses shallow features into deep features at multiple levels, is illustrated. Moreover, to bridge the gap between textual and visual modalities and effectively fuse cross-modal features, a Multi-scale Text-Image Fusion Block (MTIFB), which incorporates text features into image features from both local and global scales, is developed. Extensive experiments conducted on two commonly used datasets CelebA and Multi-Modal-CelebA-HQ demonstrate that our method outperforms state-of-the-art methods both qualitatively and quantitatively, and can generate realistic and controllable results.
机构:
Guangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R ChinaGuangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R China
Yang, Zhuopan
Yang, Zhenguo
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R ChinaGuangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R China
Yang, Zhenguo
Li, Xiaoping
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R ChinaGuangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R China
Li, Xiaoping
Yu, Yi
论文数: 0引用数: 0
h-index: 0
机构:
Natl Inst Informat, Digital Content & Media Sci Res Div, Tokyo 1018430, JapanGuangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R China
Yu, Yi
Li, Qing
论文数: 0引用数: 0
h-index: 0
机构:
Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R ChinaGuangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R China
Li, Qing
Liu, Wenyin
论文数: 0引用数: 0
h-index: 0
机构:
Zhongguancun Lab, Beijing 100190, Peoples R ChinaGuangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R China
机构:
Beijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China
Chinese Acad Sci, Inst Automat, Ctr Res Intelligent Percept & Comp, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R ChinaBeijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China
Tian, Yu
Huang, Yalin
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Automat, Ctr Res Intelligent Percept & Comp, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R ChinaBeijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China
Huang, Yalin
Zhang, Kunbo
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Automat, Ctr Res Intelligent Percept & Comp, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
Univ Chinese Acad Sci, Beijing 101408, Peoples R ChinaBeijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China
Zhang, Kunbo
Liu, Yue
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R ChinaBeijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China
Liu, Yue
Sun, Zhenan
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Automat, Ctr Res Intelligent Percept & Comp, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
Univ Chinese Acad Sci, Beijing 101408, Peoples R ChinaBeijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China
机构:
Isfahan University of Technology,Department of Electrical and Computer EngineeringIsfahan University of Technology,Department of Electrical and Computer Engineering
MohammadHossein Givkashi
MohammadReza Naderi
论文数: 0引用数: 0
h-index: 0
机构:
Isfahan University of Technology,Department of Electrical and Computer EngineeringIsfahan University of Technology,Department of Electrical and Computer Engineering
MohammadReza Naderi
Nader Karimi
论文数: 0引用数: 0
h-index: 0
机构:
Isfahan University of Technology,Department of Electrical and Computer EngineeringIsfahan University of Technology,Department of Electrical and Computer Engineering
Nader Karimi
Shahram Shirani
论文数: 0引用数: 0
h-index: 0
机构:
McMaster University,Department of Electrical and Computer EngineeringIsfahan University of Technology,Department of Electrical and Computer Engineering
Shahram Shirani
Shadrokh Samavi
论文数: 0引用数: 0
h-index: 0
机构:
McMaster University,Department of Electrical and Computer EngineeringIsfahan University of Technology,Department of Electrical and Computer Engineering
机构:
Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R ChinaShandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
Bao, Yongtang
Xiao, Xinfei
论文数: 0引用数: 0
h-index: 0
机构:
Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R ChinaShandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
Xiao, Xinfei
Qi, Yue
论文数: 0引用数: 0
h-index: 0
机构:
Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
Beihang Univ, Virtual Real Res Inst, Qingdao Res Inst, Qingdao 266100, Peoples R China
Peng Cheng Lab, Shenzhen 518055, Peoples R ChinaShandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
机构:
Sun Yat Sen Univ, Sch Elect & Informat Engn, Guangzhou 510006, Peoples R ChinaSun Yat Sen Univ, Sch Elect & Informat Engn, Guangzhou 510006, Peoples R China
Du, Lingshuang
Hu, Haifeng
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Sch Elect & Informat Engn, Guangzhou 510006, Peoples R ChinaSun Yat Sen Univ, Sch Elect & Informat Engn, Guangzhou 510006, Peoples R China
Hu, Haifeng
Wu, Yongbo
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Sch Elect & Informat Engn, Guangzhou 510006, Peoples R ChinaSun Yat Sen Univ, Sch Elect & Informat Engn, Guangzhou 510006, Peoples R China
机构:
Guangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R ChinaGuangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R China
Yang, Zhuopan
Yang, Zhenguo
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R ChinaGuangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R China
Yang, Zhenguo
Li, Xiaoping
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R ChinaGuangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R China
Li, Xiaoping
Yu, Yi
论文数: 0引用数: 0
h-index: 0
机构:
Natl Inst Informat, Digital Content & Media Sci Res Div, Tokyo 1018430, JapanGuangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R China
Yu, Yi
Li, Qing
论文数: 0引用数: 0
h-index: 0
机构:
Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R ChinaGuangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R China
Li, Qing
Liu, Wenyin
论文数: 0引用数: 0
h-index: 0
机构:
Zhongguancun Lab, Beijing 100190, Peoples R ChinaGuangdong Univ Technol, Sch Comp Sci, Guangzhou 510000, Peoples R China
机构:
Beijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China
Chinese Acad Sci, Inst Automat, Ctr Res Intelligent Percept & Comp, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R ChinaBeijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China
Tian, Yu
Huang, Yalin
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Automat, Ctr Res Intelligent Percept & Comp, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R ChinaBeijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China
Huang, Yalin
Zhang, Kunbo
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Automat, Ctr Res Intelligent Percept & Comp, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
Univ Chinese Acad Sci, Beijing 101408, Peoples R ChinaBeijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China
Zhang, Kunbo
Liu, Yue
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R ChinaBeijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China
Liu, Yue
Sun, Zhenan
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Automat, Ctr Res Intelligent Percept & Comp, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
Univ Chinese Acad Sci, Beijing 101408, Peoples R ChinaBeijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China
机构:
Isfahan University of Technology,Department of Electrical and Computer EngineeringIsfahan University of Technology,Department of Electrical and Computer Engineering
MohammadHossein Givkashi
MohammadReza Naderi
论文数: 0引用数: 0
h-index: 0
机构:
Isfahan University of Technology,Department of Electrical and Computer EngineeringIsfahan University of Technology,Department of Electrical and Computer Engineering
MohammadReza Naderi
Nader Karimi
论文数: 0引用数: 0
h-index: 0
机构:
Isfahan University of Technology,Department of Electrical and Computer EngineeringIsfahan University of Technology,Department of Electrical and Computer Engineering
Nader Karimi
Shahram Shirani
论文数: 0引用数: 0
h-index: 0
机构:
McMaster University,Department of Electrical and Computer EngineeringIsfahan University of Technology,Department of Electrical and Computer Engineering
Shahram Shirani
Shadrokh Samavi
论文数: 0引用数: 0
h-index: 0
机构:
McMaster University,Department of Electrical and Computer EngineeringIsfahan University of Technology,Department of Electrical and Computer Engineering
机构:
Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R ChinaShandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
Bao, Yongtang
Xiao, Xinfei
论文数: 0引用数: 0
h-index: 0
机构:
Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R ChinaShandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
Xiao, Xinfei
Qi, Yue
论文数: 0引用数: 0
h-index: 0
机构:
Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
Beihang Univ, Virtual Real Res Inst, Qingdao Res Inst, Qingdao 266100, Peoples R China
Peng Cheng Lab, Shenzhen 518055, Peoples R ChinaShandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
机构:
Sun Yat Sen Univ, Sch Elect & Informat Engn, Guangzhou 510006, Peoples R ChinaSun Yat Sen Univ, Sch Elect & Informat Engn, Guangzhou 510006, Peoples R China
Du, Lingshuang
Hu, Haifeng
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Sch Elect & Informat Engn, Guangzhou 510006, Peoples R ChinaSun Yat Sen Univ, Sch Elect & Informat Engn, Guangzhou 510006, Peoples R China
Hu, Haifeng
Wu, Yongbo
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Sch Elect & Informat Engn, Guangzhou 510006, Peoples R ChinaSun Yat Sen Univ, Sch Elect & Informat Engn, Guangzhou 510006, Peoples R China