PLGNet: Prior-Guided Local and Global Interactive Hybrid Network for Face Super-Resolution

被引：3

作者：

Li, Ling ^{[1
]}

Zhang, Yan ^{[1
]}

Yuan, Lin ^{[1
]}

Gao, Xinbo ^{[1
]}

机构：

[1] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 10期

基金：

中国国家自然科学基金;

关键词：

Image reconstruction; Faces; Face recognition; Transformers; Superresolution; Semantics; Feature extraction; Face super-resolution (FSR); facial prior; attention aggregation; transformer; TRANSFORMER; ALIGNMENT;

D O I：

10.1109/TCSVT.2024.3403713

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Recent CNN-driven face super-resolution (FSR) technologies have achieved excellent breakthroughs by incorporating facial prior knowledge. However, most of them suffer from some obvious limitations. They always estimate facial priors from input low-resolution (LR) faces or coarsely enhanced LR faces, obtaining unfaithful priors that cannot be adequately exploited. This may bring noticeable artifacts to the target results, especially for large scaling factors, deteriorating the fidelity and naturalness and generating suboptimal reconstructed results. In this paper, we propose a two-stage prior-guided FSR approach to learn facial prior knowledge from the optimal SR results of stage one and explore the complementarity between priors to further guide more accurate reconstruction in stage two. Specifically, we develop an efficient local and global interactive hybrid network incorporating facial semantic and geometric priors for more discriminative results. To reach this, we devise a multiscale interconnected symmetric encoder-decoder architecture composed of Prior Interaction-Integration Modules (PIIMs), the Coarse-to-fine Feature Refinement Module (CFRM), and Feature Aggregation Modulation Modules (FAMMs). The encoder concentrates on hierarchically extracting multiscale features. The CFRM is devised to explore the potential correlations between the encoder and the decoder and further guide the refinement and reinforcement of the encoded features. The decoder aims to take full advantage of informative multiscale encoded features to reconstruct high-quality SR representations. Comprehensive evaluation and visualization results on four benchmark datasets demonstrate the superiority of the proposed PLGNet over current state-of-the-art methods. The source code of PLGNet will be available at https://github.com/lil808/PLGNet.git.

引用

页码：10166 / 10181

页数：16

共 50 条

[41] Super-resolution using neighbourhood regression with local structure prior [J].

Li, Keqiuyin ;

Cao, Feilong .

SIGNAL PROCESSING-IMAGE COMMUNICATION, 2019, 72 :58-68

[42] Robust Face Super-Resolution via Position Relation Model Based on Global Face Context [J].

Chen, Liang ;

Pan, Jinshan ;

Jiang, Junjun ;

Zhang, Jiawei ;

Wu, Yi .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (9002-9016) :9002-9016

[43] Learning face super-resolution through identity features and distilling facial prior knowledge [J].

Tomara, Anurag Singh ;

Arya, K. V. ;

Rajput, Shyam Singh .

EXPERT SYSTEMS WITH APPLICATIONS, 2025, 262

[44] NON-LOCAL SIMILARITY DICTIONARY LEARNING BASED FACE SUPER-RESOLUTION [J].

Liao, Haibin ;

Dai, Wenhua ;

Zhou, Qianjin ;

Liu, Bo .

2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, :88-93

[45] JDSR-GAN: Constructing an Efficient Joint Learning Network for Masked Face Super-Resolution [J].

Gao, Guangwei ;

Tang, Lei ;

Wu, Fei ;

Lu, Huimin ;

Yang, Jian .

IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 :1505-1512

[46] Hyperspectral Image Super-Resolution Network of Local-Global Attention Feature Reuse [J].

Size, Wang ;

Xin, Guan ;

Qiang, Li .

ACTA OPTICA SINICA, 2023, 43 (21)

[47] Super-Resolution for Remote Sensing Images via Local-Global Combined Network [J].

Lei, Sen ;

Shi, Zhenwei ;

Zou, Zhengxia .

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (08) :1243-1247

[48] Face Video Super-Resolution with Identity Guided Generative Adversarial Networks [J].

Li, Dingyi ;

Wang, Zengfu .

COMPUTER VISION, PT II, 2017, 772 :357-369

[49] Context-Aware Guided Attention Based Cross-Feedback Dense Network for Hyperspectral Image Super-Resolution [J].

Dong, Wenqian ;

Qu, Jiahui ;

Zhang, Tongzhen ;

Li, Yunsong ;

Du, Qian .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

[50] HCT: a hybrid CNN and transformer network for hyperspectral image super-resolution [J].

Wu, Huapeng ;

Wang, Chenyun ;

Lu, Chenyang ;

Zhan, Tianming .

MULTIMEDIA SYSTEMS, 2024, 30 (04)

← 1 2 3 4 5 →