Multi-Scale Visual Perception Based Progressive Feature Interaction Network for Stereo Image Super-Resolution

被引：6

作者：

Liu, Anqi ^{[1
]}

Li, Sumei ^{[1
]}

Chang, Yongli ^{[1
]}

Hou, Yonghong ^{[1
]}

机构：

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Stereo image super-resolution; convolutional neural network; multi-scale; feature transformer; perceptual texture matching;

D O I：

10.1109/TCSVT.2023.3295087

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In recent years, stereo image super-resolution based on convolutional neural network has been extensively researched and achieved impressive performance by introducing complementary information from another view. However, most existing methods still cannot fully capture both intra- and cross-view information due to the neglect of multi-scale information perception, multi-scale binocular alignment and the excitation of large scale to small scale in human vision system. And they generated blurry results due to the consideration of irrelevant information in search for cross-view information. To address these issues, we propose a multi-scale visual perception based progressive feature interaction network (MS-PFINet) for stereo image super-resolution. Specifically, to exploit comprehensive intra- and cross-view information for image reconstruction, we design a two-stream network with multi-branch structure to extract multi-scale features and progressively use cross-view interaction at larger scales to guide that at smaller scales. Moreover, to explore more proper and accurate cross-view information, we propose a feature transformer module (FTM) to search and transfer the most relevant features from another view by hard attention maps and soft attention maps, which are calculated by patch-wise similarity rather than pixel-wise. In addition, in order to encourage a more effective way to transfer texture features for the target view, we propose a perceptual texture matching loss to supervise the accuracy of feature transformer modules. Experimental results show that our proposed method is superior to the state-of-the-art methods in most cases.

引用

页码：1615 / 1626

页数：12

共 15 条

[1] Cross Parallax Attention Network for Stereo Image Super-Resolution
Chen, Canqiang
Qing, Chunmei
Xu, Xiangmin
Dickinson, Patrick
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 202 - 216
[2] Feedback Network for Mutually Boosted Stereo Image Super-Resolution and Disparity Estimation
Dai, Qinyan
Li, Juncheng
Yi, Qiaosi
Fang, Faming
Zhang, Guixu
[J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1985 - 1993
[3] Second-order Attention Network for Single Image Super-Resolution
Dai, Tao
Cai, Jianrui
Zhang, Yongbing
Xia, Shu-Tao
Zhang, Lei
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11057 - 11066
[4] A Disparity Feature Alignment Module for Stereo Image Super-Resolution
Dan, Jiawang
Qu, Zhaowei
Wang, Xiaoru
Gu, Jiahang
[J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1285 - 1289
[5] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
[6] Enhancing the Spatial Resolution of Stereo Images using a Parallax Prior
Jeon, Daniel S.
Baek, Seung-Hwan
Choi, Inchang
Kim, Min H.
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1721 - 1730
[7] MDCN: Multi-Scale Dense Cross Network for Image Super-Resolution
Li, Juncheng
Fang, Faming
Li, Jiaqian
Mei, Kangfu
Zhang, Guixu
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (07) : 2547 - 2561
[8] Simonyan K, 2015, Arxiv, DOI [arXiv:1409.1556, DOI 10.48550/ARXIV.1409.1556]
[9] Song W, 2020, AAAI CONF ARTIF INTE, V34, P12031
[10] Multi-Grained Attention Networks for Single Image Super-Resolution
Wu, Huapeng
Zou, Zhengxia
Gui, Jie
Zeng, Wen-Jun
Ye, Jieping
Zhang, Jun
Liu, Hongyi
Wei, Zhihui
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (02) : 512 - 522

← 1 2 →