Multi-Scale Visual Perception Based Progressive Feature Interaction Network for Stereo Image Super-Resolution

被引:6
作者
Liu, Anqi [1 ]
Li, Sumei [1 ]
Chang, Yongli [1 ]
Hou, Yonghong [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
Stereo image super-resolution; convolutional neural network; multi-scale; feature transformer; perceptual texture matching;
D O I
10.1109/TCSVT.2023.3295087
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent years, stereo image super-resolution based on convolutional neural network has been extensively researched and achieved impressive performance by introducing complementary information from another view. However, most existing methods still cannot fully capture both intra- and cross-view information due to the neglect of multi-scale information perception, multi-scale binocular alignment and the excitation of large scale to small scale in human vision system. And they generated blurry results due to the consideration of irrelevant information in search for cross-view information. To address these issues, we propose a multi-scale visual perception based progressive feature interaction network (MS-PFINet) for stereo image super-resolution. Specifically, to exploit comprehensive intra- and cross-view information for image reconstruction, we design a two-stream network with multi-branch structure to extract multi-scale features and progressively use cross-view interaction at larger scales to guide that at smaller scales. Moreover, to explore more proper and accurate cross-view information, we propose a feature transformer module (FTM) to search and transfer the most relevant features from another view by hard attention maps and soft attention maps, which are calculated by patch-wise similarity rather than pixel-wise. In addition, in order to encourage a more effective way to transfer texture features for the target view, we propose a perceptual texture matching loss to supervise the accuracy of feature transformer modules. Experimental results show that our proposed method is superior to the state-of-the-art methods in most cases.
引用
收藏
页码:1615 / 1626
页数:12
相关论文
共 15 条
  • [1] Cross Parallax Attention Network for Stereo Image Super-Resolution
    Chen, Canqiang
    Qing, Chunmei
    Xu, Xiangmin
    Dickinson, Patrick
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 202 - 216
  • [2] Feedback Network for Mutually Boosted Stereo Image Super-Resolution and Disparity Estimation
    Dai, Qinyan
    Li, Juncheng
    Yi, Qiaosi
    Fang, Faming
    Zhang, Guixu
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1985 - 1993
  • [3] Second-order Attention Network for Single Image Super-Resolution
    Dai, Tao
    Cai, Jianrui
    Zhang, Yongbing
    Xia, Shu-Tao
    Zhang, Lei
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11057 - 11066
  • [4] A Disparity Feature Alignment Module for Stereo Image Super-Resolution
    Dan, Jiawang
    Qu, Zhaowei
    Wang, Xiaoru
    Gu, Jiahang
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1285 - 1289
  • [5] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
  • [6] Enhancing the Spatial Resolution of Stereo Images using a Parallax Prior
    Jeon, Daniel S.
    Baek, Seung-Hwan
    Choi, Inchang
    Kim, Min H.
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1721 - 1730
  • [7] MDCN: Multi-Scale Dense Cross Network for Image Super-Resolution
    Li, Juncheng
    Fang, Faming
    Li, Jiaqian
    Mei, Kangfu
    Zhang, Guixu
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (07) : 2547 - 2561
  • [8] Simonyan K, 2015, Arxiv, DOI [arXiv:1409.1556, DOI 10.48550/ARXIV.1409.1556]
  • [9] Song W, 2020, AAAI CONF ARTIF INTE, V34, P12031
  • [10] Multi-Grained Attention Networks for Single Image Super-Resolution
    Wu, Huapeng
    Zou, Zhengxia
    Gui, Jie
    Zeng, Wen-Jun
    Ye, Jieping
    Zhang, Jun
    Liu, Hongyi
    Wei, Zhihui
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (02) : 512 - 522