Advancing Real-World Stereoscopic Image Super-Resolution via Vision-Language Model

Cited by: 0
Authors
Zhang, Zhe [1, 2]
Lei, Jianjun [1 ]
Peng, Bo [1 ]
Zhu, Jie [1 ]
Xu, Liying [1 ]
Huang, Qingming [3 ]
Affiliations
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Tianjin Univ Commerce, Sch Informat Engn, Tianjin 300134, Peoples R China
[3] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Stereo image processing; Degradation; Superresolution; Visualization; Image reconstruction; Training; Iterative methods; Solid modeling; Computational modeling; Cognition; Super-resolution; stereoscopic image; vision-language model;
DOI
10.1109/TIP.2025.3546470
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recent years have witnessed the remarkable success of the vision-language model in various computer vision tasks. However, how to exploit the semantic language knowledge of the vision-language model to advance real-world stereoscopic image super-resolution remains a challenging problem. This paper proposes a vision-language model-based stereoscopic image super-resolution (VLM-SSR) method, in which the semantic language knowledge in CLIP is exploited to facilitate stereoscopic image SR in a training-free manner. Specifically, by designing visual prompts for CLIP to infer the region similarity, a prompt-guided information aggregation mechanism is presented to capture inter-view information among relevant regions between the left and right views. Besides, driven by the prior knowledge of CLIP, a cognition prior-driven iterative enhancing mechanism is presented to optimize fuzzy regions adaptively. Experimental results on four datasets verify the effectiveness of the proposed method.
Pages: 2187-2197
Number of pages: 11
Related papers
50 in total
  • [1] Frequency Generation for Real-World Image Super-Resolution
    Guan, Wenxue
    Li, Haobo
    Xu, Dawei
    Liu, Jiaxin
    Gong, Shenghua
    Liu, Jun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7029 - 7040
  • [2] Toward Real-World Super-Resolution via Adaptive Downsampling Models
    Son, Sanghyun
    Kim, Jaeha
    Lai, Wei-Sheng
    Yang, Ming-Hsuan
    Lee, Kyoung Mu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 8657 - 8670
  • [3] Deep Stereoscopic Image Super-Resolution via Interaction Module
    Lei, Jianjun
    Zhang, Zhe
    Fan, Xiaoting
    Yang, Bolan
    Li, Xinxin
    Chen, Ying
    Huang, Qingming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (08) : 3051 - 3061
  • [4] Real-World Light Field Image Super-Resolution Via Degradation Modulation
    Wang, Yingqian
    Liang, Zhengyu
    Wang, Longguang
    Yang, Jungang
    An, Wei
    Guo, Yulan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [5] Real-World Thermal Image Super-Resolution
    Allahham, Moaaz
    Aakerberg, Andreas
    Nasrollahi, Kamal
    Moeslund, Thomas B.
    ADVANCES IN VISUAL COMPUTING (ISVC 2021), PT I, 2021, 13017 : 3 - 14
  • [6] Structure and Texture Preserving Network for Real-World Image Super-Resolution
    Zhou, Bijun
    Yan, Huibin
    Wang, Shuoyao
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2173 - 2177
  • [7] Unsupervised Degradation Aware and Representation for Real-World Remote Sensing Image Super-Resolution
    Guo, Wen-Zhong
    Weng, Wu-Ding
    Chen, Guang-Yong
    Su, Jian-Nan
    Gan, Min
    Philip Chen, C. L.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [8] Toward Real-World Remote Sensing Image Super-Resolution: A New Benchmark and an Efficient Model
    Wang, Jia
    Xiang, Liuyu
    Liu, Lei
    Xu, Jiaochong
    Li, Peipei
    Xu, Qizhi
    He, Zhaofeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [9] Recurrent Interaction Network for Stereoscopic Image Super-Resolution
    Zhang, Zhe
    Peng, Bo
    Lei, Jianjun
    Shen, Haifeng
    Huang, Qingming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (05) : 2048 - 2060
  • [10] Empowering Real-World Image Super-Resolution With Flexible Interactive Modulation
    Mou, Chong
    Wang, Xintao
    Wu, Yanze
    Shan, Ying
    Zhang, Jian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (11) : 7317 - 7330