TemporalStereo: Efficient Spatial-Temporal Stereo Matching Network

Cited by: 6
Authors
Zhang, Youmin [1]
Poggi, Matteo [1]
Mattoccia, Stefano [1]
Affiliations
[1] Univ Bologna, Dept Comp Sci & Engn DISI, Bologna, Italy
DOI
10.1109/IROS55552.2023.10341598
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present TemporalStereo, a coarse-to-fine stereo matching network that is highly efficient and able to effectively exploit past geometry and context information to boost matching accuracy. Our network leverages a sparse cost volume and proves effective when only a single stereo pair is given. However, its distinctive ability to use spatio-temporal information across stereo sequences allows TemporalStereo to alleviate problems such as occlusions and reflective regions while remaining highly efficient in this setting as well. Notably, our model, trained once on stereo videos, can run seamlessly in both single-pair and temporal modes. Experiments show that our network, which relies on camera motion, is robust even to dynamic objects when running on videos. We validate TemporalStereo through extensive experiments on synthetic (SceneFlow, TartanAir) and real (KITTI 2012, KITTI 2015) datasets. Our model achieves state-of-the-art performance on all of these datasets.
Pages: 9528-9535
Number of pages: 8