APCAFlow: All-Pairs Cost Volume Aggregation for Optical Flow Estimation

被引:0
作者
Feng, Miaojie [1 ]
Jia, Hao [1 ]
Yan, Zengqiang [1 ]
Yang, Xin [1 ]
机构
[1] Huazhong Univ Sci & Technol, Elect Informat & Commun, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Costs; Optical flow; Estimation; Three-dimensional displays; Correlation; Computer vision; Optimization; cost aggregation; optical flow;
D O I
10.1109/TMM.2024.3385669
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Optical flow estimation is a fundamental task in computer vision. The all-pairs correlation volume has enabled state-of-the-art performance in many optical flow estimation methods. However, all-pairs correlations provide only local matching clues, and lack global context, which could lead to mismatches in textureless and occluded regions. In this paper, we propose a novel all-pairs correlation volume aggregation (APCA) method which includes two key innovations. The first is a cost volume splitting and reassembling approach which partitions the full cost volume into smaller blocks and re-arranges those blocks to allow the use of 2D and 3D convolutions for cost volume aggregation. The second is hierarchical aggregation which performs 2D convolutions within blocks for local matching aggregation and 3D convolutions across blocks for global matching aggregation. We further design a novel optical flow estimation network APCAFlow based on APCA. APCAFlow achieves comparable performance to the most advanced approach, FlowFormer, but with significantly lower complexity. Specifically, APCAFlow reduces the model parameters, inference time, and memory consumption by 24.1%, 35.5%, and 21.6%, respectively, compared to FlowFormer. Furthermore, APCA can be easily integrated into several existing all-pairs cost volume-based methods for performance improvement.
引用
收藏
页码:9060 / 9069
页数:10
相关论文
共 44 条
  • [1] A Naturalistic Open Source Movie for Optical Flow Evaluation
    Butler, Daniel J.
    Wulff, Jonas
    Stanley, Garrett B.
    Black, Michael J.
    [J]. COMPUTER VISION - ECCV 2012, PT VI, 2012, 7577 : 611 - 625
  • [2] Pyramid Stereo Matching Network
    Chang, Jia-Ren
    Chen, Yong-Sheng
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5410 - 5418
  • [3] Region Separable Stereo Matching
    Cheng, Junda
    Yang, Xin
    Pu, Yuechuan
    Guo, Peng
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4880 - 4893
  • [4] Chu XX, 2021, ADV NEUR IN
  • [5] Explicit Motion Disentangling for Efficient Optical Flow Estimation
    Deng, Changxing
    Luo, Ao
    Huang, Haibin
    Ma, Shaodan
    Liu, Jiangyu
    Liu, Shuaicheng
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9487 - 9496
  • [6] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [7] Rethinking Optical Flow from Geometric Matching Consistent Perspective
    Dong, Qiaole
    Cao, Chenjie
    Fu, Yanwei
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1337 - 1347
  • [8] FlowNet: Learning Optical Flow with Convolutional Networks
    Dosovitskiy, Alexey
    Fischer, Philipp
    Ilg, Eddy
    Haeusser, Philip
    Hazirbas, Caner
    Golkov, Vladimir
    van der Smagt, Patrick
    Cremers, Daniel
    Brox, Thomas
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2758 - 2766
  • [9] RealFlow: EM-Based Realistic Optical Flow Dataset Generation from Videos
    Han, Yunhui
    Luo, Kunming
    Luo, Ao
    Liu, Jiangyu
    Fan, Haoqiang
    Luo, Guiming
    Liu, Shuaicheng
    [J]. COMPUTER VISION, ECCV 2022, PT XIX, 2022, 13679 : 288 - 305
  • [10] DETERMINING OPTICAL-FLOW
    HORN, BKP
    SCHUNCK, BG
    [J]. ARTIFICIAL INTELLIGENCE, 1981, 17 (1-3) : 185 - 203