APCAFlow: All-Pairs Cost Volume Aggregation for Optical Flow Estimation

Cited by: 0

Authors:

Feng, Miaojie [1]

Jia, Hao [1]

Yan, Zengqiang [1]

Yang, Xin [1]

Affiliations:

[1] Huazhong Univ Sci & Technol, Elect Informat & Commun, Wuhan 430074, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2024 / Vol. 26

Funding:

National Natural Science Foundation of China;

Keywords:

Costs; Optical flow; Estimation; Three-dimensional displays; Correlation; Computer vision; Optimization; cost aggregation; optical flow;

DOI:

10.1109/TMM.2024.3385669

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Optical flow estimation is a fundamental task in computer vision. The all-pairs correlation volume has enabled state-of-the-art performance in many optical flow estimation methods. However, all-pairs correlations provide only local matching clues, and lack global context, which could lead to mismatches in textureless and occluded regions. In this paper, we propose a novel all-pairs correlation volume aggregation (APCA) method which includes two key innovations. The first is a cost volume splitting and reassembling approach which partitions the full cost volume into smaller blocks and re-arranges those blocks to allow the use of 2D and 3D convolutions for cost volume aggregation. The second is hierarchical aggregation which performs 2D convolutions within blocks for local matching aggregation and 3D convolutions across blocks for global matching aggregation. We further design a novel optical flow estimation network APCAFlow based on APCA. APCAFlow achieves comparable performance to the most advanced approach, FlowFormer, but with significantly lower complexity. Specifically, APCAFlow reduces the model parameters, inference time, and memory consumption by 24.1%, 35.5%, and 21.6%, respectively, compared to FlowFormer. Furthermore, APCA can be easily integrated into several existing all-pairs cost volume-based methods for performance improvement.
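The abstract does not give implementation details, so the following is only a minimal NumPy sketch of the general idea of an all-pairs correlation volume and of partitioning it into smaller blocks. The function names, the dot-product correlation with 1/sqrt(D) scaling, the block sizes `bh` and `bw`, and the choice to split along the target-image axes are all assumptions made for illustration, not the authors' APCA design. The convolutional aggregation itself is omitted.

```python
import numpy as np


def all_pairs_correlation(f1, f2):
    """Correlate every pixel of f1 with every pixel of f2 (hypothetical helper).

    f1, f2: feature maps of shape (H, W, D).
    Returns a 4D volume of shape (H, W, H, W). The dot product with
    1/sqrt(D) scaling is an assumed, commonly used choice.
    """
    d = f1.shape[-1]
    return np.einsum("ijd,kld->ijkl", f1, f2) / np.sqrt(d)


def split_into_blocks(corr, bh, bw):
    """Partition the target-image axes into non-overlapping (bh, bw) blocks.

    Assumed layout: (H1, W1, H2, W2) -> (H1, W1, H2/bh, W2/bw, bh, bw).
    The last two axes index positions inside a block (where 2D convolutions
    could act); the two axes before them index the blocks themselves
    (where cross-block, e.g. 3D, convolutions could act).
    """
    h1, w1, h2, w2 = corr.shape
    if h2 % bh or w2 % bw:
        raise ValueError("block size must divide the target height and width")
    blocks = corr.reshape(h1, w1, h2 // bh, bh, w2 // bw, bw)
    return blocks.transpose(0, 1, 2, 4, 3, 5)


def reassemble(blocks):
    """Inverse of split_into_blocks: restore the (H1, W1, H2, W2) volume."""
    h1, w1, nbh, nbw, bh, bw = blocks.shape
    return blocks.transpose(0, 1, 2, 4, 3, 5).reshape(h1, w1, nbh * bh, nbw * bw)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feat1 = rng.standard_normal((4, 6, 8))
    feat2 = rng.standard_normal((4, 6, 8))
    volume = all_pairs_correlation(feat1, feat2)
    blocks = split_into_blocks(volume, bh=2, bw=3)
    print(volume.shape, blocks.shape)
```

In this layout, splitting and reassembling are exact inverses, so any aggregation applied to the blocked view can be mapped back to the original volume shape.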


Pages: 9060-9069

Page count: 10

