Real-Time Free Viewpoint Video Synthesis System Based on DIBR and a Depth Estimation Network

被引:8
作者
Guo, Shuai [1 ]
Hu, Jingchuan [1 ]
Zhou, Kai [1 ]
Wang, Jionghao [1 ]
Song, Li [2 ]
Xie, Rong [1 ]
Zhang, Wenjun [1 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Cooperat Medianet Innovat Ctr, Shanghai 200240, Peoples R China
关键词
Cameras; Real-time systems; Estimation; Three-dimensional displays; Rendering (computer graphics); Costs; Streaming media; Free viewpoint video (FVV); depth image-based rendering (DIBR); depth estimation; dataset; CONTENT CREATION; GENERATION; IMAGES;
D O I
10.1109/TMM.2024.3355639
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Depth image-based rendering (DIBR) view synthesis is the most widely employed method in real-time FVV research. Despite recent progress, most DIBR-based FVV synthesis approaches are not sufficiently simple and effective in filling holes and artifacts. Additionally, they use RGB-D cameras, which are difficult to widely adopt or take considerable time to estimate high-quality depth images. This article introduces a real-time FVV synthesis system based on DIBR and a depth estimation network. This system includes a 12-view synchronous camera system, a new multistage depth estimation network, a new GPU-accelerated DIBR algorithm, and a virtual view parameter generation method. This system provides the first real-time FVV solution for background-fixed fields based on DIBR and a depth estimation network. It can infer depth images for all camera views and synthesize any virtual view along the horizontal circular arc of the camera rig in real time. To our knowledge, we are the first to introduce background models and foreground masks and a refined multistage structure to address real-time high-quality depth estimation and DIBR FVV synthesis. We also build a high-quality multiview RGB-D synchronous dataset that has promising DIBR FVV synthesis performance to train and evaluate our system. The experimental results demonstrate the real-time and better performance of the proposed system.
引用
收藏
页码:6701 / 6716
页数:16
相关论文
共 91 条
[1]  
4DReplay, 2022, About us
[2]   Large-Scale Data for Multiple-View Stereopsis [J].
Aanaes, Henrik ;
Jensen, Rasmus Ramsbol ;
Vogiatzis, George ;
Tola, Engin ;
Dahl, Anders Bjorholm .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 120 (02) :153-168
[3]   Bidirectional Attention Network for Monocular Depth Estimation [J].
Aich, Shubhra ;
Vianney, Jean Marie Uwabeza ;
Islam, Md Amirul ;
Kaur, Mannat ;
Liu, Bingbing .
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, :11746-11752
[4]   AdaBins: Depth Estimation Using Adaptive Bins [J].
Bhat, Shariq Farooq ;
Alhashim, Ibraheem ;
Wonka, Peter .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :4008-4017
[5]  
Boski M, 2017, 2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS)
[6]   FVV Live: A Real-Time Free-Viewpoint Video System With Consumer Electronics Hardware [J].
Carballeira, Pablo ;
Carmona, Carlos ;
Diaz, Cesar ;
Berjon, Daniel ;
Corregidor, Daniel ;
Cabrera, Julian ;
Moran, Francisco ;
Doblado, Carmen ;
Arnaldo, Sergio ;
Martin, Maria del Mar ;
Garcia, Narciso .
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 :2378-2391
[7]   A robust blind watermarking algorithm for depth-image-based rendering 3D images [J].
Chen, Lei ;
Zhao, Jiying .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 87 (87)
[8]  
Chen WY, 2005, 2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, P1315
[9]   Hole Filling Method for Depth Image Based Rendering Based on Boundary Decision [J].
Cho, Jea-Hyung ;
Song, Wonseok ;
Choi, Hyuk ;
Kim, Taejeong .
IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (03) :329-333
[10]   GPU-accelerated Real-time Free-viewpoint DIBR for 3DTV [J].
Do, Luat ;
Bravo, German ;
Zinger, Svitlana ;
de With, Peter H. N. .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (02) :633-640