iButter: Neural Interactive Bullet Time Generator for Human Free-viewpoint Rendering

被引:13
作者
Wang, Liao [1 ]
Wang, Ziyu [1 ]
Lin, Pei [1 ]
Jiang, Yuheng [1 ]
Suo, Xin [1 ]
Wu, Minye [1 ]
Xu, Lan [1 ]
Yu, Jingyi [2 ]
机构
[1] Shanghaitech Univ, Shanghai, Peoples R China
[2] Shanghaitech Univ, Sch Informat Sci & Technol, Shanghai Engn Res Ctr Intelligent Vis & Imaging, Shanghai, Peoples R China
来源
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年
关键词
free-viewpoint video; bullet-time; novel view synthesis; neural rendering; neural representation; VIDEO;
D O I
10.1145/3474085.3475412
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generating "bullet-time" effects of human free-viewpoint videos is critical for immersive visual effects and VR/AR experience. Recent neural advances still lack the controllable and interactive bullet time design ability for human free-viewpoint rendering, especially under the real-time, dynamic and general setting for our trajectory aware task. To fill this gap, in this paper we propose a neural interactive bullet-time generator (iButter) for photo-realistic human free-viewpoint rendering from dense RGB streams, which enables flexible and interactive design for human bullet-time visual effects. Our iButter approach consists of a real-time preview and design stage as well as a trajectory-aware refinement stage. During preview, we propose an interactive bullet-time design approach by extending the NeRF rendering to a real-time and dynamic setting and getting rid of the tedious per-scene training. To this end, our bullet-time design stage utilizes a hybrid training set, light-weight network design and an efficient silhouette-based sampling strategy. During refinement, we introduce an efficient trajectory-aware scheme within 20 minutes, which jointly encodes the spatial, temporal consistency and semantic cues along the designed trajectory, achieving photo-realistic bullet-time viewing experience of human activities. Extensive experiments demonstrate the effectiveness of our approach for convenient interactive bullet-time design and photo-realistic human free-viewpoint video generation.
引用
收藏
页码:4641 / 4650
页数:10
相关论文
共 54 条
[1]  
[Anonymous], 2019, AGISOFT PHOTOSCAN PR
[2]   Immersive Light Field Video with a Layered Mesh Representation [J].
Broxton, Michael ;
Flynn, John ;
Overbeck, Ryan ;
Erickson, Daniel ;
Hedman, Peter ;
Duvall, Matthew ;
Dourgarian, Jason ;
Busch, Jay ;
Whalen, Matt ;
Debevec, Paul .
ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (04)
[3]  
Buehler C, 2001, COMP GRAPH, P425, DOI 10.1145/383259.383309
[4]   Free-viewpoint video of human actors [J].
Carranza, J ;
Theobalt, C ;
Magnor, MA ;
Seidel, HP .
ACM TRANSACTIONS ON GRAPHICS, 2003, 22 (03) :569-577
[5]  
Chen Anpei, 2021, ARXIV210315595 CS CV
[6]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[7]  
Chen S. E., 1993, Computer Graphics Proceedings, P279, DOI 10.1145/166117.166153
[8]  
Chibane Julian, 2021, IEEE C COMP VIS PATT
[9]   Extreme View Synthesis [J].
Choi, Inchang ;
Gallo, Orazio ;
Troccoli, Alejandro ;
Kim, Min H. ;
Kautz, Jan .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7780-7789
[10]   High-Quality Streamable Free-Viewpoint Video [J].
Collet, Alvaro ;
Chuang, Ming ;
Sweeney, Pat ;
Gillett, Don ;
Evseev, Dennis ;
Calabrese, David ;
Hoppe, Hugues ;
Kirk, Adam ;
Sullivan, Steve .
ACM TRANSACTIONS ON GRAPHICS, 2015, 34 (04)