iButter: Neural Interactive Bullet Time Generator for Human Free-viewpoint Rendering

被引：13

作者：

Wang, Liao ^{[1
]}

Wang, Ziyu ^{[1
]}

Lin, Pei ^{[1
]}

Jiang, Yuheng ^{[1
]}

Suo, Xin ^{[1
]}

Wu, Minye ^{[1
]}

Xu, Lan ^{[1
]}

Yu, Jingyi ^{[2
]}

机构：

[1] Shanghaitech Univ, Shanghai, Peoples R China

[2] Shanghaitech Univ, Sch Informat Sci & Technol, Shanghai Engn Res Ctr Intelligent Vis & Imaging, Shanghai, Peoples R China

来源：

PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年

关键词：

free-viewpoint video; bullet-time; novel view synthesis; neural rendering; neural representation; VIDEO;

D O I：

10.1145/3474085.3475412

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Generating "bullet-time" effects of human free-viewpoint videos is critical for immersive visual effects and VR/AR experience. Recent neural advances still lack the controllable and interactive bullet time design ability for human free-viewpoint rendering, especially under the real-time, dynamic and general setting for our trajectory aware task. To fill this gap, in this paper we propose a neural interactive bullet-time generator (iButter) for photo-realistic human free-viewpoint rendering from dense RGB streams, which enables flexible and interactive design for human bullet-time visual effects. Our iButter approach consists of a real-time preview and design stage as well as a trajectory-aware refinement stage. During preview, we propose an interactive bullet-time design approach by extending the NeRF rendering to a real-time and dynamic setting and getting rid of the tedious per-scene training. To this end, our bullet-time design stage utilizes a hybrid training set, light-weight network design and an efficient silhouette-based sampling strategy. During refinement, we introduce an efficient trajectory-aware scheme within 20 minutes, which jointly encodes the spatial, temporal consistency and semantic cues along the designed trajectory, achieving photo-realistic bullet-time viewing experience of human activities. Extensive experiments demonstrate the effectiveness of our approach for convenient interactive bullet-time design and photo-realistic human free-viewpoint video generation.

引用

页码：4641 / 4650

页数：10

共 54 条

[1]

[Anonymous], 2019, AGISOFT PHOTOSCAN PR

[2] Immersive Light Field Video with a Layered Mesh Representation [J].

Broxton, Michael ;

Flynn, John ;

Overbeck, Ryan ;

Erickson, Daniel ;

Hedman, Peter ;

Duvall, Matthew ;

Dourgarian, Jason ;

Busch, Jay ;

Whalen, Matt ;

Debevec, Paul .

ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (04)

[3]

Buehler C, 2001, COMP GRAPH, P425, DOI 10.1145/383259.383309

[4] Free-viewpoint video of human actors [J].

Carranza, J ;

Theobalt, C ;

Magnor, MA ;

Seidel, HP .

ACM TRANSACTIONS ON GRAPHICS, 2003, 22 (03) :569-577

[5]

Chen Anpei, 2021, ARXIV210315595 CS CV

[6] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[7]

Chen S. E., 1993, Computer Graphics Proceedings, P279, DOI 10.1145/166117.166153

[8]

Chibane Julian, 2021, IEEE C COMP VIS PATT

[9] Extreme View Synthesis [J].

Choi, Inchang ;

Gallo, Orazio ;

Troccoli, Alejandro ;

Kim, Min H. ;

Kautz, Jan .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7780-7789

[10] High-Quality Streamable Free-Viewpoint Video [J].

Collet, Alvaro ;

Chuang, Ming ;

Sweeney, Pat ;

Gillett, Don ;

Evseev, Dennis ;

Calabrese, David ;

Hoppe, Hugues ;

Kirk, Adam ;

Sullivan, Steve .

ACM TRANSACTIONS ON GRAPHICS, 2015, 34 (04)

← 1 2 3 4 5 6 →