Mobile Volumetric Video Streaming System through Implicit Neural Representation

Cited by: 0
Authors
Liu, Junhua [1, 2]
Wang, Yuanyuan [2]
Wang, Yan [4, 5]
Wang, Yufeng [5]
Cui, Shuguang [1, 3]
Wang, Fangxin [1, 3]
Affiliations
[1] CUHK Shenzhen, FNii, Shenzhen, Peoples R China
[2] Sensetime Res, Hong Kong, Peoples R China
[3] CUHK Shenzhen, SSE, Shenzhen, Peoples R China
[4] Tsinghua Univ, Inst AI Ind Res AIR, Beijing, Peoples R China
[5] Tsinghua Univ, Beijing, Peoples R China
Source
PROCEEDINGS OF THE 2023 WORKSHOP ON EMERGING MULTIMEDIA SYSTEMS, EMS 2023 | 2023
Keywords
Volumetric Video Streaming; Implicit Neural Representation; Mobile Mixed Reality;
DOI
10.1145/3609395.3610593
Chinese Library Classification
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Volumetric video (VV) is emerging as a new video paradigm that offers an immersive six-degree-of-freedom (6-DoF) viewing experience. Most existing VV systems are built on a point cloud (PtCl)-based architecture, which is far from effective because of its huge video size, unrealistic color variations, and need for a specialized player platform. Recent advances in implicit neural representations (INR) such as NeRF bring great opportunities to VV, given their potential for creating photorealistic 3D appearances with consistent lighting. However, arduous challenges remain in model training, display rendering, streaming optimization, and system implementation. To address these challenges, we develop NeRVo, an INR-based representation for mobile VV. Compared with NeRF, NeRVo improves training speed by over 300x and rendering speed by over 1000x while offering photorealism, mobile compatibility, and desirable data rates. Using NeRVo as a building block, we design and implement VoINR, a holistic INR-enhanced VV streaming system.
Pages: 1-7
Page count: 7