Enriching Telepresence with Semantic-driven Holographic Communication

被引:9
作者
Cheng, Ruizhi [1 ]
Liu, Kaiyan [1 ]
Wu, Nan [1 ]
Han, Bo [1 ]
机构
[1] George Mason Univ, Fairfax, VA 22030 USA
来源
PROCEEDINGS OF THE 22ND ACM WORKSHOP ON HOT TOPICS IN NETWORKS, HOTNETS 2023 | 2023年
关键词
Semantic Communication; Immersive Telepresence; PREDICTION; MODEL;
D O I
10.1145/3626111.3628184
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Achieving the optimal balance of minimizing bandwidth consumption and end-to-end latency while preserving a satisfactory level of visual quality becomes the ultimate goal of live, interactive holographic communication, a fundamental building block of immersive telepresence envisioned for 6G. Nevertheless, achieving this ambitious goal poses significant challenges for mobile devices with limited computing power, considering the substantial amount of 3D data to stream, the demanding latency requirements, and the high computation workload involved. Instead of distributing immersive content bit by bit, in this position paper, we propose to deliver semantic information extracted from telepresence participants to drastically reduce Internet bandwidth usage for task-oriented applications such as remote collaboration. We contribute a taxonomy by categorizing related semantics into three different types (i.e., keypoints, 2D images, and text), pinpoint the open research challenges associated with developing a practical system for each category in our comprehensive research agenda, and delve into the potential solutions for overcoming these challenges. The preliminary results from our proof-of-concept implementation that harnesses keypoint-based semantics (partially) validate the feasibility of our research agenda.
引用
收藏
页码:147 / 156
页数:10
相关论文
共 107 条
[1]  
Rusu AA, 2016, Arxiv, DOI [arXiv:1606.04671, 10.48550/arXiv.1606.04671, DOI 10.48550/ARXIV.1606.04671, DOI 10.43550/ARXIV:1606.04671]
[2]  
Achlioptas P, 2018, PR MACH LEARN RES, V80
[3]   Computer generated holograms from three dimensional meshes using an analytic light transport model [J].
Ahrenberg, Lukas ;
Benzie, Philip ;
Magnor, Marcus ;
Watson, John .
APPLIED OPTICS, 2008, 47 (10) :1567-1574
[4]   Practical Saccade Prediction for Head-Mounted Displays: Towards a Comprehensive Model [J].
Arabadzhiyska, Elena ;
Tursun, Cara ;
Seidel, Hans-Peter ;
Didyk, Piotr .
ACM TRANSACTIONS ON APPLIED PERCEPTION, 2023, 20 (01)
[5]   Saccade Landing Position Prediction for Gaze-Contingent Rendering [J].
Arabadzhiyska, Elena ;
Tursun, Okan Tarhan ;
Myszkowski, Karol ;
Seidel, Hans-Peter ;
Didyk, Piotr .
ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (04)
[6]  
Attal B., 2023, P IEEECVF CVPR
[7]   End-to-end delay analysis of videoconferencing over packet-switched networks [J].
Baldi, M ;
Ofek, Y .
IEEE-ACM TRANSACTIONS ON NETWORKING, 2000, 8 (04) :479-492
[8]   Real-time RGBD-based Extended Body Pose Estimation [J].
Bashirov, Renat ;
Ianina, Anastasia ;
Iskakov, Karim ;
Kononenko, Yevgeniy ;
Strizhkova, Valeriya ;
Lempitsky, Victor ;
Vakhitov, Alexander .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, :2806-2815
[9]  
Bergmann Rachel, 2022, M THE PANDEMIC VIDEO, P1
[10]  
Burov A., 2021, P IEEECVF ICCV