MPEG-I Immersive Audio - Reference Model For The Virtual/Augmented Reality Audio Standard

被引:7
作者
Herre, Juergen [1 ,2 ,3 ]
Disch, Sascha [3 ]
机构
[1] Int Audio Labs Erlangen, Erlangen, Germany
[2] Joint Inst Friedrich Alexander Univ Erlangen Nurn, Erlangen, Germany
[3] Fraunhofer IIS, Erlangen, Germany
来源
JOURNAL OF THE AUDIO ENGINEERING SOCIETY | 2023年 / 71卷 / 05期
关键词
Compendex;
D O I
10.17743/jaes.2022.0074
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
MPEG-I Immersive Audio is a forthcoming standard that is under development within the MPEG Audio group (ISO/IEC JTC1/SC29/WG6) to provide a compressed representation and rendering of audio for Virtual and Augmented Reality (VR/AR) applications with six degrees of freedom (6DoF). MPEG-I Immersive Audio supports bitrate-efficient and high-quality storage/transmission of complex virtual scenes including sources with spatial extent and distinct radiation characteristics (like musical instruments) as well as geometry description of acoustically relevant elements (e.g., walls, doors, occluders). The rendering process includes detailed modeling of room acoustics and complex acoustic phenomena such as occlusion and diffraction due to acoustic obstacles and Doppler effects as well as interactivity with the user. Based on many contributions, this paper reports on the state of the MPEG-I Immersive Audio standardization process and its first technical Reference Model architecture. MPEG-I Immersive Audio establishes the first long-term stable audio format specification in the field of VR/AR and can be used for many consumer applications such as broadcasting, streaming, social VR/AR, or Metaverse technology.
引用
收藏
页码:229 / 240
页数:12
相关论文
共 34 条
[21]  
ISO/IEC, 2012, International Standard 23003-3:2012
[22]  
ISO/IEC, 1997, Standard 13818-7:1997
[23]  
ISO/IEC, 2007, Standard 23003-1:2007
[24]  
ITU-R, 2019, RECOMMENDATION ITU R
[25]  
Magic Leap Inc., Home
[26]  
microsoft, Microsoft HoloLens 2
[27]  
Neuendorf M, 2013, J AUDIO ENG SOC, V61, P956
[28]  
Perez-Ortiz M, 2017, Arxiv, DOI arXiv:1712.03686
[29]  
Pulkki V, 1997, J AUDIO ENG SOC, V45, P456
[30]  
Robotham T., 2018, paper 10131