MPEG-I Immersive Audio - Reference Model For The Virtual/Augmented Reality Audio Standard

被引:7
作者
Herre, Juergen [1 ,2 ,3 ]
Disch, Sascha [3 ]
机构
[1] Int Audio Labs Erlangen, Erlangen, Germany
[2] Joint Inst Friedrich Alexander Univ Erlangen Nurn, Erlangen, Germany
[3] Fraunhofer IIS, Erlangen, Germany
来源
JOURNAL OF THE AUDIO ENGINEERING SOCIETY | 2023年 / 71卷 / 05期
关键词
Compendex;
D O I
10.17743/jaes.2022.0074
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
MPEG-I Immersive Audio is a forthcoming standard that is under development within the MPEG Audio group (ISO/IEC JTC1/SC29/WG6) to provide a compressed representation and rendering of audio for Virtual and Augmented Reality (VR/AR) applications with six degrees of freedom (6DoF). MPEG-I Immersive Audio supports bitrate-efficient and high-quality storage/transmission of complex virtual scenes including sources with spatial extent and distinct radiation characteristics (like musical instruments) as well as geometry description of acoustically relevant elements (e.g., walls, doors, occluders). The rendering process includes detailed modeling of room acoustics and complex acoustic phenomena such as occlusion and diffraction due to acoustic obstacles and Doppler effects as well as interactivity with the user. Based on many contributions, this paper reports on the state of the MPEG-I Immersive Audio standardization process and its first technical Reference Model architecture. MPEG-I Immersive Audio establishes the first long-term stable audio format specification in the field of VR/AR and can be used for many consumer applications such as broadcasting, streaming, social VR/AR, or Metaverse technology.
引用
收藏
页码:229 / 240
页数:12
相关论文
共 34 条
  • [1] Anemuller C., 2023, J. Audio Eng. Soc., V71, pxx
  • [2] Apple, Listen with Personalized Spatial Audio for AirPods and Beats
  • [3] Blauert J., 2013, The Technology of Binaural Listening, DOI [DOI 10.1007/978-3-642-37762-4, 10.1007/978-3-642-37762-4]
  • [4] Bosi M, 1997, J AUDIO ENG SOC, V45, P789
  • [5] Brinkmann F., 2017, The FABIAN Head-Related Transfer FunctionDataBase, DOI [10.14279/depositonce-5718.5, DOI 10.14279/DEPOSITONCE-5718.5]
  • [6] A High Resolution and Full-Spherical Head-Related Transfer Function Database for Different Head-Above-Torso Orientations
    Brinkmann, Fabian
    Lindau, Alexander
    Weinzierl, Stefan
    Van De Par, Steven
    Mueller-Trapet, Markus
    Opdam, Rob
    Vorlaender, Michael
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2017, 65 (10): : 841 - 848
  • [7] EBU, 2007, Tech. Rep. 3324
  • [8] Epic Games Inc., The Most Powerful Real-Time 3D Creation Tool-Unreal Engine
  • [9] Genelec, Aural ID
  • [10] Herre J., 2008, IEEE Signal Process. Mag., V25, P137, DOI [DOI 10.1109/MSP.2008.918684, 10.1109/MSP.2008.918684]