Spatial Audio Object Coding With Two-Step Coding Structure for Interactive Audio Service

被引:14
|
作者
Kim, Kwangki [1 ]
Seo, Jeongil [2 ]
Beack, Seungkwon [2 ]
Kang, Kyeongok [2 ]
Hahn, Minsoo [3 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Informat & Commun Engn, Taejon 305701, South Korea
[2] Elect & Telecommun Res Inst, Taejon 305606, South Korea
[3] Korea Adv Inst Sci & Technol, Dept Elect Engn, Taejon 305701, South Korea
关键词
Audio object; interactive audio service; residual coding; spatial audio object coding;
D O I
10.1109/TMM.2011.2168197
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An interactive audio service is a new conceptual audio service that provides the users with opportunities for a variety of experiences on the alternative and advanced audio services. In the interactive audio service, users can freely control various audio objects to make their own audio sounds. A spatial audio object coding (SAOC) is a useful technology that can support most parts of the interactive audio service with a relatively low bit-rate, but is very poor to perfect gain control of a certain audio object, i.e., the target audio object. In this paper, the SAOC with a two-step coding structure is proposed to efficiently handle the target audio object as well as the normal audio objects. A transform coded excitation (TCX) based residual coding scheme is presented in the context of the sound quality enhancement. From experimental results, it can be noted that the various audio objects can be successfully handled with respect to the bit-rate and the sound quality by using the proposed two-step coding structure SAOC.
引用
收藏
页码:1208 / 1216
页数:9
相关论文
共 50 条
  • [1] Efficient Residual Coding Method of Spatial Audio Object Coding with Two-Step Coding Structure for Interactive Audio Services
    Lee, Byonghwa
    Kim, Kwangki
    Hahn, Minsoo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (07): : 1949 - 1952
  • [2] Multi-step Coding Structure of Spatial Audio Object Coding
    Hu, Chenhao
    Hu, Ruimin
    Wang, Xiaochen
    Wu, Tingzhao
    Li, Dengshi
    MULTIMEDIA MODELING (MMM 2020), PT I, 2020, 11961 : 666 - 678
  • [3] Modified Spatial Audio Object Coding Scheme with Harmonic Extraction and Elimination Structure for Interactive Audio Service
    Park, Jihoon
    Kim, Kwangki
    Seo, Jeongil
    Hahn, Minsoo
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2906 - +
  • [4] MPEG Spatial Audio Object Coding-The ISO/MPEG Standard for Efficient Coding of Interactive Audio Scenes
    Herre, Juergen
    Purnhagen, Heiko
    Koppens, Jeroen
    Hellmuth, Oliver
    Engdegard, Jonas
    Hilpert, Johannes
    Villemoes, Lars
    Terentiv, Leon
    Falch, Cornelia
    Hoelzer, Andreas
    Valero, Maria Luis
    Resch, Barbara
    Mundt, Harald
    Oh, Hyen-O
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2012, 60 (09): : 655 - 673
  • [5] MPEG spatial audio object coding-The ISO/MPEG standard for efficient coding of interactive audio scenes
    Herre, J. (juergen.herre@audiolabs-erlangen.de), 1600, Audio Engineering Society (60):
  • [6] Interactive teleconferencing combining Spatial Audio Object Coding and DirAC technology
    Herre, Jürgen
    Falch, Cornelia
    Mahne, Dirk
    Del Galdo, Giovanni
    Kallinger, Markus
    Thiergart, Oliver
    AES: Journal of the Audio Engineering Society, 2011, 59 (12): : 924 - 935
  • [7] Interactive Teleconferencing Combining Spatial Audio Object Coding and DirAC Technology
    Herre, Juergen
    Falch, Cornelia
    Mahne, Dirk
    Del Galdo, Giovanni
    Kallinger, Markus
    Thiergart, Oliver
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2011, 59 (12): : 924 - 935
  • [8] DECORRELATION FOR AUDIO OBJECT CODING
    Villemoes, Lars
    Hirvonen, Toni
    Purnhagen, Heiko
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 706 - 710
  • [9] Mastering Signal Processing with Residual Coding Scheme in Spatial Audio Object Coding
    Kim, Kwangki
    Jong, Byeong-ok
    Park, Sanghyun
    Won, Yonggwan
    Kim, Jinsul
    2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND APPLICATIONS (ICISA 2013), 2013,
  • [10] Audio object coding for distributed audio data management applications
    Melih, K
    Gonzalez, R
    ICCS 2002: 8TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS SYSTEMS, VOLS 1 AND 2, PROCEEDINGS, 2002, : 727 - 731