ENHANCED METHOD OF AUDIO CODING USING CNN-BASED SPECTRAL RECOVERY WITH ADAPTIVE STRUCTURE

被引:0
|
作者
Shin, Seong-Hyeon [1 ]
Beack, Seung Kwon [2 ]
Lim, Wootaek [2 ]
Park, Hochong [1 ]
机构
[1] Kwangwoon Univ, Seoul, South Korea
[2] Elect & Telecommun Res Inst, Daejeon, South Korea
来源
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年
关键词
adaptive structure; audio coding; autoencoder; convolutional neural network; spectral recovery; SPEECH;
D O I
10.1109/icassp40776.2020.9054409
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A process of spectral recovery can enhance the performance of transform-based audio coding by transmitting only a portion of spectral data and recovering the missing spectral data in the decoder. This study proposes an enhanced method of audio coding based on spectral recovery with an adaptive structure that yields improved sound quality compared with the previous method. The spectral data to be recovered are arranged in an adaptive pattern depending on the difficulty of recovery. In addition, according to the spectral characteristics, prior information associated with these spectral data is selectively transmitted that helps a neural network improve the performance of magnitude recovery. Prior information also provides the signs of recovered magnitudes. A subjective performance evaluation shows that, for mono coding without window switching at 40 kbps, the proposed coding method provides better sound quality than the conventional method on average.
引用
收藏
页码:351 / 355
页数:5
相关论文
共 50 条
  • [1] AUDIO CODING BASED ON SPECTRAL RECOVERY BY CONVOLUTIONAL NEURAL NETWORK
    Shin, Seong-Hyeon
    Beack, Seung Kwon
    Lee, Taejin
    Park, Hochong
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 725 - 729
  • [2] A QP-adaptive Mechanism for CNN-based Filter in Video Coding
    Liu, Chao
    Sunyz, Heming
    Kattoz, Jiro
    Zeng, Xiaoyang
    Fan, Yibo
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 3195 - 3199
  • [3] An Effective CNN-Based Super Resolution Method for Video Coding
    Yin, Jun
    Peng, Shuang
    Lin, Jucai
    Jiang, Dong
    Fang, Cheng
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT I, 2023, 14233 : 124 - 134
  • [4] Probability Model Adjustment for the CNN-based Lossless Image Coding Method
    Kojima, Hiroki
    Kameda, Yusuke
    Kita, Yasuyo
    Matsuda, Ichiro
    Itoh, Susumu
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2021, 2021, 11766
  • [5] Highly Efficient Audio Coding With Blind Spectral Recovery Based on Machine Learning
    Kim, Jae-Won
    Beack, Seung Kwon
    Lim, Wootaek
    Park, Hochong
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1212 - 1216
  • [6] CNN-Based Joint SNR and Doppler Shift Classification Using Spectrogram Images for Adaptive Modulation and Coding
    Kojima, Shun
    Maruta, Kazuki
    Feng, Yi
    Ahn, Chang-Jun
    Tarokh, Vahid
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (08) : 5152 - 5167
  • [7] ADAPTIVE LOOP FILTER WITH A CNN-BASED CLASSIFICATION
    Lim, Wang-Q
    Pfaff, Jonathan
    Stallenberger, Bjoern
    Erfurt, Johannes
    Schwarz, Heiko
    Marpe, Detlev
    Wiegand, Thomas
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1836 - 1840
  • [8] CNN-Based Audio Front End Processing on Speech Recognition
    Fan, Ruchao
    Liu, Gang
    2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 349 - 354
  • [9] Image Classification with CNN-based Fisher Vector Coding
    Song, Yan
    Hong, Xinhai
    McLoughlin, Ian
    Dai, Lirong
    2016 30TH ANNIVERSARY OF VISUAL COMMUNICATION AND IMAGE PROCESSING (VCIP), 2016,
  • [10] CNN-based damage identification method of tied-arch bridge using spatial-spectral information
    Duan, Yuanfeng
    Chen, Qianyi
    Zhang, Hongmei
    Yun, Chung Bang
    Wu, Sikai
    Zhu, Qi
    SMART STRUCTURES AND SYSTEMS, 2019, 23 (05) : 507 - 520