Study on data augmentation methods for deep neural network-based audio tagging

被引:1
|
作者
Kim, Bum-Jun
Moon, Hyeongi
Park, Sung-Wook
Park, Young Cheol
机构
来源
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA | 2018年 / 37卷 / 06期
关键词
Audio tagging; DNN (Deep Neural Network); Data augmentation; Parameter tuning;
D O I
10.7776/ASK.2018.37.6.475
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we present a study on data augmentation methods for DNN (Deep Neural Network)-based audio tagging. In this system, an audio signal is converted into a mel-spectrogram and used as an input to the DNN for audio tagging. To cope with the problem associated with a small number of training data, we augment the training samples using time stretching, pitch shifting, dynamic range compression, and block mixing. In this paper, we derive optimal parameters and combinations for the augmentation methods through audio tagging simulations.
引用
收藏
页码:475 / 482
页数:8
相关论文
共 50 条
  • [1] Audio Tagging with Compact Feedforward Sequential Memory Network and Audio-to-Audio Ratio Based Data Augmentation
    Huang, Zhiying
    Zhang, Shiliang
    Lei, Ming
    INTERSPEECH 2019, 2019, : 3377 - 3381
  • [2] Analyzing Data Augmentation Methods for Convolutional Neural Network-based Brain-Computer Interfaces
    Faria, Gabriel
    de Souza, Gabriel Henrique
    Bernardino, Heder
    Motta, Luciana
    Vieira, Alex
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [3] Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian
    Stankovic, Ranka
    Sandrih, Branislava
    Krstev, Cvetana
    Utvic, Milos
    Skoric, Mihailo
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3954 - 3962
  • [4] Deep neural network-based approach for processing sequential data
    Lavanya Devi Golagani
    Naresh Nelaturi
    Srinivasa Rao Kurapati
    CSI Transactions on ICT, 2020, 8 (2) : 263 - 270
  • [5] Deep neural network-based bandwidth enhancement of photoacoustic data
    Gutta, Sreedevi
    Kadimesetty, Venkata Suryanarayana
    Kalva, Sandeep Kumar
    Pramanik, Manojit
    Ganapathy, Sriram
    Yalavarthy, Phaneendra K.
    JOURNAL OF BIOMEDICAL OPTICS, 2017, 22 (11)
  • [6] DEEP NEURAL NETWORK-BASED DATA RECONSTRUCTION FOR LANDSLIDE DETECTION
    Utomo, Darmawan
    Hu, Liang-Cheng
    Hsiung, Pao-Ann
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 3119 - 3122
  • [7] Deep Neural Network-Based Filtering Techniques for Data Assimilation
    Hoang, Truong-Vinh
    Matthies, Hermann G.
    ERCIM NEWS, 2020, (122): : 23 - 23
  • [8] Deep learning hotspots detection with generative adversarial network-based data augmentation
    Cheng, Zeyuan
    Behdinan, Kamran
    JOURNAL OF MICRO-NANOPATTERNING MATERIALS AND METROLOGY-JM3, 2022, 21 (02):
  • [9] A Convolutional Neural Network-based Ancient Sundanese Character Classifier with Data Augmentation
    Hidayat, Alam Ahmad
    Purwandari, Kartika
    Cenggoro, Tjeng Wawan
    Pardamean, Bens
    5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE 2020, 2021, 179 : 195 - 201
  • [10] A COMPARISON OF AUDIO SIGNAL PREPROCESSING METHODS FOR DEEP NEURAL NETWORKS ON MUSIC TAGGING
    Choi, Keunwoo
    Fazekas, Gyorgy
    Sandler, Mark
    Cho, Kyunghyun
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 1870 - 1874