Automatic Exploration of Optimal Data Processing Operations for Sound Data Augmentation Using Improved Differentiable Automatic Data Augmentation

被引:0
|
作者
Sugiura, Toki [1 ]
Nishizaki, Hiromitsu [1 ]
机构
[1] Univ Yamanashi, Grad Sch Med Engn & Agr Sci, Kofu, Japan
来源
关键词
acoustic scene classification; data augmentation; differentiable automatic data augmentation;
D O I
10.21437/Interspeech.2023-202
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Data augmentation is one of the methods used to robustly train machine learning models with a small dataset. This method randomly applies pre-defined data processing operations to input data, regardless of the characteristics of the input data. However, some data processing operations may be inappropriate for certain data. In this study, we propose a new method to automatically search for the best data processing operations for each sound file to be input into a sound classification neural network. The proposed method is an improvement on the previously proposed differentiable automatic data augmentation (DADA), which uses a differentiable neural network to select the optimal data processing operations. We evaluated our proposed method on an acoustic scene classification task on the ESC-50 dataset and demonstrated that the proposed method can train a more robust model compared to the original DADA-based data augmentation.
引用
收藏
页码:5411 / 5415
页数:5
相关论文
共 50 条
  • [31] A data augmentation method for fully automatic brain tumor segmentation
    Wang, Yu
    Ji, Yarong
    Xiao, Hongbing
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 149
  • [32] Data Augmentation using Evolutionary Image Processing
    Fujita, Kosaku
    Kobayashi, Masayuki
    Nagao, Tomoharu
    2018 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2018, : 275 - 280
  • [33] A Survey of the Effects of Data Augmentation for Automatic Speech Recognition Systems
    Manuel Ramirez, Jose
    Montalvo, Ana
    Ramon Calvo, Jose
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS (CIARP 2019), 2019, 11896 : 669 - 678
  • [34] INTERMIX: AN INTERFERENCE-BASED DATA AUGMENTATION AND REGULARIZATION TECHNIQUE FOR AUTOMATIC DEEP SOUND CLASSIFICATION
    Sawhney, Ramit
    Neerkaje, Atula Tejaswi
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3443 - 3447
  • [35] A CNN Sound Classification Mechanism Using Data Augmentation
    Chu, Hung-Chi
    Zhang, Young-Lin
    Chiang, Hao-Chu
    SENSORS, 2023, 23 (15)
  • [36] Smart Augmentation Learning an Optimal Data Augmentation Strategy
    Lemley, Joseph
    Bazrafkan, Shabab
    Corcoran, Peter
    IEEE ACCESS, 2017, 5 : 5858 - 5869
  • [37] AUTOMATIC DATA PROCESSING OF PERSONNEL DATA
    MORGAN, PL
    PERSONNEL JOURNAL, 1966, 45 (09) : 553 - 557
  • [38] Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation
    Bartelds, Martijn
    San, Nay
    McDonnell, Bradley
    Jurafsky, Dan
    Wieling, Martijn
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 715 - 729
  • [39] AUTOMATIC DATA PROCESSING
    RABINOWI.P
    MATHEMATICS OF COMPUTATION, 1966, 20 (94) : 341 - &
  • [40] AUTOMATIC DATA PROCESSING
    不详
    NATURE, 1961, 192 (480) : 1024 - +