UNSUPERVISED CONTRASTIVE LEARNING OF SOUND EVENT REPRESENTATIONS

Cited by: 28
Authors
Fonseca, Eduardo [1]
Ortego, Diego [2]
McGuinness, Kevin [2]
O'Connor, Noel E. [2]
Serra, Xavier [1]
Affiliations
[1] Univ Pompeu Fabra, Mus Technol Grp, Barcelona, Spain
[2] Dublin City Univ, Insight Ctr Data Analyt, Dublin, Ireland
Source
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021
Funding
Science Foundation Ireland;
Keywords
Contrastive learning; sound event classification; audio representation learning; self-supervision;
DOI
10.1109/ICASSP39728.2021.9415009
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
Self-supervised representation learning can mitigate the limitations of recognition tasks that have little manually labeled data but abundant unlabeled data, a common scenario in sound event research. In this work, we explore unsupervised contrastive learning as a way to learn sound event representations. To this end, we propose to use the pretext task of contrasting differently augmented views of sound events. The views are computed primarily by mixing training examples with unrelated backgrounds, followed by other data augmentations. We analyze the main components of our method via ablation experiments. We evaluate the learned representations using linear evaluation and in two in-domain downstream sound event classification tasks: one using limited manually labeled data, and one using noisily labeled data. Our results suggest that unsupervised contrastive pre-training can mitigate the impact of data scarcity and increase robustness against noisy labels.
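The pretext task described in the abstract can be illustrated with a minimal NumPy sketch. It rests on several assumptions that the abstract does not state: the mixing is shown as a simple convex combination with a random weight, the contrastive objective is an NT-Xent style loss as popularized by SimCLR, and the names `mix_with_background`, `nt_xent_loss`, `alpha`, and `temperature` are hypothetical. It is meant only to show the general shape of contrasting two augmented views, not the authors' implementation.

```python
import numpy as np


def mix_with_background(patch, background, alpha, rng):
    """Mix a spectrogram patch with an unrelated background patch.

    Assumption: a plain convex combination with a random weight in
    [0, alpha]; the abstract does not specify the mixing rule.
    """
    lam = rng.uniform(0.0, alpha)
    return (1.0 - lam) * patch + lam * background


def nt_xent_loss(z1, z2, temperature=0.2):
    """NT-Xent style contrastive loss (assumed, not named in the abstract).

    z1[i] and z2[i] are embeddings of two views of the same example;
    every other embedding in the batch acts as a negative.
    """
    n = z1.shape[0]
    z = np.concatenate([z1, z2], axis=0)
    z = z / np.linalg.norm(z, axis=1, keepdims=True)  # cosine similarity
    sim = z @ z.T / temperature
    np.fill_diagonal(sim, -np.inf)  # exclude self-similarity
    # Index of the positive for each row: i <-> i + n.
    pos = np.concatenate([np.arange(n, 2 * n), np.arange(0, n)])
    # Numerically stable log-softmax over each row.
    row_max = sim.max(axis=1, keepdims=True)
    log_norm = row_max[:, 0] + np.log(np.exp(sim - row_max).sum(axis=1))
    log_prob = sim[np.arange(2 * n), pos] - log_norm
    return float(-log_prob.mean())
```

In a training loop, two patches drawn from the same clip would each be mixed with a different background, passed through further augmentations and an encoder, and the resulting embedding pairs fed to the loss. The encoder and the remaining augmentations are left out here because the abstract does not describe them.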
Pages: 371-375
Page count: 5