SUPERB @ SLT 2022: CHALLENGE ON GENERALIZATION AND EFFICIENCY OF SELF-SUPERVISED SPEECH REPRESENTATION LEARNING

被引:9
|
作者
Feng, Tzu-Hsun [1 ]
Dong, Annie [2 ]
Yeh, Ching-Feng [2 ]
Yang, Shu-Wen [1 ]
Lin, Tzu-Quan [1 ]
Shi, Jiatong
Chang, Kai-Wei [1 ]
Huang, Zili [4 ]
Wu, Haibin [1 ]
Chang, Xuankai [3 ]
Watanabe, Shinji [3 ]
Mohamed, Abdelrahman [2 ]
Li, Shang-Wen [2 ]
Lee, Hung-Yi [1 ]
机构
[1] Natl Taiwan Univ, Taipei City, Taiwan
[2] Meta, Menlo Pk, CA USA
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[4] Johns Hopkins Univ, Baltimore, MD 21218 USA
来源
2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT | 2022年
关键词
Self-supervised Learning; Pre-training; Network Compression;
D O I
10.1109/SLT54892.2023.10022770
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present the SUPERB challenge at SLT 2022, which aims at learning self-supervised speech representation for better performance, generalization, and efficiency. The challenge builds upon the SUPERB benchmark and implements metrics to measure the computation requirements of self-supervised learning (SSL) representation and to evaluate its generalizability and performance across the diverse SUPERB tasks. The SUPERB benchmark provides comprehensive coverage of popular speech processing tasks, from speech and speaker recognition to audio generation and semantic understanding. As SSL has gained interest in the speech community and showed promising outcomes, we envision the challenge to uplevel the impact of SSL techniques by motivating more practical designs of techniques beyond task performance. We summarize the results of 14 submitted models in this paper. We also discuss the main findings from those submissions and the future directions of SSL research.
引用
收藏
页码:1096 / 1103
页数:8
相关论文
共 50 条
  • [31] CHARACTERIZING THE ADVERSARIAL VULNERABILITY OF SPEECH SELF-SUPERVISED LEARNING
    Wu, Haibin
    Zheng, Bo
    Li, Xu
    Wu, Xixin
    Lee, Hung-Yi
    Meng, Helen
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3164 - 3168
  • [32] SELF-SUPERVISED REPRESENTATION LEARNING FROM ELECTROENCEPHALOGRAPHY SIGNALS
    Banville, Hubert
    Albuquerque, Isabela
    Hyvarinen, Aapo
    Moffat, Graeme
    Engemann, Denis-Alexander
    Gramfort, Alexandre
    2019 IEEE 29TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2019,
  • [33] Random Field Augmentations for Self-Supervised Representation Learning
    Mansfield, Philip Andrew
    Afkanpour, Arash
    Morningstar, Warren Richard
    Singhal, Karan
    NEURIPS WORKSHOP ON SYMMETRY AND GEOMETRY IN NEURAL REPRESENTATIONS, 2023, 228 : 292 - 302
  • [34] Video Motion Perception for Self-supervised Representation Learning
    Li, Wei
    Luo, Dezhao
    Fang, Bo
    Li, Xiaoni
    Zhou, Yu
    Wang, Weiping
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 508 - 520
  • [35] Self-supervised representation learning by predicting visual permutations
    Zhao, Qilu
    Dong, Junyu
    KNOWLEDGE-BASED SYSTEMS, 2020, 210
  • [36] Self-supervised Graph Representation Learning with Variational Inference
    Liao, Zihan
    Liang, Wenxin
    Liu, Han
    Mu, Jie
    Zhang, Xianchao
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT III, 2021, 12714 : 116 - 127
  • [37] Self-Supervised ECG Representation Learning for Emotion Recognition
    Sarkar, Pritam
    Etemad, Ali
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (03) : 1541 - 1554
  • [38] Self-Supervised Hypergraph Representation Learning for Sociological Analysis
    Sun, Xiangguo
    Cheng, Hong
    Liu, Bo
    Li, Jia
    Chen, Hongyang
    Xu, Guandong
    Yin, Hongzhi
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (11) : 11860 - 11871
  • [39] Self-Supervised Representation Learning for Video Quality Assessment
    Jiang, Shaojie
    Sang, Qingbing
    Hu, Zongyao
    Liu, Lixiong
    IEEE TRANSACTIONS ON BROADCASTING, 2023, 69 (01) : 118 - 129
  • [40] Self-supervised Representation Learning Using 360° Data
    Li, Junnan
    Liu, Jianquan
    Wong, Yongkang
    Nishimura, Shoji
    Kankanhalli, Mohan S.
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 998 - 1006