SUPERB @ SLT 2022: CHALLENGE ON GENERALIZATION AND EFFICIENCY OF SELF-SUPERVISED SPEECH REPRESENTATION LEARNING

Cited by: 9
Authors
Feng, Tzu-Hsun [1 ]
Dong, Annie [2 ]
Yeh, Ching-Feng [2 ]
Yang, Shu-Wen [1 ]
Lin, Tzu-Quan [1 ]
Shi, Jiatong
Chang, Kai-Wei [1 ]
Huang, Zili [4 ]
Wu, Haibin [1 ]
Chang, Xuankai [3 ]
Watanabe, Shinji [3 ]
Mohamed, Abdelrahman [2 ]
Li, Shang-Wen [2 ]
Lee, Hung-Yi [1 ]
Affiliations
[1] Natl Taiwan Univ, Taipei City, Taiwan
[2] Meta, Menlo Pk, CA USA
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[4] Johns Hopkins Univ, Baltimore, MD 21218 USA
Source
2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT | 2022
Keywords
Self-supervised Learning; Pre-training; Network Compression
DOI
10.1109/SLT54892.2023.10022770
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We present the SUPERB challenge at SLT 2022, which aims at learning self-supervised speech representations for better performance, generalization, and efficiency. The challenge builds upon the SUPERB benchmark and implements metrics to measure the computational requirements of self-supervised learning (SSL) representations and to evaluate their generalizability and performance across the diverse SUPERB tasks. The SUPERB benchmark provides comprehensive coverage of popular speech processing tasks, from speech and speaker recognition to audio generation and semantic understanding. As SSL has gained interest in the speech community and shown promising outcomes, we envision the challenge raising the impact of SSL techniques by motivating more practical designs that look beyond task performance. We summarize the results of 14 submitted models in this paper. We also discuss the main findings from those submissions and the future directions of SSL research.
Pages: 1096-1103
Number of pages: 8