SUPERB @ SLT 2022: CHALLENGE ON GENERALIZATION AND EFFICIENCY OF SELF-SUPERVISED SPEECH REPRESENTATION LEARNING

Cited by: 9

Authors:

Feng, Tzu-Hsun [1]

Dong, Annie [2]

Yeh, Ching-Feng [2]

Yang, Shu-Wen [1]

Lin, Tzu-Quan [1]

Shi, Jiatong

Chang, Kai-Wei [1]

Huang, Zili [4]

Wu, Haibin [1]

Chang, Xuankai [3]

Watanabe, Shinji [3]

Mohamed, Abdelrahman [2]

Li, Shang-Wen [2]

Lee, Hung-Yi [1]

Affiliations:

[1] Natl Taiwan Univ, Taipei City, Taiwan

[2] Meta, Menlo Pk, CA USA

[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

[4] Johns Hopkins Univ, Baltimore, MD 21218 USA

Source:

2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT | 2022

Keywords:

Self-supervised Learning; Pre-training; Network Compression;

DOI:

10.1109/SLT54892.2023.10022770

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We present the SUPERB challenge at SLT 2022, which aims at learning self-supervised speech representations for better performance, generalization, and efficiency. The challenge builds upon the SUPERB benchmark and implements metrics to measure the computation requirements of self-supervised learning (SSL) representations and to evaluate their generalizability and performance across the diverse SUPERB tasks. The SUPERB benchmark provides comprehensive coverage of popular speech processing tasks, from speech and speaker recognition to audio generation and semantic understanding. As SSL has gained interest in the speech community and shown promising outcomes, we envision the challenge to uplevel the impact of SSL techniques by motivating more practical designs of techniques beyond task performance. We summarize the results of 14 submitted models in this paper. We also discuss the main findings from those submissions and the future directions of SSL research.

Pages: 1096-1103

Page count: 8
