VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation

被引:0
|
作者
Wang, Changhan [1 ]
Riviere, Morgane [1 ]
Lee, Ann [1 ]
Wu, Anne [1 ]
Talnikar, Chaitanya [1 ]
Haziza, Daniel [1 ]
Williamson, Mary [1 ]
Pino, Juan [1 ]
Dupoux, Emmanuel [1 ]
机构
[1] Facebook AI Research, Menlo Pk, CA 94025 USA
来源
59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021) | 2021年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce VoxPopuli, a large-scale multilingual corpus providing 400K hours of unlabeled speech data in 23 languages. It is the largest open data to date for unsupervised representation learning as well as semi-supervised learning. VoxPopuli also contains 1.8K hours of transcribed speeches in 15 languages and their aligned oral interpretations into 15 target languages totaling 17.3K hours. We provide speech recognition (ASR) baselines and validate the versatility of VoxPopuli unlabeled data in semi-supervised ASR and speech-to-text translation under challenging out-of-domain settings.
引用
收藏
页码:993 / 1003
页数:11
相关论文
共 50 条
  • [1] Large-Scale Self- and Semi-Supervised Learning for Speech Translation
    Wang, Changhan
    Wu, Anne
    Pino, Juan
    Baevski, Alexei
    Auli, Michael
    Conneau, Alexis
    INTERSPEECH 2021, 2021, : 2242 - 2246
  • [2] BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition
    Zhang, Yu
    Park, Daniel S.
    Han, Wei
    Qin, James
    Gulati, Anmol
    Shor, Joel
    Jansen, Aren
    Xu, Yuanzhong
    Huang, Yanping
    Wang, Shibo
    Zhou, Zongwei
    Li, Bo
    Ma, Min
    Chan, William
    Yu, Jiahui
    Wang, Yongqiang
    Cao, Liangliang
    Sim, Khe Chai
    Ramabhadran, Bhuvana
    Sainath, Tara N.
    Beaufays, Francoise
    Chen, Zhifeng
    Le, Quoc, V
    Chiu, Chung-Cheng
    Pang, Ruoming
    Wu, Yonghui
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) : 1519 - 1532
  • [3] Nonnegative Spectral Clustering for Large-Scale Semi-supervised Learning
    Hu, Weibo
    Chen, Chuan
    Ye, Fanghua
    Zheng, Zibin
    Ling, Guohui
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 287 - 291
  • [4] Semi-supervised learning on large-scale geotagged photos for situation recognition
    Tang, Mengfan
    Nie, Feiping
    Pongpaichet, Siripen
    Jain, Ramesh
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 48 : 310 - 316
  • [5] Semi-supervised eigenvectors for large-scale locally-biased learning
    Department of Applied Mathematics and Computer Science, Technical University of Denmark, Richard Petersens Plads, Lyngby
    2800, Denmark
    不详
    DC
    94720-1776, United States
    J. Mach. Learn. Res., (3691-3734):
  • [6] Exploring Latent Sparse Graph for Large-Scale Semi-supervised Learning
    Wang, Zitong
    Wang, Li
    Chan, Raymond
    Zeng, Tieyong
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT IV, 2023, 13716 : 367 - 383
  • [7] Incremental learning algorithm for large-scale semi-supervised ordinal regression
    Chen, Haiyan
    Jia, Yizhen
    Ge, Jiaming
    Gu, Bin
    NEURAL NETWORKS, 2022, 149 : 124 - 136
  • [8] Semi-Supervised Eigenvectors for Large-Scale Locally-Biased Learning
    Hansen, Toke J.
    Mahoney, Michael W.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 3691 - 3734
  • [9] Semi-supervised Learning for Large Scale Image Cosegmentation
    Wang, Zhengxiang
    Liu, Rujie
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 393 - 400
  • [10] Learning Semi-Supervised Representation Towards a Unified Optimization Framework for Semi-Supervised Learning
    Li, Chun-Guang
    Lin, Zhouchen
    Zhang, Honggang
    Guo, Jun
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2767 - 2775