Restoring Punctuation and Capitalization Using Transformer Models

被引:5
作者
Varavs, Andris [1 ]
Salimbajevs, Askars [1 ]
机构
[1] Tilde, Vienibas Gatve 75A, Riga, Latvia
来源
STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2018 | 2018年 / 11171卷
关键词
Speech recognition; Punctuation restoration; Capitalization restoration; Transformer;
D O I
10.1007/978-3-030-00810-9_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Restoring punctuation and capitalization in the output of automatic speech recognition (ASR) system greatly improves readability and extends the number of downstream applications. We present a Transformer-based method for restoring punctuation and capitalization for Latvian and English, following the established approach of using neural machine translation (NMT) models. NMT methods here pose a challenge as the length of the predicted sequence does not always match the length of the input sequence. We offer two solutions to this problem: a simple target sequence cutting or padding by force and a more sophisticated attention alignment-based method. Our approach reaches new state of the art results for Latvian and competitive results on English.
引用
收藏
页码:91 / 102
页数:12
相关论文
共 30 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] [Anonymous], 2018, EMPIRICAL EVALUATION
  • [3] [Anonymous], 2015, P INT WORKSH SPOK LA
  • [4] [Anonymous], 2005, P MACH TRANSL SUMM 1, P79
  • [5] [Anonymous], 2014, IWSLT
  • [6] [Anonymous], 8 ANN C INT SPEECH C
  • [7] [Anonymous], RECENT ADV NATURAL L
  • [8] [Anonymous], 2014, 3 INT C LEARN REPR
  • [9] Bilingual Experiments on Automatic Recovery of Capitalization and Punctuation of Automatic Speech Transcripts
    Batista, Fernando
    Moniz, Helena
    Trancoso, Isabel
    Mamede, Nuno
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (02): : 474 - 485
  • [10] Beaufays F, 2013, INT CONF ACOUST SPEE, P6749, DOI 10.1109/ICASSP.2013.6638968