Automatic Speech Recognition System for Lithuanian Broadcast Audio

被引:5
作者
Alumae, Tanel [1 ]
Tilk, Ottokar [1 ]
机构
[1] Tallinn Univ Technol, Inst Cybernet, EE-19086 Tallinn, Estonia
来源
HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE | 2016年 / 289卷
关键词
speech recognition; punctuation restoration; Lithuanian;
D O I
10.3233/978-1-61499-701-6-39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the development of an automatic broadcast data transcription system for Lithuanian. The system performs fully automatic transcription of broadcast media recordings, including speech/non-speech detection, speaker diarization, speech-to-text conversion and automatic punctuation restoration. The system was developed in collaboration with the Baltic Media Monitoring Group (BMMG). The system is currently used in production for performing various broadcast speech monitoring tasks.
引用
收藏
页码:39 / 45
页数:7
相关论文
共 19 条
  • [1] Alumae T., 2015, SLTU
  • [2] Multistage speaker diarization of broadcast news
    Barras, Claude
    Zhu, Xuan
    Meignier, Sylvain
    Gauvain, Jean-Luc
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (05): : 1505 - 1512
  • [3] Bird S., 2009, Natural Language Processing with Python
  • [4] Davel M., 2015, INTERSPEECH
  • [5] Front-End Factor Analysis for Speaker Verification
    Dehak, Najim
    Kenny, Patrick J.
    Dehak, Reda
    Dumouchel, Pierre
    Ouellet, Pierre
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 788 - 798
  • [6] Fraga-Silva T., 2015, ASRU
  • [7] Gales M. J., 2015, ICASSP
  • [8] Grezl F., 2016, SLTU
  • [9] Harper Mary, 2013, ASRU
  • [10] Lileikyte R., 2016, SLTU