Focal Loss for Punctuation Prediction

被引:9
|
作者
Yi, Jiangyan [1 ]
Tao, Jianhua [1 ,2 ,3 ]
Tian, Zhengkun [1 ,3 ]
Bai, Ye [1 ,3 ]
Fan, Cunhang [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
来源
INTERSPEECH 2020 | 2020年
基金
中国国家自然科学基金;
关键词
focal loss; class imbalance; punctuation prediction; speech recognition; SPEECH RECOGNITION; MODELS;
D O I
10.21437/Interspeech.2020-1638
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Many approaches have been proposed to predict punctuation marks. Previous results demonstrate that these methods are effective. However, there still exists class imbalance problem during training. Most of the classes in the training set for punctuation prediction are non-punctuation marks. This will affect the performance of punctuation prediction tasks. Therefore, this paper uses a focal loss to alleviate this issue. The focal loss can down-weight easy examples and focus training on a sparse set of hard examples. Experiments are conducted on IWSLT2011 datasets. The results show that the punctuation predicting models trained with a focal loss obtain performance improvement over that trained with a cross entropy loss by up to 2.7% absolute overall F-1-score on test set. The proposed model also outperforms previous state-of-the-art models.
引用
收藏
页码:721 / 725
页数:5
相关论文
共 50 条
  • [41] Improved Training for End-to-End Streaming Automatic Speech Recognition Model with Punctuation
    Kim, Hanbyul
    Seo, Seunghyun
    Lee, Lukas
    Baek, Seolki
    INTERSPEECH 2023, 2023, : 1653 - 1657
  • [42] Punctuation Prediction using a Bidirectional Recurrent Neural Network with Part-of-Speech Tagging
    Juin, Chin Char
    Wei, Richard Xiong Jun
    D'Haro, Luis Fernando
    Banchs, Rafael E.
    TENCON 2017 - 2017 IEEE REGION 10 CONFERENCE, 2017, : 1806 - 1811
  • [43] Restoring Punctuation and Capitalization Using Transformer Models
    Varavs, Andris
    Salimbajevs, Askars
    STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2018, 2018, 11171 : 91 - 102
  • [44] SELF-ATTENTION BASED MODEL FOR PUNCTUATION PREDICTION USING WORD AND SPEECH EMBEDDINGS
    Yi, Jiangyan
    Tao, Jianhua
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7270 - 7274
  • [45] Feature-weighted AdaBoost classifier for punctuation prediction in Tamil and Hindi NLP systems
    Mrinalini, K.
    Vijayalakshmi, P.
    Nagarajan, T.
    EXPERT SYSTEMS, 2022, 39 (04)
  • [46] Wind Turbine Blade Icing Prediction Using Focal Loss Function and CNN-Attention-GRU Algorithm
    Tao, Cheng
    Tao, Tao
    Bai, Xinjian
    Liu, Yongqian
    ENERGIES, 2023, 16 (15)
  • [47] Incorporating External POS Tagger for Punctuation Restoration
    Shi, Ning
    Wang, Wei
    Wang, Boxin
    Li, Jinfeng
    Liu, Xiangyu
    Lin, Zhouhan
    INTERSPEECH 2021, 2021, : 1987 - 1991
  • [48] Attention-based Graph ResNet with focal loss for epileptic seizure detection
    Dong, Changxu
    Zhao, Yanna
    Zhang, Gaobo
    Xue, Mingrui
    Chu, Dengyu
    He, Jiatong
    Ge, Xinting
    JOURNAL OF AMBIENT INTELLIGENCE AND SMART ENVIRONMENTS, 2022, 14 (01) : 61 - 73
  • [49] A Three-dimensional Detector Based on Focal Loss for Pulmonary Nodules Detection
    Wang, Lei
    Dai, Yaping
    Jia, Zhiyang
    Nie, Yongkang
    Liu, Liang
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8445 - 8449
  • [50] Combining Prosodic and Lexical Classifiers for Two-Pass Punctuation Detection in a Russian ASR System
    Khomitsevich, Olga
    Chistikov, Pavel
    Krivosheeva, Tatiana
    Epimakhova, Natalia
    Chernykh, Irina
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 161 - 169