Focal Loss for Punctuation Prediction

Cited by: 9
Authors
Yi, Jiangyan [1]
Tao, Jianhua [1, 2, 3]
Tian, Zhengkun [1, 3]
Bai, Ye [1, 3]
Fan, Cunhang [1, 3]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
Source
INTERSPEECH 2020 | 2020
Funding
National Natural Science Foundation of China;
Keywords
focal loss; class imbalance; punctuation prediction; speech recognition; models;
DOI
10.21437/Interspeech.2020-1638
Chinese Library Classification
R36 [Pathology]; R76 [Otorhinolaryngology];
Discipline classification codes
100104; 100213;
Abstract
Many approaches have been proposed to predict punctuation marks, and previous results demonstrate that these methods are effective. However, a class imbalance problem remains during training: most of the labels in the training set for punctuation prediction are non-punctuation, which hurts the performance of punctuation prediction. This paper therefore uses a focal loss to alleviate the issue. The focal loss down-weights easy examples and focuses training on a sparse set of hard examples. Experiments are conducted on the IWSLT2011 datasets. The results show that punctuation prediction models trained with a focal loss improve over those trained with a cross-entropy loss by up to 2.7% absolute overall F1-score on the test set. The proposed model also outperforms previous state-of-the-art models.
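The down-weighting described in the abstract can be illustrated with a minimal sketch. It assumes the standard focal loss form, FL(p_t) = -alpha_t * (1 - p_t)^gamma * log(p_t), where p_t is the probability the model assigns to the true class. The abstract does not state which variant or hyperparameters the paper uses, so the value of gamma, the optional alpha weights, the four-class label set, and the example probabilities below are all assumptions chosen for illustration.

```python
import math


def cross_entropy(probs, target):
    """Standard cross-entropy for one token: -log(p_t)."""
    return -math.log(probs[target])


def focal_loss(probs, target, gamma=2.0, alpha=None):
    """Focal loss for one token: -alpha_t * (1 - p_t)**gamma * log(p_t).

    probs  : predicted class probabilities for the token (sum to 1)
    target : index of the true class
    gamma  : focusing parameter; gamma = 0 recovers cross-entropy
    alpha  : optional per-class weights (an assumption, not from the abstract)
    """
    p_t = probs[target]
    weight = 1.0 if alpha is None else alpha[target]
    return -weight * (1.0 - p_t) ** gamma * math.log(p_t)


# Hypothetical label set: 0 = no punctuation, 1 = comma, 2 = period, 3 = question mark
easy = [0.95, 0.03, 0.01, 0.01]  # confident, correct "no punctuation" token
hard = [0.60, 0.30, 0.05, 0.05]  # true label is comma, but the model prefers class 0

for name, probs, target in [("easy", easy, 0), ("hard", hard, 1)]:
    ce = cross_entropy(probs, target)
    fl = focal_loss(probs, target, gamma=2.0)
    print(f"{name}: CE={ce:.4f}  FL={fl:.4f}  FL/CE={fl / ce:.4f}")
```

With gamma = 2, the ratio FL/CE equals (1 - p_t)^2, so the easy token keeps about 0.25% of its cross-entropy loss while the hard token keeps 49%. This is how the abundant, easily classified non-punctuation tokens contribute less to the gradient than the rarer, harder punctuation tokens.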
Pages: 721-725
Page count: 5
Related papers
50 in total
  • [31] FOCAL LOSS AND DOUBLE-EDGE-TRIGGERED DETECTOR FOR ROBUST SMALL-FOOTPRINT KEYWORD SPOTTING
    Liu, Bin
    Nie, Shuai
    Zhang, Yaping
    Liang, Shan
    Yang, Zhanlei
    Liu, Wenju
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6361 - 6365
  • [32] Bidirectional LSTM for Automatic Punctuation Restoration
    Salimbajevs, Askars
    HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE, 2016, 289 : 59 - 65
  • [33] RESTORING PUNCTUATION AND CAPITALIZATION IN TRANSCRIBED SPEECH
    Gravano, Agustin
    Jansche, Martin
    Bacchiani, Michiel
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4741 - +
  • [34] Imbalanced survival prediction for gastric cancer patients based on improved XGBoost with cost sensitive and focal loss
    Xu, Liangchen
    Guo, Chonghui
    EXPERT SYSTEMS, 2024, 41 (11)
  • [35] CAPITALIZATION AND PUNCTUATION RESTORATION FOR ROMANIAN LANGUAGE
    Caranica, Alexandru
    Cucu, Horia
    Buzo, Andi
    Burileanu, Corneliu
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2015, 77 (03): : 95 - 106
  • [36] Credit Risk Prediction Based on DenseNet-BC of Fusion Focal Loss and Static restart SGD
    Su, Ke
    Zheng, Shanhong
    Liu, Gang
    2021 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, INTERNET OF PEOPLE, AND SMART CITY INNOVATIONS (SMARTWORLD/SCALCOM/UIC/ATC/IOP/SCI 2021), 2021, : 494 - 500
  • [37] Extending the punctuation module for European Portuguese
    Batista, Fernando
    Moniz, Helena
    Trancoso, Isabel
    Meinedo, Hugo
    Mata, Ana Isabel
    Mamede, Nuno
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1509 - +
  • [38] An Explainable ADASYN-Based Focal Loss Approach for Credit Assessment
    Shahee, Shaukat Ali
    Patel, Rujavi
    JOURNAL OF FORECASTING, 2025,
  • [39] Constrained-Focal-Loss Based Deep Learning for Segmentation of Spores
    Zhao, Yaochi
    Lin, Fusheng
    Liu, Shiguang
    Hu, Zhuhua
    Li, Hui
    Bai, Yong
    IEEE ACCESS, 2019, 7 : 165029 - 165038
  • [40] Focal Loss for Region Proposal Network
    Chen, Chengpeng
    Song, Xinhang
    Jiang, Shuqiang
    PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 368 - 380