Focal Loss for Punctuation Prediction

被引:9
|
作者
Yi, Jiangyan [1 ]
Tao, Jianhua [1 ,2 ,3 ]
Tian, Zhengkun [1 ,3 ]
Bai, Ye [1 ,3 ]
Fan, Cunhang [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
来源
INTERSPEECH 2020 | 2020年
基金
中国国家自然科学基金;
关键词
focal loss; class imbalance; punctuation prediction; speech recognition; SPEECH RECOGNITION; MODELS;
D O I
10.21437/Interspeech.2020-1638
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Many approaches have been proposed to predict punctuation marks. Previous results demonstrate that these methods are effective. However, there still exists class imbalance problem during training. Most of the classes in the training set for punctuation prediction are non-punctuation marks. This will affect the performance of punctuation prediction tasks. Therefore, this paper uses a focal loss to alleviate this issue. The focal loss can down-weight easy examples and focus training on a sparse set of hard examples. Experiments are conducted on IWSLT2011 datasets. The results show that the punctuation predicting models trained with a focal loss obtain performance improvement over that trained with a cross entropy loss by up to 2.7% absolute overall F-1-score on test set. The proposed model also outperforms previous state-of-the-art models.
引用
收藏
页码:721 / 725
页数:5
相关论文
共 50 条
  • [21] Punctuation Prediction for Chinese Spoken Sentence Based on Model Combination
    Chen, Xiao
    Ke, Dengfeng
    Xu, Bo
    PRACTICAL APPLICATIONS OF INTELLIGENT SYSTEMS, ISKE 2013, 2014, 279 : 1069 - 1078
  • [22] Attention-based bidirectional LSTM for Chinese punctuation prediction
    Li, Jinliang
    Yin, Chengfeng
    Jia, Zhen
    Li, Tianrui
    Tang, Min
    DATA SCIENCE AND KNOWLEDGE ENGINEERING FOR SENSING DECISION SUPPORT, 2018, 11 : 708 - 714
  • [23] Attention-based bidirectional LSTM for Chinese punctuation prediction
    Li, Jinliang
    Yin, Chengfeng
    Jia, Zhen
    Li, Tianrui
    Tang, Min
    DATA SCIENCE AND KNOWLEDGE ENGINEERING FOR SENSING DECISION SUPPORT, 2018, 11 : 485 - 491
  • [24] Punctuation Prediction for Vietnamese Texts Using Conditional Random Fields
    Pham, Quang H.
    Nguyen, Binh T.
    Nguyen Viet Cuong
    SOICT 2019: PROCEEDINGS OF THE TENTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, : 322 - 327
  • [25] Token-Level Supervised Contrastive Learning for Punctuation Restoration
    Huang, Qiushi
    Ko, Tom
    Tang, H. Lilian
    Liu, Xubo
    Wu, Bo
    INTERSPEECH 2021, 2021, : 2012 - 2016
  • [26] EDCLoc: a prediction model for mRNA subcellular localization using improved focal loss to address multi-label class imbalance
    Deng, Yu
    Jia, Jianhua
    Yi, Mengyue
    BMC GENOMICS, 2024, 25 (01):
  • [27] Task-based Meta Focal Loss for Multilingual Low-resource Speech Recognition
    Chen, Yaqi
    Zhang, Wenlin
    Zhang, Hao
    Qu, Dan
    Yang, Xu-Kui
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (11)
  • [28] Experiments in Character-level Neural Network Models for Punctuation
    Gale, William
    Parthasarathy, Sarangarajan
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2794 - 2798
  • [29] FOCAL TEXT: AN ACCURATE TEXT DETECTION WITH FOCAL LOSS
    Tian, Xiaowei
    Wu, Dao
    Wang, Rui
    Cao, Xiaochun
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2984 - 2988
  • [30] Analysis of Punctuation Prediction Models for Automated Transcript Generation in MOOC Videos
    Garg, Bhrigu
    Anika
    PROCEEDINGS OF THE 2018 IEEE 6TH INTERNATIONAL CONFERENCE ON MOOCS, INNOVATION AND TECHNOLOGY IN EDUCATION (MITE 2018), 2018, : 19 - 26