Enhancing Chinese Word Segmentation via Pseudo Labels for Practicability

Cited by: 0
Authors
Huang, Kaiyu [1]
Liu, Junpeng [1]
Huang, Degen [1]
Xiong, Deyi [2, 3]
Liu, Zhuang [4]
Su, Jinsong [5]
Affiliations
[1] Dalian Univ Technol, Dalian, Peoples R China
[2] Tianjin Univ, Tianjin, Peoples R China
[3] Global Tone Commun Technol Co Ltd, Beijing, Peoples R China
[4] Dongbei Univ Finance & Econ, Dalian, Peoples R China
[5] Xiamen Univ, Xiamen, Peoples R China
Source
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021 | 2021
Funding
National Natural Science Foundation of China
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Pre-trained language models (e.g., BERT) significantly alleviate two traditional challenging problems for Chinese word segmentation (CWS): segmentation ambiguity and out-of-vocabulary (OOV) words. However, such improvements are usually achieved on traditional benchmark datasets and not close to an important goal of CWS: practicability (i.e., low complexity as a standalone task and high beneficiality to downstream tasks). To make a trade-off between traditional evaluation and practicability for CWS, we propose a semi-supervised neural method via pseudo labels. The neural method consists of a teacher model and a student model, which distills knowledge from unlabeled data to the student model so as to improve both in-domain and out-of-domain CWS. Experiments show that our proposed method can not only keep the practicability of the lightweight student model but also improve the performance of segmentation effectively. We also evaluate a range of heterogeneous neural architectures of CWS on downstream Chinese NLP tasks. Results of further experiments demonstrate that our proposed segmenter is reliable and practical as a pre-processing step of the downstream NLP tasks at the minimum cost.
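The teacher-student pseudo-labeling idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract describes neural teacher and student models, whereas here the teacher is a hypothetical lexicon-based forward maximum matcher and the student is a hypothetical per-character tag lookup. The B/M/E/S tagging scheme, the confidence score, the `threshold` parameter, and every function name are assumptions made only for illustration.

```python
from collections import Counter, defaultdict


def words_to_bmes(words):
    """Convert a list of words into per-character B/M/E/S tags."""
    tags = []
    for w in words:
        if len(w) == 1:
            tags.append("S")
        else:
            tags.extend(["B"] + ["M"] * (len(w) - 2) + ["E"])
    return tags


def bmes_to_words(chars, tags):
    """Rebuild words from characters and their B/M/E/S tags."""
    words, buf = [], ""
    for ch, t in zip(chars, tags):
        buf += ch
        if t in ("E", "S"):
            words.append(buf)
            buf = ""
    if buf:  # flush an unfinished word at the end of the sentence
        words.append(buf)
    return words


def toy_teacher(sentence, lexicon, max_len=4):
    """Hypothetical teacher: forward maximum matching over a lexicon.

    Returns (words, confidence), where confidence is the share of
    characters covered by lexicon words (an assumed stand-in for a
    neural model's confidence).
    """
    if not sentence:
        return [], 0.0
    words, i, covered = [], 0, 0
    while i < len(sentence):
        for n in range(min(max_len, len(sentence) - i), 0, -1):
            cand = sentence[i:i + n]
            if n == 1 or cand in lexicon:
                if cand in lexicon:
                    covered += n
                words.append(cand)
                i += n
                break
    return words, covered / len(sentence)


def make_pseudo_labels(unlabeled, lexicon, threshold=0.8):
    """Label unlabeled sentences with the teacher; keep confident ones."""
    pseudo = []
    for sent in unlabeled:
        words, conf = toy_teacher(sent, lexicon)
        if conf >= threshold:
            pseudo.append((sent, words_to_bmes(words)))
    return pseudo


def train_student(examples):
    """Hypothetical lightweight student: most frequent tag per character."""
    counts = defaultdict(Counter)
    for sent, tags in examples:
        for ch, t in zip(sent, tags):
            counts[ch][t] += 1
    return {ch: c.most_common(1)[0][0] for ch, c in counts.items()}


def student_segment(model, sentence):
    """Segment with the student; unseen characters default to 'S'."""
    tags = [model.get(ch, "S") for ch in sentence]
    return bmes_to_words(sentence, tags)


lexicon = {"我们", "喜欢", "北京", "天安门"}
gold = [("我们喜欢北京", words_to_bmes(["我们", "喜欢", "北京"]))]
unlabeled = ["北京天安门", "他去了上海"]

pseudo = make_pseudo_labels(unlabeled, lexicon)  # only the first sentence passes
student = train_student(gold + pseudo)           # labeled + pseudo-labeled data
print(student_segment(student, "我们喜欢天安门"))
```

In this toy setup the student only learns the word 天安门 from the pseudo-labeled sentence, which mirrors, in a very simplified way, how unlabeled data can extend what a small model sees during training.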
Pages: 4369-4381
Page count: 13