Transfer knowledge for punctuation prediction via adversarial training

被引:0
作者
Yi, Jiangyan [1 ]
Tao, Jianhua [1 ,2 ,3 ]
Bai, Ye [1 ,2 ]
Tian, Zhengkun [1 ,2 ]
Fan, Cunhang [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[3] 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Adversarial multi-task learning; Multi-modal embeddings; Part-of-speech tagging; Orthogonality constraints; Punctuation prediction; MODELS;
D O I
10.1016/j.specom.2023.03.003
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Previous studies demonstrate that part-of-speech (POS) tags are helpful for punctuation restoration tasks. However, extra computation cost will be needed during decoding, due to POS tags are provided by an external POS tagger. This paper proposes to transfer knowledge via adversarial training and orthogonality constraints to fill in the gap. Adversarial multi-task learning is introduced to learn task invariant knowledge from the extra POS tagging task for a punctuation prediction task. Furthermore, orthogonality constraints are used to make private and shared features dissimilar. Only the punctuation predicting task is used during decoding. So extra computation is not needed. Experiments are conducted on IWSLT2011 datasets. The results show that the punctuation predicting models trained with adversarial learning obtain performance gains over the baseline models on test sets. The results also demonstrate that the models trained with orthogonality constraints further obtain performance improvement.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] Alam Tanvirul, 2020, P 6 WORKSHOP NOISY U, P132, DOI DOI 10.18653/V1/2020.WNUT-1.18
  • [3] Andrew M., P INT C INT C MACH L, V30, P1
  • [4] Beeferman D, 1998, INT CONF ACOUST SPEE, P689, DOI 10.1109/ICASSP.1998.675358
  • [5] Bousmalis K, 2016, ADV NEUR IN, V29
  • [6] Multitask learning
    Caruana, R
    [J]. MACHINE LEARNING, 1997, 28 (01) : 41 - 75
  • [7] Che XY, 2016, LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P654
  • [8] Sentence Boundary Detection Based on Parallel Lexical and Acoustic Models
    Che, Xiaoyin
    Luo, Sheng
    Yang, Haojin
    Meinel, Christoph
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2528 - 2532
  • [9] Adversarial Multi-Criteria Learning for Chinese Word Segmentation
    Chen, Xinchi
    Shi, Zhan
    Qiu, Xipeng
    Huang, Xuanjing
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1193 - 1203
  • [10] Cho E, 2015, 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, P3650