BERT-based Lexical Substitution

被引:0
作者
Zhou, Wangchunshu [1 ]
Ge, Tao [2 ]
Xu, Ke [1 ]
Wei, Furu [2 ]
Zhou, Ming [2 ]
机构
[1] Beihang Univ, Beijing, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
来源
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019) | 2019年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous studies on lexical substitution tend to obtain substitute candidates by finding the target word's synonyms from lexical resources (e.g., WordNet) and then rank the candidates based on its contexts. These approaches have two limitations: (1) They are likely to overlook good substitute candidates that are not the synonyms of the target words in the lexical resources; (2) They fail to take into account the substitution's influence on the global context of the sentence. To address these issues, we propose an end-to-end BERT-based lexical substitution approach which can propose and validate substitute candidates without using any annotated data or manually curated resources. Our approach first applies dropout to the target word's embedding for partially masking the word, allowing BERT to take balanced consideration of the target word's semantics and contexts for proposing substitute candidates, and then validates the candidates based on their substitution's influence on the global contextualized representation of the sentence. Experiments show our approach performs well in both proposing and ranking substitute candidates, achieving the state-of-the-art results in both LS07 and LS14 benchmarks.
引用
收藏
页码:3368 / 3373
页数:6
相关论文
共 20 条
[1]  
Apidianaki M., 2016, Proceedings_of_the_2016_Conference_on_Empirical_Methods_in_Natural_Language Processing_(EMNLP), P2028
[2]   Creating a system for lexical substitutions from scratch using crowdsourcing [J].
Biemann, Chris .
LANGUAGE RESOURCES AND EVALUATION, 2013, 47 (01) :97-122
[3]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[4]  
Dinu Georgiana., 2010, Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, P1162
[5]  
Erk K., 2008, P C EMPIRICAL METHOD, P897
[6]  
Hassan S, 2007, APPLIED MATHEMATICS FOR SCIENCE AND ENGINEERING, P412
[7]  
Hintz G, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P118
[8]  
Kishida K, 2005, PROPERTY AVERAGE PRE
[9]  
Kremer Gerhard, 2014, P 14 C EUR CHAPT ASS, P540
[10]  
McCarthy Diana, 2007, P 4 INT WORKSH SEM E