Unsupervised Text Generation by Learning from Search

被引:0
作者
Li, Jingjing [1 ]
Li, Zichao [2 ]
Mou, Lili [3 ,4 ]
Jiang, Xin [2 ]
Lyu, Michael R. [1 ]
King, Irwin [1 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Huawei Noahs Ark Lab, Montreal, PQ, Canada
[3] Univ Alberta, Edmonton, AB, Canada
[4] Alberta Machine Intelligence Inst Amii, Edmonton, AB, Canada
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020年 / 33卷
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we propose TGLS, a novel framework for unsupervised Text Generation by Learning from Search. We start by applying a strong search algorithm (in particular, simulated annealing) towards a heuristically defined objective that (roughly) estimates the quality of sentences. Then, a conditional generative model learns from the search results, and meanwhile smooth out the noise of search. The alternation between search and learning can be repeated for performance bootstrapping. We demonstrate the effectiveness of TGLS on two real-world natural language generation tasks, unsupervised paraphrasing and text formalization. Our model significantly outperforms unsupervised baseline methods in both tasks. Especially, it achieves comparable performance to strong supervised methods for paraphrase generation.(1)
引用
收藏
页数:12
相关论文
共 58 条
  • [1] Amplayo RK, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P1934
  • [2] [Anonymous], 2016, P 20 SIGNLL C COMPUT
  • [3] Bao Y, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P6008
  • [4] Chang KW, 2015, PR MACH LEARN RES, V37, P2058
  • [5] Chu Eric, 2018, ARXIV181005739
  • [6] Daume III H., 2005, P 22 INT C MACH LEAR, P169, DOI [DOI 10.1145/1102351.1102373, 10.1145/1102351.1102373]
  • [7] Du W., 2019, IJCNLP, P6014
  • [8] Fu Y., 2019, ADV NEUR IN, P13623
  • [9] Fu ZX, 2018, AAAI CONF ARTIF INTE, P663
  • [10] Goyal K, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P1724