Unsupervised Text Generation by Learning from Search

Cited by: 0
Authors
Li, Jingjing [1]
Li, Zichao [2]
Mou, Lili [3, 4]
Jiang, Xin [2]
Lyu, Michael R. [1]
King, Irwin [1]
Affiliations
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Huawei Noahs Ark Lab, Montreal, PQ, Canada
[3] Univ Alberta, Edmonton, AB, Canada
[4] Alberta Machine Intelligence Inst Amii, Edmonton, AB, Canada
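Source
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020 / Vol. 33
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC);
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code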
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this work, we propose TGLS, a novel framework for unsupervised Text Generation by Learning from Search. We start by applying a strong search algorithm (in particular, simulated annealing) towards a heuristically defined objective that (roughly) estimates the quality of sentences. Then, a conditional generative model learns from the search results, and meanwhile smooths out the noise of search. The alternation between search and learning can be repeated for performance bootstrapping. We demonstrate the effectiveness of TGLS on two real-world natural language generation tasks, unsupervised paraphrasing and text formalization. Our model significantly outperforms unsupervised baseline methods in both tasks. Notably, it achieves performance comparable to strong supervised methods for paraphrase generation.
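The search step described in the abstract can be illustrated with a minimal Python sketch of simulated annealing over word-level edits. Everything here is an assumption made for illustration: `toy_score` is a hypothetical stand-in for the heuristically defined objective (it is not the paper's scorer), and the edit operations, temperature schedule, and function names are likewise invented. The final helper shows how search outputs might be collected as pseudo-parallel pairs for a conditional generative model to learn from.

```python
import math
import random


def toy_score(candidate, source):
    """Hypothetical objective (NOT the paper's scorer): reward word overlap
    with the source and penalize words copied in the same position."""
    src, cand = set(source), set(candidate)
    overlap = len(src & cand) / len(src | cand)
    copied = sum(a == b for a, b in zip(candidate, source)) / max(len(source), 1)
    return overlap - 0.5 * copied


def propose(candidate, vocab, rng):
    """Apply one random word-level edit: replace, insert, or delete."""
    words = list(candidate)
    op = rng.choice(["replace", "insert", "delete"])
    if op == "delete" and len(words) > 1:
        del words[rng.randrange(len(words))]
    elif op == "insert":
        words.insert(rng.randrange(len(words) + 1), rng.choice(vocab))
    else:
        # Also the fallback when a delete would empty the sentence.
        words[rng.randrange(len(words))] = rng.choice(vocab)
    return words


def simulated_annealing(source, vocab, steps=200, t_init=1.0, t_decay=0.98, seed=0):
    """Search for a high-scoring rewrite of `source` (a non-empty word list)."""
    rng = random.Random(seed)
    current = list(source)
    current_score = toy_score(current, source)
    best, best_score = current, current_score
    temperature = t_init
    for _ in range(steps):
        proposal = propose(current, vocab, rng)
        proposal_score = toy_score(proposal, source)
        delta = proposal_score - current_score
        # Always accept improvements; accept worse proposals with a
        # probability that shrinks as the temperature cools.
        if delta >= 0 or rng.random() < math.exp(delta / temperature):
            current, current_score = proposal, proposal_score
            if current_score > best_score:
                best, best_score = current, current_score
        temperature = max(temperature * t_decay, 1e-3)
    return best, best_score


def build_pseudo_pairs(sentences, vocab):
    """Collect (input, search output) pairs as pseudo-parallel training data."""
    return [(s, simulated_annealing(s, vocab)[0]) for s in sentences]
```

In a fuller pipeline, the pairs returned by `build_pseudo_pairs` would be used to train a sequence-to-sequence model, whose outputs could then seed another round of search, which is one way to read the alternation the abstract describes.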
Pages: 12