Data-efficient Neural Text Compression with Interactive Learning

Cited by: 0
Authors
Avinesh, P. V. S. [1]
Meyer, Christian M. [1]
Affiliation
[1] Tech Univ Darmstadt, Res Training Grp AIPHES, Dept Comp Sci, Darmstadt, Germany
Source
2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1 | 2019
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Neural sequence-to-sequence models have been successfully applied to text compression. However, these models were trained on huge automatically induced parallel corpora, which are only available for a few domains and tasks. In this paper, we propose a novel interactive setup for neural text compression that enables transferring a model to new domains and compression tasks with minimal human supervision. This is achieved by employing active learning, which intelligently samples from a large pool of unlabeled data. Using this setup, we can successfully adapt a model trained on a small dataset of 40k samples for a headline generation task to a general text compression dataset at an acceptable compression quality with just 500 sampled instances annotated by a human.
Pages: 2543-2554
Number of pages: 12