Data-efficient Neural Text Compression with Interactive Learning

Cited by: 0

Authors:

Avinesh, P. V. S. [1]

Meyer, Christian M. [1]

Affiliations:

[1] Tech Univ Darmstadt, Res Training Grp AIPHES, Dept Comp Sci, Darmstadt, Germany

Source:

2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1 | 2019

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Neural sequence-to-sequence models have been successfully applied to text compression. However, these models were trained on huge automatically induced parallel corpora, which are only available for a few domains and tasks. In this paper, we propose a novel interactive setup to neural text compression that enables transferring a model to new domains and compression tasks with minimal human supervision. This is achieved by employing active learning, which intelligently samples from a large pool of unlabeled data. Using this setup, we can successfully adapt a model trained on small data of 40k samples for a headline generation task to a general text compression dataset at an acceptable compression quality with just 500 sampled instances annotated by a human.
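The abstract's idea of sampling a small, informative set from a large unlabeled pool can be illustrated with uncertainty sampling, one common active learning strategy. This is a minimal sketch, not the authors' method: the entropy-based score, the function names, and the `toy_predict` stand-in for a trained compression model are all assumptions made for illustration.

```python
import math
from typing import Callable, List, Sequence

# A prediction is one probability distribution per input token
# (for example, over the labels "keep" and "delete").
Prediction = Sequence[Sequence[float]]


def token_entropy(probs: Sequence[float]) -> float:
    """Shannon entropy (in nats) of a single predicted distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def sentence_uncertainty(prediction: Prediction) -> float:
    """Mean per-token entropy; higher means the model is less certain."""
    if not prediction:
        return 0.0
    return sum(token_entropy(d) for d in prediction) / len(prediction)


def select_for_annotation(
    pool: List[str],
    predict: Callable[[str], Prediction],
    budget: int,
) -> List[str]:
    """Return the `budget` pool sentences the model is least certain about."""
    ranked = sorted(
        pool,
        key=lambda sentence: sentence_uncertainty(predict(sentence)),
        reverse=True,
    )
    return ranked[:budget]


def toy_predict(sentence: str) -> Prediction:
    # Hypothetical stand-in for a trained model: it pretends to be unsure
    # about every token longer than four characters.
    return [
        [0.5, 0.5] if len(token) > 4 else [0.99, 0.01]
        for token in sentence.split()
    ]


pool = ["the cat sat", "parliament adjourned yesterday", "a b c"]
picked = select_for_annotation(pool, toy_predict, budget=1)
```

In an interactive setup of the kind the abstract describes, the selected sentences would go to a human annotator, the model would be updated on the new labels, and the selection would repeat until the annotation budget (500 instances in the abstract) is spent.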


Pages: 2543-2554

Page count: 12

