Mining arguments in scientific abstracts with discourse-level embeddings

Cited by: 13
Authors
Accuosto, Pablo [1]
Saggion, Horacio [1]
Affiliations
[1] Univ Pompeu Fabra, Dept Informat & Commun Technol, TALN Grp, Large Scale Text Understanding Syst Lab LaSTUS, C Tanger 122-140, Barcelona 08018, Spain
Keywords
Abstracting - Computational linguistics;
DOI
10.1016/j.datak.2020.101840
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Argument mining consists of the automatic identification of argumentative structures in texts. In this work we leverage existing discourse-level annotations to facilitate the identification of argumentative components and relations in scientific texts, which has been recognized as a particularly challenging task. We propose a new annotation schema and use it to augment a corpus of computational linguistics abstracts that had previously been annotated with discourse units and relations. Our initial experiments with the enriched corpus confirm the potential value of incorporating discourse information in argument mining tasks. To tackle the limitations posed by the lack of corpora containing both discourse and argumentative annotations, we explore two transfer learning approaches in which discourse parsing is used as an auxiliary task when training argument mining models. In this case, as no discourse information is used as input, the resulting models could be used to predict the argumentative structure of unannotated texts.
Pages: 16