Tweet2Story: Extracting Narratives from Twitter

被引：0

作者：

Campos, Vasco ^{[1
,2
]}

Campos, Ricardo ^{[1
,3
]}

Jorge, Alipio ^{[1
,2
]}

机构：

[1] INESC TEC, Porto, Portugal

[2] Univ Porto, Fac Ciencias, Porto, Portugal

[3] Univ Beira Interior, Porto, Portugal

来源：

PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2023, PT I | 2023年 / 14115卷

关键词：

Narrative extraction; Open information extraction; Twitter;

D O I：

10.1007/978-3-031-49008-8_30

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Topics discussed on social media platforms contain a disparate amount of information written in colloquial language, making it difficult to understand the narrative of the topic. In this paper, we take a step forward, towards the resolution of this problem by proposing a framework that performs the automatic extraction of narratives from a document, such as tweet posts. To this regard, we propose a methodology that extracts information from the texts through a pipeline of tasks, such as co-reference resolution and the extraction of entity relations. The result of this process is embedded into an annotation file to be used by subsequent operations, such as visualization schemas. We named this framework Tweet2Story and measured its effectiveness under an evaluation schema that involved three different aspects: (i) as an Open Information extraction (OpenIE) task, (ii) by comparing the narratives of manually annotated news articles linked to tweets about the same topic and (iii) by comparing their knowledge graphs, produced by the narratives, in a qualitative way. The results obtained show a high precision and a moderate recall, on par with other OpenIE state-of-the-art frameworks and confirm that the narratives can be extracted from small texts. Furthermore, we show that the narrative can be visualized in an easily understandable way.

引用

页码：378 / 388

页数：11