Using large language models to create narrative events

Cited by: 0

Authors:

Bartalesi, Valentina [1]

Lenzi, Emanuele [1,2]

De Martino, Claudio [1]

Affiliations:

[1] Natl Res Council Italy CNR, Inst Informat Sci & Technol Alessandro Faedo ISTI, Pisa, Italy

[2] Univ Pisa, Dept Informat Engn DII, Pisa, Italy

Source:

PEERJ COMPUTER SCIENCE | 2024 / Volume 10

Keywords:

Large language models; Narratives; Events; Semantic web; Digital humanities; SCIENCE;

DOI:

10.7717/peerj-cs.2242

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Narratives play a crucial role in human communication, serving as a means to convey experiences, perspectives, and meanings across various domains. They are particularly significant in scientific communities, where narratives are often utilized to explain complex phenomena and share knowledge. This article explores the possibility of integrating large language models (LLMs) into a workflow that, exploiting Semantic Web technologies, transforms raw textual data gathered by scientific communities into narratives. In particular, we focus on using LLMs to automatically create narrative events, maintaining the reliability of the generated texts. The study provides a conceptual definition of narrative events and evaluates the performance of several smaller LLMs against the requirements we identified. A key aspect of the experiment is the emphasis on maintaining the integrity of the original narratives in the LLM outputs, as experts often review texts produced by scientific communities to ensure their accuracy and reliability. We first perform an evaluation on a corpus of five narratives and then on a larger dataset comprising 124 narratives. LLaMA 2 is identified as the most suitable model for generating high-quality narrative events that closely align with the input texts. Prompt engineering techniques are then employed to enhance the performance of the selected model, leading to further improvements in the quality of the generated texts.

Pages: 17