Natural language generation from Universal Dependencies using data augmentation and pre-trained language models

Cited by: 0
Authors
Nguyen D.T. [1 ]
Tran T. [1 ]
Affiliations
[1] Saigon University, Ho Chi Minh City
Keywords
data augmentation; data-to-text generation; deep learning; fine-tuning; pre-trained language models; sequence-to-sequence models; Universal Dependencies
DOI
10.1504/IJIIDS.2023.10053426
Abstract
Natural language generation (NLG) has focused in recent years on data-to-text tasks with various forms of structured input. The generated text should convey the given information, be grammatically correct, and satisfy other quality criteria. In this research we propose an approach that combines strong pre-trained language models with input data augmentation. The data studied in this work are Universal Dependencies (UD) structures; UD is a framework for consistent annotation of grammar (parts of speech, morphological features, and syntactic dependencies) designed for cross-lingual learning. We study English UD structures, modified into two groups. In the first group, the modification removes the order information of each word and lemmatises the tokens. In the second group, it removes functional words and surface-oriented morphological details. To both groups of modified structures we apply the same approach, exploring how the pre-trained sequence-to-sequence models text-to-text transfer transformer (T5) and BART perform on the training data. We augment the training data by creating several permutations of each input structure. The results show that our approach can generate good-quality English text and motivate further study of strategies for representing UD inputs. Copyright © 2023 Inderscience Enterprises Ltd.
Pages: 89-105
Number of pages: 16
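
To make the augmentation step described in the abstract concrete, the following is a minimal Python sketch of permutation-based augmentation over a linearised UD input, assuming a simplified (lemma, deprel) token representation; the function names and the linearisation format are illustrative assumptions, not the authors' published code.

import random

def linearize(tokens):
    # Flatten (lemma, deprel) pairs into one input string for a seq2seq model.
    return " ".join(f"{lemma}|{deprel}" for lemma, deprel in tokens)

def augment_with_permutations(tokens, n_variants=3, seed=0):
    # Create several shuffled variants of one UD input structure; since the
    # modified structures carry no word-order information, each permutation
    # is an equally valid linearisation of the same sentence.
    rng = random.Random(seed)
    variants = []
    for _ in range(n_variants):
        shuffled = list(tokens)
        rng.shuffle(shuffled)
        variants.append(linearize(shuffled))
    return variants

# Hypothetical example: a lemmatised structure with order information removed.
tokens = [("dog", "nsubj"), ("bark", "root"), ("loudly", "advmod")]
for variant in augment_with_permutations(tokens):
    print(variant)

Each shuffled variant is paired with the same target sentence, so a T5 or BART model fine-tuned on the augmented pairs learns to generate the text without relying on input token order.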