Pashto poetry generation: deep learning with pre-trained transformers for low-resource languages

被引:0
|
作者
Ullah, Imran [1 ]
Ullah, Khalil [1 ]
Khan, Hamad [1 ]
Aurangzeb, Khursheed [2 ]
Anwar, Muhammad Shahid [3 ]
Syed, Ikram [3 ]
机构
[1] Univ Malakand, Software Engn, Chakdara, Pakistan
[2] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Riyadh, Saudi Arabia
[3] Gachon Univ, Dept AI & Software, Seongnam Si, South Korea
关键词
Machine learning; Deep learning; LaMini-Cerebras-590M; Bloomz-560m; Natural language processing; Poetry generation;
D O I
10.7717/peerj-cs.2163
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generating poetry using machine and deep learning techniques has been a challenging and exciting topic of research in recent years. It has significance fi cance in natural language processing and computational linguistics. This study introduces an innovative approach to generate high-quality Pashto poetry by leveraging two pre- trained transformer models, LaMini-Cerebras-590M and bloomz-560m. The models were trained on an extensive new and quality Pashto poetry dataset to learn the underlying complex patterns and structures. The trained models are then used to generate new Pashto poetry by providing them with a seed text or prompt. To evaluate the quality of the generated poetry, we conducted both subjective and objective evaluations, including human evaluation. The experimental results demonstrate that the proposed approach can generate Pashto poetry that is comparable in quality to human-generated poetry. The study provides a valuable contribution to the fi eld of Pashto language and poetry generation and has potential applications in natural language processing and computational linguistics.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Text data augmentation and pre-trained Language Model for enhancing text classification of low-resource languages
    Ziyaden, Atabay
    Yelenov, Amir
    Hajiyev, Fuad
    Rustamov, Samir
    Pak, Alexandr
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [2] Enriching the Transfer Learning with Pre-Trained Lexicon Embedding for Low-Resource Neural Machine Translation
    Mieradilijiang Maimaiti
    Yang Liu
    Huanbo Luan
    Maosong Sun
    TsinghuaScienceandTechnology, 2022, 27 (01) : 150 - 163
  • [3] Enriching the Transfer Learning with Pre-Trained Lexicon Embedding for Low-Resource Neural Machine Translation
    Maimaiti, Mieradilijiang
    Liu, Yang
    Luan, Huanbo
    Sun, Maosong
    TSINGHUA SCIENCE AND TECHNOLOGY, 2022, 27 (01) : 150 - 163
  • [4] Ensemble Learning with Pre-Trained Transformers for Crash Severity Classification: A Deep NLP Approach
    Jaradat, Shadi
    Nayak, Richi
    Paz, Alexander
    Elhenawy, Mohammed
    ALGORITHMS, 2024, 17 (07)
  • [5] Deep Learning-based POS Tagger and Chunker for Odia Language Using Pre-trained Transformers
    Dalai, Tusarkanta
    Kumarmishra, Tapas
    Sa, Andpankaj K.
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (02)
  • [6] Classification and Analysis of Pistachio Species with Pre-Trained Deep Learning Models
    Singh, Dilbag
    Taspinar, Yavuz Selim
    Kursun, Ramazan
    Cinar, Ilkay
    Koklu, Murat
    Ozkan, Ilker Ali
    Lee, Heung-No
    ELECTRONICS, 2022, 11 (07)
  • [7] A Deep Learning Sentiment Analyser for Social Media Comments in Low-Resource Languages
    Kastrati, Zenun
    Ahmedi, Lule
    Kurti, Arianit
    Kadriu, Fatbardh
    Murtezaj, Doruntina
    Gashi, Fatbardh
    ELECTRONICS, 2021, 10 (10)
  • [8] Biologically Inspired Design Concept Generation Using Generative Pre-Trained Transformers
    Zhu, Qihao
    Zhang, Xinyu
    Luo, Jianxi
    JOURNAL OF MECHANICAL DESIGN, 2023, 145 (04)
  • [9] Improving stance detection accuracy in low-resource languages: a deep learning framework with ParsBERT
    Rahimi, Mohammad
    Kiani, Vahid
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024, : 517 - 535
  • [10] Unregistered Multiview Mammogram Analysis with Pre-trained Deep Learning Models
    Carneiro, Gustavo
    Nascimento, Jacinto
    Bradley, Andrew P.
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 652 - 660