Web Chat-based Application with Large Language Model and Transformers from Hugging Face for Self-Learning on Storytelling Skills

被引:0
作者
Agatha, Victoria [1 ]
Setyawan, Iwan [1 ]
机构
[1] Satya Wacana Christian Univ, Fac Elect & Comp Engn, Salatiga, Indonesia
来源
2024 INTERNATIONAL ELECTRONICS SYMPOSIUM, IES 2024 | 2024年
关键词
Large Language Model; Storytelling; Hugging Face;
D O I
10.1109/IES63037.2024.10665795
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The ability of storytelling greatly affects a person's success. These abilities are usually taught by parents at home or teachers at school. With the development of Artificial Intelligent technology, it is now possible to automatically generate stories using Large Language Model (LLM) which can understand and create language like humans. In this paper, the authors propose a self-learning system in storytelling by utilizing and combining four models from Hugging Face Hub. The proposed system is a web chat-based application so that users can communicate with LLM where LLM has received an image input from the users. The four models are as follows. Falcon 7B Instruct model as LLM that gets caption information from BLIP Image Captioning Large model. Its responses in the form of text can be read by the users and can be heard through audio synthesized by the MMS TTS Eng model. The user can also see the detected objects in the image which is detected by DETR ResNet 50 model. Our experiments show that the proposed system is sufficient to produce a good story and fit the context of the image, with an average user score of 89.76.
引用
收藏
页码:614 / 618
页数:5
相关论文
共 17 条
  • [1] Agosto D.E., 2013, Storytelling, Self, Society, V9, P53, DOI [https://doi.org/10.13110/storselfsoci.9.1.0053, DOI 10.13110/STORSELFSOCI.9.1.0053]
  • [2] Almazrouei E., 2023, The Falcon Series of Open Language Models, V2, P1
  • [3] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [4] ibm, What is LangChain?
  • [5] Jain S.M., 2022, Introduction to Transformers for NLP: With the Hugging Face Library and Models to Solve Problems
  • [6] Johnson V., Falcon vs. LLaMA: A Comparison of Two Large Language Models
  • [7] ChatGPT for good? On opportunities and challenges of large language models for education
    Kasneci, Enkelejda
    Sessler, Kathrin
    Kuechemann, Stefan
    Bannert, Maria
    Dementieva, Daryna
    Fischer, Frank
    Gasser, Urs
    Groh, Georg
    Guennemann, Stephan
    Huellermeier, Eyke
    Krusche, Stepha
    Kutyniok, Gitta
    Michaeli, Tilman
    Nerdel, Claudia
    Pfeffer, Juergen
    Poquet, Oleksandra
    Sailer, Michael
    Schmidt, Albrecht
    Seidel, Tina
    Stadler, Matthias
    Weller, Jochen
    Kuhn, Jochen
    Kasneci, Gjergji
    [J]. LEARNING AND INDIVIDUAL DIFFERENCES, 2023, 103
  • [8] Li JN, 2022, PR MACH LEARN RES
  • [9] Muchmore M., 2024, What Is Copilot? Microsoft's AI Assistant Explained
  • [10] Pratap V., 2023, Scaling Speech Technology to 1,000+ Languages, V1, P1