Evaluation of Question-Answering Based Text Summarization using LLM

被引:0
作者
Ding, Junhua [1 ]
Huyen Nguyen [1 ]
Chen, Haihua [1 ]
机构
[1] Univ North Texas, Dept Informat Sci, Denton, TX 76203 USA
来源
2024 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING, AITEST | 2024年
关键词
text summarization; generative AI; question-answering; large language model;
D O I
10.1109/AITest62860.2024.00025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question-answering based text summarization can produce personalized and specific summaries; however, the primary challenge is the generation and selection of questions that users expect the summary to answer. Large language models (LLMs) provide an automatic method for generating these questions from the original text. By prompting the LLM to answer these selected questions based on the original text, high-quality summaries can be produced. In this paper, we experiment with an approach for question generation, selection, and text summarization using the LLM tool GPT4o. We also conduct a comparative study of existing summarization approaches and evaluation metrics to understand how to produce personalized and useful summaries. Based on the experiment results, we explain why question-answering based text summarization achieves better performance.
引用
收藏
页码:142 / 149
页数:8
相关论文
共 32 条
  • [1] Achiam OJ, 2023, Arxiv, DOI [arXiv:2303.08774, 10.48550/arXiv.2303.08774, DOI 10.48550/ARXIV.2303.08774]
  • [2] Cajueiro DO, 2023, Arxiv, DOI arXiv:2301.03403
  • [3] Cao M, 2022, Arxiv, DOI arXiv:2204.09519
  • [4] Chen S, 2021, 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), P5935
  • [5] Chin-Yew Lin, 2004, Text Summarization Branches Out, P74
  • [6] Towards Question-Answering as an Automatic Metric for Evaluating the Content Quality of a Summary
    Deutsch, Daniel
    Bedrax-Weiss, Tania
    Roth, Dan
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2021, 9 : 774 - 789
  • [7] Automatic text summarization: A comprehensive survey
    El-Kassas, Wafaa S.
    Salama, Cherif R.
    Rafea, Ahmed A.
    Mohamed, Hoda K.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 165
  • [8] Eyal M, 2019, Arxiv, DOI arXiv:1906.00318
  • [9] Flesch Rudolf Franz, 1979, How to write plain English: A book for lawyers and consumers
  • [10] Graham Y., 2015, P 2015 C EMPIRICAL M, P128