ChatGPT vs state-of-the-art models: a benchmarking study in keyphrase generation task

Cited by: 2
Authors
Martinez-Cruz, Roberto [1, 2]
Lopez-Lopez, Alvaro J. [2 ]
Portela, Jose [2 ]
Affiliations
[1] Moodys Analyt, Moodys Data Solut, Prague, Czech Republic
[2] Comillas Pontifical Univ, Inst Res Technol, ICAI Sch Engn, Madrid, Spain
Keywords
ChatGPT; Text generation; Keyphrase generation; Natural language processing; Deep learning; Domain adaptation; Long documents; Large language models;
DOI
10.1007/s10489-024-05901-4
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Transformer-based language models, including ChatGPT, have demonstrated exceptional performance in various natural language generation tasks. However, there has been limited research evaluating ChatGPT's keyphrase generation ability, which involves identifying informative phrases that accurately reflect a document's content. This study seeks to address this gap by comparing ChatGPT's keyphrase generation performance with state-of-the-art models, while also testing its potential as a solution for two significant challenges in the field: domain adaptation and keyphrase generation from long documents. We conducted experiments on eight publicly available datasets spanning scientific, news, and biomedical domains, analyzing performance across both short and long documents. Our results show that ChatGPT outperforms current state-of-the-art models in all tested datasets and environments, generating high-quality keyphrases that adapt well to diverse domains and document lengths.
Pages: 25