Advancing Quality Assessment in Vertical Field: Scoring Calculation for Text Inputs to Large Language Models

Cited by: 0

Authors:

Yi, Jun-Kai [1]

Yao, Yi-Fan [1]

Affiliations:

[1] Beijing Informat Sci & Technol Univ, Coll Automat, Beijing 100192, Peoples R China

Source:

APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / Issue 16

Funding:

National Key Research and Development Program of China; National Natural Science Foundation of China;

Keywords:

VFS evaluation algorithm; text quality; large language models; generative AI;

DOI:

10.3390/app14166955

Chinese Library Classification (CLC) number:

O6 [Chemistry];

Discipline classification code:

0703 ;

Abstract:

With the advent of Transformer-based generative AI, there has been a surge in research focused on large-scale generative language models, especially in natural language processing applications. Moreover, these models have demonstrated immense potential across various vertical fields, ranging from education and history to mathematics, medicine, information processing, and cybersecurity. In research on AI applications in Chinese, the quality of text generated by generative AI has become a central focus of attention. However, the quality of input text remains largely overlooked. Consequently, based on a vectorized comparison against vertical field lexicons and on text structure analysis, this paper proposes three input indicators, D1, D2, and D3, that affect the quality of generation. Building on these indicators, we developed a text quality evaluation algorithm called VFS (Vertical Field Score) and designed an output evaluation metric named V-L (Vertical-Length). Our experiments indicate that higher-scoring input texts enable generative AI to produce more effective outputs. This helps users, particularly those using generative AI for question answering in specific vertical fields, obtain more effective and accurate responses.

References

Pages: 15

24 entries in total

[1] Improving the Reliability of Deep Neural Networks in NLP: A Review [J].

Alshemali, Basemah ;

Kalita, Jugal .

KNOWLEDGE-BASED SYSTEMS, 2020, 191

[2]

Azaria A., 2022, ChatGPT usage and limitations

[3]

Lipton ZC, 2015, Arxiv, DOI [arXiv:1506.00019, DOI 10.48550/ARXIV.1506.00019]

[4]

Cui JX, 2024, Arxiv, DOI arXiv:2306.16092

[5]

Dong XM, 2023, Arxiv, DOI [arXiv:2307.07306, 10.48550/ARXIV.2307.07306]

[6]

Dong YH, 2024, Arxiv, DOI [arXiv:2304.07590, DOI 10.48550/ARXIV.2304.07590]

[7]

Ganesan AV, 2021, 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), P4515, DOI 10.18653/v1/2021.naacl-main.357

[8]

Gillioz Anthony, 2020, 2020 15th Conference on Computer Science and Information Systems (FedCSIS), P179, DOI 10.15439/2020F20

[9]

Johnson R, 2016, INT C MACHINE LEARNI, P526

[10] A review on genetic algorithm: past, present, and future [J].

Katoch, Sourabh ;

Chauhan, Sumit Singh ;

Kumar, Vijay .

MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (05) :8091-8126
