An Industrial Short Text Classification Method Based on Large Language Model and Knowledge Base

被引:0
作者
Yin, Haoran [1 ]
机构
[1] Shanghai Univ Engn Sci, Sch Elect & Elect Engn, Shanghai, Peoples R China
来源
2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024 | 2024年
关键词
short text classification; deep learning; LLM; knowledge base; attention; CNN;
D O I
10.1109/IJCNN60899.2024.10650933
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text classification is a common task in natural language processing tasks. However, in reality, such as industrial datasets, there are numerous short texts that are ignored by existing models, in which important information is inevitably missed due to the short word count. Recent studies tend to enrich the features in short text classification by introducing conceptual information. However, these simple conceptual information is also relatively sparse, with limited feature augment. In this paper, we propose Input Embedding, Knowledge Base Retrieval, Large Language Model Text Augmentation, and Text Encoding for the task of short text classification (IKLT). The four modules utilize the current well-performing forms of Large Language Model and Retrieval Augmentation to further enrich the feature information and have interpretability. Moreover, in addition to some public datasets we establish an industrial short text dataset for comparison experiments. The experimental results show that our proposed framework for short text classification achieves improved results on all five datasets.
引用
收藏
页数:7
相关论文
共 42 条
[1]  
Al Qadi L, 2019, 2019 2ND INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), P238, DOI 10.1109/ictcs.2019.8923073
[2]   Review of short-text classification [J].
Alsmadi, Issa ;
Gan, Keng Hoon .
INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2019, 15 (02) :155-182
[3]  
Chen JD, 2019, AAAI CONF ARTIF INTE, P6252
[4]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[5]  
Douze Matthijs, 2024, arXiv
[6]  
Ferragina P., 2010, P 19 ACM INT C INFOR, P1625, DOI [10.1145/1871437.1871689, DOI 10.1145/1871437.1871689, 10.1145/1871437.]
[7]   Text Classification Research Based on Improved Word2vec and CNN [J].
Gao, Mengyuan ;
Li, Tinghui ;
Huang, Peifang .
SERVICE-ORIENTED COMPUTING, ICSOC 2018, 2019, 11434 :126-135
[8]  
He Pengcheng, 2021, INT C LEARN REPR
[9]  
Hu YB, 2018, I C CONT AUTOMAT ROB, P1432, DOI 10.1109/ICARCV.2018.8581332
[10]  
Jeong Cheonsu, 2023, ARXIV