HugNLP: A Unified and Comprehensive Library for Natural Language Processing

被引:4
作者
Wang, Jianing [1 ]
Chen, Nuo [1 ]
Sun, Qiushi [2 ]
Huang, Wenkang [3 ]
Wang, Chengyu [4 ]
Gao, Ming [1 ]
机构
[1] East China Normal Univ, Shanghai, Peoples R China
[2] Natl Univ Singapore, Singapore, Singapore
[3] Ant Grp, Shanghai, Peoples R China
[4] Alibaba Grp, Hangzhou, Peoples R China
来源
PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023 | 2023年
关键词
Natural Language Processing; Pre-trained Language Models; Deep Learning Framework;
D O I
10.1145/3583780.3614742
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we introduce HugNLP, a unified and comprehensive library for natural language processing (NLP) with the prevalent backend of Hugging Face Transformers, which is designed for NLP researchers to easily utilize off-the-shelf algorithms and develop novel methods with user-defined models and tasks in real-world scenarios. HugNLP consists of a hierarchical structure including models, processors and applications that unifies the learning process of pre-trained language models (PLMs) on different NLP tasks. Additionally, we present some featured NLP applications to show the effectiveness of HugNLP, such as knowledge-enhanced PLMs, universal information extraction, low-resource mining, and code understanding and generation, etc. The source code will be released on GitHub (https://github.com/HugAILab/HugNLP).
引用
收藏
页码:5111 / 5116
页数:6
相关论文
共 45 条
[1]  
Amini Massih-Reza, 2022, ABS220212040 CORR
[2]  
[Anonymous], 2022, AAAI
[3]  
Ben-Zaken E, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, P1
[4]  
Brown TB, 2020, ARXIV, DOI DOI 10.48550/ARXIV.2005.14165
[5]   Learning from labeled and unlabeled data: An empirical study across techniques and domains [J].
Chawla, N ;
Karakoulas, G .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2005, 23 :331-366
[6]  
Cui YM, 2020, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, P657
[7]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[8]  
Dong Qingxiu, 2023, ABS230100234 CORR, DOI [10.48550/arXiv.2301.00234, DOI 10.48550/ARXIV.2301.00234]
[9]  
Feng ZY, 2020, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, P1536
[10]  
Gal Y, 2016, PR MACH LEARN RES, V48