Increasing Accessibility of Language Models with Multi-stage Information Extraction

被引:4
|
作者
Czejdo, Conrad [1 ]
Bhattacharya, Sambit [1 ]
机构
[1] Fayetteville State Univ, Dept Math & Comp Sci, Fayetteville, NC 28301 USA
基金
美国国家科学基金会;
关键词
Deep Learning (DL); Natural Language Processing (NLP); Language Models (LM); one-shot learning; API;
D O I
10.12720/jait.13.2.181-185
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The capabilities of Language Models (LMs) have continued to increase in recent years, as have their computational requirements. Widely available APIs have also become available. These APIs present new challenges for ease of gradient based fine-tuning by users, resulting in the use models which may be larger than necessary and more expensive, therefore reducing accessibility. In this paper, we present a new methodology for increasing performance of single-shot LMs by chaining multiple smaller LMs. Additionally, as the derived representation is in plain-text it is readily human interpretable. We show that optimizing the context which leads to this derived representation results in improved performance and reduced cost.
引用
收藏
页码:181 / 185
页数:5
相关论文
共 50 条
  • [1] Multi-stage guided code generation for Large Language Models
    Han, Yewei
    Lyu, Chen
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 139
  • [2] ENGINEERING MODELS OF OIL REFINING: INCREASING THE EFFICIENCY OF MULTI-STAGE GASOLINE PRODUCTION
    Ivashkina, Elena N.
    Koksharov, Anton G.
    Ivanchina, Emiliya D.
    Chuzlov, Vyacheslav A.
    Nazarova, Galina Y.
    Chernyakova, Ekaterina S.
    Dolganov, Igor M.
    BULLETIN OF THE TOMSK POLYTECHNIC UNIVERSITY-GEO ASSETS ENGINEERING, 2023, 334 (04): : 195 - 208
  • [3] A Multi-stage Approach to Curve Extraction
    Guo, Yuliang
    Kumar, Naman
    Narayanan, Maruthi
    Kimia, Benjamin
    COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 : 663 - 678
  • [4] Multi-stage Chinese collocation extraction
    Xu, RF
    Lu, Q
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 3254 - 3259
  • [5] Multi-stage models of cancer and disease
    Webster, Anthony
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2021, 50
  • [6] Character string extraction by multi-stage relaxation
    Hase, H
    Shinokawa, T
    Yoneda, M
    Sakai, M
    Maruyama, H
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, 1997, : 298 - 302
  • [7] A multi-stage chiral extraction model.
    Koska, J
    Haynes, CA
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1997, 213 : 160 - IEC
  • [8] A multi-stage Chinese collocation extraction system
    Xu, Ruifeng
    Lu, Qin
    ADVANCES IN MACHINE LEARNING AND CYBERNETICS, 2006, 3930 : 740 - 749
  • [9] DYNAMICS AND CONTROL OF MULTI-STAGE LIQUID EXTRACTION
    CADMAN, TW
    HSU, CK
    TRANSACTIONS OF THE INSTITUTION OF CHEMICAL ENGINEERS AND THE CHEMICAL ENGINEER, 1970, 48 (7-10): : T209 - &
  • [10] Multi-stage Vs Single-Stage: A Local Information Focused Approach for Overlapping Event Extraction
    Han, Shuaihu
    Yang, Guohua
    Zhang, Dawei
    Tao, Jianhua
    Che, Feihu
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT VII, 2024, 15022 : 277 - 291