Large language models (LLMs): survey, technical frameworks, and future challenges

被引:11
|
作者
Kumar, Pranjal [1 ]
机构
[1] Lovely Profess Univ, Sch Comp Sci & Engn, Dept Intelligent Syst, Phagwara 144411, Punjab, India
关键词
Generative language models; Artificial intelligence; Natural language processing; Machine learning; Neural networks; Large language models; ARTIFICIAL-INTELLIGENCE; KNOWLEDGE;
D O I
10.1007/s10462-024-10888-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Artificial intelligence (AI) has significantly impacted various fields. Large language models (LLMs) like GPT-4, BARD, PaLM, Megatron-Turing NLG, Jurassic-1 Jumbo etc., have contributed to our understanding and application of AI in these domains, along with natural language processing (NLP) techniques. This work provides a comprehensive overview of LLMs in the context of language modeling, word embeddings, and deep learning. It examines the application of LLMs in diverse fields including text generation, vision-language models, personalized learning, biomedicine, and code generation. The paper offers a detailed introduction and background on LLMs, facilitating a clear understanding of their fundamental ideas and concepts. Key language modeling architectures are also discussed, alongside a survey of recent works employing LLM methods for various downstream tasks across different domains. Additionally, it assesses the limitations of current approaches and highlights the need for new methodologies and potential directions for significant advancements in this field.
引用
收藏
页数:51
相关论文
共 50 条
  • [31] Reducing the Energy Dissipation of Large Language Models (LLMs) with Approximate Memories
    Gao, Zhen
    Deng, Jie
    Reviriego, Pedro
    Liu, Shanshan
    Lombardi, Fabrizio
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [32] Towards trustworthy LLMs: a review on debiasing and dehallucinating in large language models
    Lin, Zichao
    Guan, Shuyan
    Zhang, Wending
    Zhang, Huiyan
    Li, Yugang
    Zhang, Huaping
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (09)
  • [33] Reinforcement Learning With Large Language Models (LLMs) Interaction For Network Services
    Du, Hongyang
    Zhang, Ruichen
    Niyato, Dusit
    Kang, Jiawen
    Xiong, Zehui
    Kim, Dong In
    2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2024, : 799 - 803
  • [34] Enhancing Accessibility in Software Engineering Projects with Large Language Models (LLMs)
    Aljedaani, Wajdi
    Eler, Marcelo Medeiros
    Parthasarathy, P. D.
    PROCEEDINGS OF THE 56TH ACM TECHNICAL SYMPOSIUM ON COMPUTER SCIENCE EDUCATION, SIGCSE TS 2025, VOL 1, 2025, : 25 - 31
  • [35] Performance of large language models (LLMs) in providing prostate cancer information
    Alasker, Ahmed
    Alsalamah, Seham
    Alshathri, Nada
    Almansour, Nura
    Alsalamah, Faris
    Alghafees, Mohammad
    Alkhamees, Mohammad
    Alsaikhan, Bader
    BMC UROLOGY, 2024, 24 (01):
  • [36] Enhancing Accessibility in Software Engineering Projects with Large Language Models (LLMs)
    Aljedaani, Wajdi
    Eler, Marcelo Medeiros
    Parthasarathy, P. D.
    PROCEEDINGS OF THE 56TH ACM TECHNICAL SYMPOSIUM ON COMPUTER SCIENCE EDUCATION, SIGCSE TS 2025, VOL 2, 2025, : 25 - 31
  • [37] AGE-RELATED VALUE ORIENTATIONS IN LARGE LANGUAGE MODELS (LLMS)
    Zhang, Xin
    Ren, Yuanyi
    Song, Guojie
    INNOVATION IN AGING, 2024, 8 : 1010 - 1010
  • [38] Harnessing large language models (LLMs) for candidate gene prioritization and selection
    Mohammed Toufiq
    Darawan Rinchai
    Eleonore Bettacchioli
    Basirudeen Syed Ahamed Kabeer
    Taushif Khan
    Bishesh Subba
    Olivia White
    Marina Yurieva
    Joshy George
    Noemie Jourde-Chiche
    Laurent Chiche
    Karolina Palucka
    Damien Chaussabel
    Journal of Translational Medicine, 21
  • [39] Artificial Intelligence in the Era of Large Language Models: Technical Significance, Industry Applications, and Challenges
    Chen, Guang
    Guo, Jun
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2024, 47 (04): : 20 - 28
  • [40] Multimodal Large Language Models in Health Care:Applications,Challenges, and Future Outlook
    AlSaad, Rawan
    Abd-alrazaq, Alaa
    Boughorbel, Sabri
    Ahmed, Arfan
    Renault, Max-Antoine
    Damseh, Rafat
    Sheikh, Javaid
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26