Survey of Different Large Language Model Architectures: Trends, Benchmarks, and Challenges

被引:1
|
作者
Shao, Minghao [1 ]
Basit, Abdul [2 ]
Karri, Ramesh [1 ]
Shafique, Muhammad [2 ]
机构
[1] NYU, Tandon Sch Engn, New York, NY 10012 USA
[2] New York Univ Abu Dhabi, Abu Dhabi Engn Div, Abu Dhabi, U Arab Emirates
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Surveys; Transformers; Benchmark testing; Encoding; Large language models; Adaptation models; Market research; Decoding; Training; Computational modeling; Large language models (LLMs); Transformer architecture; generative models; survey; multimodal learning; deep learning; natural language processing (NLP); GENERATIVE ADVERSARIAL NETWORKS;
D O I
10.1109/ACCESS.2024.3482107
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large Language Models (LLMs) represent a class of deep learning models adept at understanding natural language and generating coherent responses to various prompts or queries. These models far exceed the complexity of conventional neural networks, often encompassing dozens of neural network layers and containing billions to trillions of parameters. They are typically trained on vast datasets, utilizing architectures based on transformer blocks. Present-day LLMs are multi-functional, capable of performing a range of tasks from text generation and language translation to question answering, as well as code generation and analysis. An advanced subset of these models, known as Multimodal Large Language Models (MLLMs), extends LLM capabilities to process and interpret multiple data modalities, including images, audio, and video. This enhancement empowers MLLMs with capabilities like video editing, image comprehension, and captioning for visual content. This survey provides a comprehensive overview of the recent advancements in LLMs. We begin by tracing the evolution of LLMs and subsequently delve into the advent and nuances of MLLMs. We analyze emerging state-of-the-art MLLMs, exploring their technical features, strengths, and limitations. Additionally, we present a comparative analysis of these models and discuss their challenges, potential limitations, and prospects for future development.
引用
收藏
页码:188664 / 188706
页数:43
相关论文
共 50 条
  • [32] Assessment of large language models for use in generative design of model based spacecraft system architectures
    Timperley, Louis Richard
    Berthoud, Lucy
    Snider, Chris
    Tryfonas, Theo
    JOURNAL OF ENGINEERING DESIGN, 2025,
  • [33] Embracing Large Language Models for Medical Applications: Opportunities and Challenges
    Karabacak, Mert
    Margetis, Konstantinos
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (05)
  • [34] Challenges in applying large language models to requirements engineering tasks
    Norheim, Johannes J.
    Rebentisch, Eric
    Xiao, Dekai
    Draeger, Lorenz
    Kerbrat, Alain
    de Weck, Olivier L.
    DESIGN SCIENCE, 2024, 10
  • [35] Large Language Models for Business Process Management: Opportunities and Challenges
    Vidgof, Maxim
    Bachhofner, Stefan
    Mendling, Jan
    BUSINESS PROCESS MANAGEMENT FORUM, BPM 2023 FORUM, 2023, 490 : 107 - 123
  • [36] Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead
    Capra, Maurizio
    Bussolino, Beatrice
    Marchisio, Alberto
    Masera, Guido
    Martina, Maurizio
    Shafique, Muhammad
    IEEE ACCESS, 2020, 8 : 225134 - 225180
  • [37] WEDA: Exploring Copyright Protection for Large Language Model Downstream Alignment
    Wang, Shen
    Dong, Jialiang
    Wu, Longfei
    Guan, Zhitao
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4755 - 4767
  • [38] Middleware Architectures for the Smart Grid: Survey and Challenges in the Foreseeable Future
    Martinez, Jose-Fernan
    Rodriguez-Molina, Jesus
    Castillejo, Pedro
    de Diego, Ruben
    ENERGIES, 2013, 6 (07) : 3593 - 3621
  • [39] A survey on potentials, pathways and challenges of large language models in new-generation intelligent manufacturing
    Zhang, Chao
    Xu, Qingfeng
    Yu, Yongrui
    Zhou, Guanghui
    Zeng, Keyan
    Chang, Fengtian
    Ding, Kai
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2025, 92
  • [40] Large Language Models in Finance: A Survey
    Li, Yinheng
    Wang, Shaofei
    Ding, Han
    Chen, Hang
    PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON AI IN FINANCE, ICAIF 2023, 2023, : 374 - 382