Survey of Different Large Language Model Architectures: Trends, Benchmarks, and Challenges

被引:1
|
作者
Shao, Minghao [1 ]
Basit, Abdul [2 ]
Karri, Ramesh [1 ]
Shafique, Muhammad [2 ]
机构
[1] NYU, Tandon Sch Engn, New York, NY 10012 USA
[2] New York Univ Abu Dhabi, Abu Dhabi Engn Div, Abu Dhabi, U Arab Emirates
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Surveys; Transformers; Benchmark testing; Encoding; Large language models; Adaptation models; Market research; Decoding; Training; Computational modeling; Large language models (LLMs); Transformer architecture; generative models; survey; multimodal learning; deep learning; natural language processing (NLP); GENERATIVE ADVERSARIAL NETWORKS;
D O I
10.1109/ACCESS.2024.3482107
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large Language Models (LLMs) represent a class of deep learning models adept at understanding natural language and generating coherent responses to various prompts or queries. These models far exceed the complexity of conventional neural networks, often encompassing dozens of neural network layers and containing billions to trillions of parameters. They are typically trained on vast datasets, utilizing architectures based on transformer blocks. Present-day LLMs are multi-functional, capable of performing a range of tasks from text generation and language translation to question answering, as well as code generation and analysis. An advanced subset of these models, known as Multimodal Large Language Models (MLLMs), extends LLM capabilities to process and interpret multiple data modalities, including images, audio, and video. This enhancement empowers MLLMs with capabilities like video editing, image comprehension, and captioning for visual content. This survey provides a comprehensive overview of the recent advancements in LLMs. We begin by tracing the evolution of LLMs and subsequently delve into the advent and nuances of MLLMs. We analyze emerging state-of-the-art MLLMs, exploring their technical features, strengths, and limitations. Additionally, we present a comparative analysis of these models and discuss their challenges, potential limitations, and prospects for future development.
引用
收藏
页码:188664 / 188706
页数:43
相关论文
共 50 条
  • [1] Federated Large Language Model: Solutions, Challenges and Future Directions
    Hu, Jiahui
    Wang, Dan
    Wang, Zhibo
    Pang, Xiaoyi
    Xu, Huiyu
    Ren, Ju
    Ren, Kui
    IEEE WIRELESS COMMUNICATIONS, 2024,
  • [2] A Review on Large Language Models: Architectures, Applications, Taxonomies, Open Issues and Challenges
    Raiaan, Mohaimenul Azam Khan
    Mukta, Md. Saddam Hossain
    Fatema, Kaniz
    Fahad, Nur Mohammad
    Sakib, Sadman
    Mim, Most Marufatul Jannat
    Ahmad, Jubaer
    Ali, Mohammed Eunus
    Azam, Sami
    IEEE ACCESS, 2024, 12 : 26839 - 26874
  • [3] Large Language Models on Graphs: A Comprehensive Survey
    Jin, Bowen
    Liu, Gang
    Han, Chi
    Jiang, Meng
    Ji, Heng
    Han, Jiawei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 8622 - 8642
  • [4] Large Language Model for Medical Images: A Survey of Taxonomy, Systematic Review, and Future Trends
    Wang, Peng
    Lu, Wenpeng
    Lu, Chunlin
    Zhou, Ruoxi
    Li, Min
    Qin, Libo
    BIG DATA MINING AND ANALYTICS, 2025, 8 (02): : 496 - 517
  • [5] Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
    Cao, Yuji
    Zhao, Huan
    Cheng, Yuheng
    Shu, Ting
    Chen, Yue
    Liu, Guolong
    Liang, Gaoqi
    Zhao, Junhua
    Yan, Jinyue
    Li, Yun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [6] A survey on augmenting knowledge graphs (KGs) with large language models (LLMs): models, evaluation metrics, benchmarks, and challenges
    Ibrahim, Nourhan
    Aboulela, Samar
    Ibrahim, Ahmed
    Kashef, Rasha
    Discover Artificial Intelligence, 2024, 4 (01):
  • [7] When geoscience meets generative AI and large language models: Foundations, trends, and future challenges
    Hadid, Abdenour
    Chakraborty, Tanujit
    Busby, Daniel
    EXPERT SYSTEMS, 2024, 41 (10)
  • [8] Towards Large-Scale Small Object Detection: Survey and Benchmarks
    Cheng, Gong
    Yuan, Xiang
    Yao, Xiwen
    Yan, Kebing
    Zeng, Qinghua
    Xie, Xingxing
    Han, Junwei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13467 - 13488
  • [9] Trends, Challenges, and Applications of Large Language Models in Healthcare: A Bibliometric and Scoping Review
    Carchiolo, Vincenza
    Malgeri, Michele
    FUTURE INTERNET, 2025, 17 (02)
  • [10] Security and Privacy Challenges of Large Language Models: A Survey
    Das, Badhan chandra
    Amini, M. hadi
    Wu, Yanzhao
    ACM COMPUTING SURVEYS, 2025, 57 (06)