Survey of Different Large Language Model Architectures: Trends, Benchmarks, and Challenges

被引：1

作者：

Shao, Minghao ^{[1
]}

Basit, Abdul ^{[2
]}

Karri, Ramesh ^{[1
]}

Shafique, Muhammad ^{[2
]}

机构：

[1] NYU, Tandon Sch Engn, New York, NY 10012 USA

[2] New York Univ Abu Dhabi, Abu Dhabi Engn Div, Abu Dhabi, U Arab Emirates

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Surveys; Transformers; Benchmark testing; Encoding; Large language models; Adaptation models; Market research; Decoding; Training; Computational modeling; Large language models (LLMs); Transformer architecture; generative models; survey; multimodal learning; deep learning; natural language processing (NLP); GENERATIVE ADVERSARIAL NETWORKS;

D O I：

10.1109/ACCESS.2024.3482107

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Large Language Models (LLMs) represent a class of deep learning models adept at understanding natural language and generating coherent responses to various prompts or queries. These models far exceed the complexity of conventional neural networks, often encompassing dozens of neural network layers and containing billions to trillions of parameters. They are typically trained on vast datasets, utilizing architectures based on transformer blocks. Present-day LLMs are multi-functional, capable of performing a range of tasks from text generation and language translation to question answering, as well as code generation and analysis. An advanced subset of these models, known as Multimodal Large Language Models (MLLMs), extends LLM capabilities to process and interpret multiple data modalities, including images, audio, and video. This enhancement empowers MLLMs with capabilities like video editing, image comprehension, and captioning for visual content. This survey provides a comprehensive overview of the recent advancements in LLMs. We begin by tracing the evolution of LLMs and subsequently delve into the advent and nuances of MLLMs. We analyze emerging state-of-the-art MLLMs, exploring their technical features, strengths, and limitations. Additionally, we present a comparative analysis of these models and discuss their challenges, potential limitations, and prospects for future development.

引用

页码：188664 / 188706

页数：43

共 50 条

[1] Federated Large Language Model: Solutions, Challenges and Future Directions
Hu, Jiahui
Wang, Dan
Wang, Zhibo
Pang, Xiaoyi
Xu, Huiyu
Ren, Ju
Ren, Kui
IEEE WIRELESS COMMUNICATIONS, 2024,
[2] A Review on Large Language Models: Architectures, Applications, Taxonomies, Open Issues and Challenges
Raiaan, Mohaimenul Azam Khan
Mukta, Md. Saddam Hossain
Fatema, Kaniz
Fahad, Nur Mohammad
Sakib, Sadman
Mim, Most Marufatul Jannat
Ahmad, Jubaer
Ali, Mohammed Eunus
Azam, Sami
IEEE ACCESS, 2024, 12 : 26839 - 26874
[3] Large Language Models on Graphs: A Comprehensive Survey
Jin, Bowen
Liu, Gang
Han, Chi
Jiang, Meng
Ji, Heng
Han, Jiawei
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 8622 - 8642
[4] Large Language Model for Medical Images: A Survey of Taxonomy, Systematic Review, and Future Trends
Wang, Peng
Lu, Wenpeng
Lu, Chunlin
Zhou, Ruoxi
Li, Min
Qin, Libo
BIG DATA MINING AND ANALYTICS, 2025, 8 (02): : 496 - 517
[5] Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
Cao, Yuji
Zhao, Huan
Cheng, Yuheng
Shu, Ting
Chen, Yue
Liu, Guolong
Liang, Gaoqi
Zhao, Junhua
Yan, Jinyue
Li, Yun
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[6] A survey on augmenting knowledge graphs (KGs) with large language models (LLMs): models, evaluation metrics, benchmarks, and challenges
Ibrahim, Nourhan
Aboulela, Samar
Ibrahim, Ahmed
Kashef, Rasha
Discover Artificial Intelligence, 2024, 4 (01):
[7] When geoscience meets generative AI and large language models: Foundations, trends, and future challenges
Hadid, Abdenour
Chakraborty, Tanujit
Busby, Daniel
EXPERT SYSTEMS, 2024, 41 (10)
[8] Towards Large-Scale Small Object Detection: Survey and Benchmarks
Cheng, Gong
Yuan, Xiang
Yao, Xiwen
Yan, Kebing
Zeng, Qinghua
Xie, Xingxing
Han, Junwei
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13467 - 13488
[9] Trends, Challenges, and Applications of Large Language Models in Healthcare: A Bibliometric and Scoping Review
Carchiolo, Vincenza
Malgeri, Michele
FUTURE INTERNET, 2025, 17 (02)
[10] Security and Privacy Challenges of Large Language Models: A Survey
Das, Badhan chandra
Amini, M. hadi
Wu, Yanzhao
ACM COMPUTING SURVEYS, 2025, 57 (06)

← 1 2 3 4 5 →