Survey of Different Large Language Model Architectures: Trends, Benchmarks, and Challenges

Cited by: 1

Authors:

Shao, Minghao [1]

Basit, Abdul [2]

Karri, Ramesh [1]

Shafique, Muhammad [2]

Affiliations:

[1] NYU, Tandon Sch Engn, New York, NY 10012 USA

[2] New York Univ Abu Dhabi, Abu Dhabi Engn Div, Abu Dhabi, U Arab Emirates

Source:

IEEE ACCESS | 2024 / Vol. 12

Keywords:

Surveys; Transformers; Benchmark testing; Encoding; Large language models; Adaptation models; Market research; Decoding; Training; Computational modeling; Large language models (LLMs); Transformer architecture; generative models; survey; multimodal learning; deep learning; natural language processing (NLP); GENERATIVE ADVERSARIAL NETWORKS;

DOI:

10.1109/ACCESS.2024.3482107

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Large Language Models (LLMs) represent a class of deep learning models adept at understanding natural language and generating coherent responses to various prompts or queries. These models far exceed the complexity of conventional neural networks, often encompassing dozens of neural network layers and containing billions to trillions of parameters. They are typically trained on vast datasets, utilizing architectures based on transformer blocks. Present-day LLMs are multi-functional, capable of performing a range of tasks from text generation and language translation to question answering, as well as code generation and analysis. An advanced subset of these models, known as Multimodal Large Language Models (MLLMs), extends LLM capabilities to process and interpret multiple data modalities, including images, audio, and video. This enhancement empowers MLLMs with capabilities like video editing, image comprehension, and captioning for visual content. This survey provides a comprehensive overview of the recent advancements in LLMs. We begin by tracing the evolution of LLMs and subsequently delve into the advent and nuances of MLLMs. We analyze emerging state-of-the-art MLLMs, exploring their technical features, strengths, and limitations. Additionally, we present a comparative analysis of these models and discuss their challenges, potential limitations, and prospects for future development.

Pages: 188664 - 188706

Page count: 43
