Survey of Different Large Language Model Architectures: Trends, Benchmarks, and Challenges

被引：1

作者：

Shao, Minghao ^{[1
]}

Basit, Abdul ^{[2
]}

Karri, Ramesh ^{[1
]}

Shafique, Muhammad ^{[2
]}

机构：

[1] NYU, Tandon Sch Engn, New York, NY 10012 USA

[2] New York Univ Abu Dhabi, Abu Dhabi Engn Div, Abu Dhabi, U Arab Emirates

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Surveys; Transformers; Benchmark testing; Encoding; Large language models; Adaptation models; Market research; Decoding; Training; Computational modeling; Large language models (LLMs); Transformer architecture; generative models; survey; multimodal learning; deep learning; natural language processing (NLP); GENERATIVE ADVERSARIAL NETWORKS;

D O I：

10.1109/ACCESS.2024.3482107

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Large Language Models (LLMs) represent a class of deep learning models adept at understanding natural language and generating coherent responses to various prompts or queries. These models far exceed the complexity of conventional neural networks, often encompassing dozens of neural network layers and containing billions to trillions of parameters. They are typically trained on vast datasets, utilizing architectures based on transformer blocks. Present-day LLMs are multi-functional, capable of performing a range of tasks from text generation and language translation to question answering, as well as code generation and analysis. An advanced subset of these models, known as Multimodal Large Language Models (MLLMs), extends LLM capabilities to process and interpret multiple data modalities, including images, audio, and video. This enhancement empowers MLLMs with capabilities like video editing, image comprehension, and captioning for visual content. This survey provides a comprehensive overview of the recent advancements in LLMs. We begin by tracing the evolution of LLMs and subsequently delve into the advent and nuances of MLLMs. We analyze emerging state-of-the-art MLLMs, exploring their technical features, strengths, and limitations. Additionally, we present a comparative analysis of these models and discuss their challenges, potential limitations, and prospects for future development.

引用

页码：188664 / 188706

页数：43

共 50 条

[21] A Survey: Collaborative Hardware and Software Design in the Era of Large Language Models
Guo, Cong
Cheng, Feng
Du, Zhixu
Kiessling, James
Ku, Jonathan
Li, Shiyu
Li, Ziru
Ma, Mingyuan
Molom-Ochir, Tergel
Morris, Benjamin
Shan, Haoxuan
Sun, Jingwei
Wang, Yitu
Wei, Chiyue
Wu, Xueying
Wu, Yuhao
Yang, Hao Frank
Zhang, Jingyang
Zhang, Junyao
Zheng, Qilin
Zhou, Guanglei
Li, Hai
Chen, Yiran
IEEE CIRCUITS AND SYSTEMS MAGAZINE, 2025, 25 (01) : 35 - 57
[22] A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions
Huang, Lei
Yu, Weijiang
Ma, Weitao
Zhong, Weihong
Feng, Zhangyin
Wang, Haotian
Chen, Qianglong
Peng, Weihua
Feng, Xiaocheng
Qin, Bing
Liu, Ting
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2025, 43 (02)
[23] A survey of table reasoning with large language models
Zhang, Xuanliang
Wang, Dingzirui
Dou, Longxu
Zhu, Qingfu
Che, Wanxiang
FRONTIERS OF COMPUTER SCIENCE, 2025, 19 (09)
[24] The rise and potential of large language model based agents: a survey
Xi, Zhiheng
Chen, Wenxiang
Guo, Xin
He, Wei
Ding, Yiwen
Hong, Boyang
Zhang, Ming
Wang, Junzhe
Jin, Senjie
Zhou, Enyu
Zheng, Rui
Fan, Xiaoran
Wang, Xiao
Xiong, Limao
Zhou, Yuhao
Wang, Weiran
Jiang, Changhao
Zou, Yicheng
Liu, Xiangyang
Yin, Zhangyue
Dou, Shihan
Weng, Rongxiang
Qin, Wenjuan
Zheng, Yongyan
Qiu, Xipeng
Huang, Xuanjing
Zhang, Qi
Gui, Tao
SCIENCE CHINA-INFORMATION SCIENCES, 2025, 68 (02)
[25] A Survey on Hardware Accelerators for Large Language Models
Kachris, Christoforos
APPLIED SCIENCES-BASEL, 2025, 15 (02):
[26] When Search Engine Services Meet Large Language Models: Visions and Challenges
Xiong, Haoyi
Bian, Jiang
Li, Yuchen
Li, Xuhong
Du, Mengnan
Wang, Shuaiqiang
Yin, Dawei
Helal, Sumi
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (06) : 4558 - 4577
[27] Current status and trends in large language modeling research
Wang Y.
Li Q.
Dai Z.
Xu Y.
Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2024, 46 (08): : 1411 - 1425
[28] A survey on extremism analysis using natural language processing: definitions, literature review, trends and challenges
Torregrosa, Javier
Bello-Orgaz, Gema
Martinez-Camara, Eugenio
Del Ser, Javier
Camacho, David
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 14 (8) : 9869 - 9905
[29] A survey on extremism analysis using natural language processing: definitions, literature review, trends and challenges
Javier Torregrosa
Gema Bello-Orgaz
Eugenio Martínez-Cámara
Javier Del Ser
David Camacho
Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 9869 - 9905
[30] A survey of transformers and large language models for ECG diagnosis: advances, challenges, and future directions
Mohammed Yusuf Ansari
Mohammed Yaqoob
Mohammed Ishaq
Eduardo Feo Flushing
Iffa Afsa changaai Mangalote
Sarada Prasad Dakua
Omar Aboumarzouk
Raffaella Righetti
Marwa Qaraqe
Artificial Intelligence Review, 58 (9)

← 1 2 3 4 5 →