Survey of Different Large Language Model Architectures: Trends, Benchmarks, and Challenges

被引:1
|
作者
Shao, Minghao [1 ]
Basit, Abdul [2 ]
Karri, Ramesh [1 ]
Shafique, Muhammad [2 ]
机构
[1] NYU, Tandon Sch Engn, New York, NY 10012 USA
[2] New York Univ Abu Dhabi, Abu Dhabi Engn Div, Abu Dhabi, U Arab Emirates
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Surveys; Transformers; Benchmark testing; Encoding; Large language models; Adaptation models; Market research; Decoding; Training; Computational modeling; Large language models (LLMs); Transformer architecture; generative models; survey; multimodal learning; deep learning; natural language processing (NLP); GENERATIVE ADVERSARIAL NETWORKS;
D O I
10.1109/ACCESS.2024.3482107
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large Language Models (LLMs) represent a class of deep learning models adept at understanding natural language and generating coherent responses to various prompts or queries. These models far exceed the complexity of conventional neural networks, often encompassing dozens of neural network layers and containing billions to trillions of parameters. They are typically trained on vast datasets, utilizing architectures based on transformer blocks. Present-day LLMs are multi-functional, capable of performing a range of tasks from text generation and language translation to question answering, as well as code generation and analysis. An advanced subset of these models, known as Multimodal Large Language Models (MLLMs), extends LLM capabilities to process and interpret multiple data modalities, including images, audio, and video. This enhancement empowers MLLMs with capabilities like video editing, image comprehension, and captioning for visual content. This survey provides a comprehensive overview of the recent advancements in LLMs. We begin by tracing the evolution of LLMs and subsequently delve into the advent and nuances of MLLMs. We analyze emerging state-of-the-art MLLMs, exploring their technical features, strengths, and limitations. Additionally, we present a comparative analysis of these models and discuss their challenges, potential limitations, and prospects for future development.
引用
收藏
页码:188664 / 188706
页数:43
相关论文
共 50 条
  • [21] A Survey: Collaborative Hardware and Software Design in the Era of Large Language Models
    Guo, Cong
    Cheng, Feng
    Du, Zhixu
    Kiessling, James
    Ku, Jonathan
    Li, Shiyu
    Li, Ziru
    Ma, Mingyuan
    Molom-Ochir, Tergel
    Morris, Benjamin
    Shan, Haoxuan
    Sun, Jingwei
    Wang, Yitu
    Wei, Chiyue
    Wu, Xueying
    Wu, Yuhao
    Yang, Hao Frank
    Zhang, Jingyang
    Zhang, Junyao
    Zheng, Qilin
    Zhou, Guanglei
    Li, Hai
    Chen, Yiran
    IEEE CIRCUITS AND SYSTEMS MAGAZINE, 2025, 25 (01) : 35 - 57
  • [22] A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions
    Huang, Lei
    Yu, Weijiang
    Ma, Weitao
    Zhong, Weihong
    Feng, Zhangyin
    Wang, Haotian
    Chen, Qianglong
    Peng, Weihua
    Feng, Xiaocheng
    Qin, Bing
    Liu, Ting
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2025, 43 (02)
  • [23] A survey of table reasoning with large language models
    Zhang, Xuanliang
    Wang, Dingzirui
    Dou, Longxu
    Zhu, Qingfu
    Che, Wanxiang
    FRONTIERS OF COMPUTER SCIENCE, 2025, 19 (09)
  • [24] The rise and potential of large language model based agents: a survey
    Xi, Zhiheng
    Chen, Wenxiang
    Guo, Xin
    He, Wei
    Ding, Yiwen
    Hong, Boyang
    Zhang, Ming
    Wang, Junzhe
    Jin, Senjie
    Zhou, Enyu
    Zheng, Rui
    Fan, Xiaoran
    Wang, Xiao
    Xiong, Limao
    Zhou, Yuhao
    Wang, Weiran
    Jiang, Changhao
    Zou, Yicheng
    Liu, Xiangyang
    Yin, Zhangyue
    Dou, Shihan
    Weng, Rongxiang
    Qin, Wenjuan
    Zheng, Yongyan
    Qiu, Xipeng
    Huang, Xuanjing
    Zhang, Qi
    Gui, Tao
    SCIENCE CHINA-INFORMATION SCIENCES, 2025, 68 (02)
  • [25] A Survey on Hardware Accelerators for Large Language Models
    Kachris, Christoforos
    APPLIED SCIENCES-BASEL, 2025, 15 (02):
  • [26] When Search Engine Services Meet Large Language Models: Visions and Challenges
    Xiong, Haoyi
    Bian, Jiang
    Li, Yuchen
    Li, Xuhong
    Du, Mengnan
    Wang, Shuaiqiang
    Yin, Dawei
    Helal, Sumi
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (06) : 4558 - 4577
  • [27] Current status and trends in large language modeling research
    Wang Y.
    Li Q.
    Dai Z.
    Xu Y.
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2024, 46 (08): : 1411 - 1425
  • [28] A survey on extremism analysis using natural language processing: definitions, literature review, trends and challenges
    Torregrosa, Javier
    Bello-Orgaz, Gema
    Martinez-Camara, Eugenio
    Del Ser, Javier
    Camacho, David
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 14 (8) : 9869 - 9905
  • [29] A survey on extremism analysis using natural language processing: definitions, literature review, trends and challenges
    Javier Torregrosa
    Gema Bello-Orgaz
    Eugenio Martínez-Cámara
    Javier Del Ser
    David Camacho
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 9869 - 9905
  • [30] A survey of transformers and large language models for ECG diagnosis: advances, challenges, and future directions
    Mohammed Yusuf Ansari
    Mohammed Yaqoob
    Mohammed Ishaq
    Eduardo Feo Flushing
    Iffa Afsa changaai Mangalote
    Sarada Prasad Dakua
    Omar Aboumarzouk
    Raffaella Righetti
    Marwa Qaraqe
    Artificial Intelligence Review, 58 (9)