A Quantitative Analysis of State Space Model-Based Large Language Model: Study of Hungry Hungry Hippos

被引:0
|
作者
Yoon, Dongho [1 ]
Kim, Taehun [1 ]
Lee, Jae W. [2 ]
Rhu, Minsoo [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Daejeon 34141, South Korea
[2] Seoul Natl Univ, Seoul 08826, South Korea
关键词
Computational modeling; Convolution; Mathematical models; Graphics processing units; Memory management; Computational complexity; Vectors; GPU; h3; large language models; state space model;
D O I
10.1109/LCA.2024.3422492
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As the need for processing long contexts in large language models (LLMs) increases, attention-based LLMs face significant challenges due to their high computation and memory requirements. To overcome this challenge, there have been several recent works that seek to alleviate attention's system-level bottlenecks. An approach that has been receiving a lot of attraction lately is state space models (SSMs) thanks to their ability to substantially reduce computational complexity and memory footprint. Despite the excitement around SSMs, there is a lack of an in-depth characterization and analysis on this important model architecture. In this paper, we delve into a representative SSM named Hungry Hungry Hippos (H3), examining its advantages as well as its current limitations. We also discuss future research directions on improving the efficiency of SSMs via hardware architectural support.
引用
收藏
页码:154 / 157
页数:4
相关论文
共 50 条
  • [1] LUNA: A Model-Based Universal Analysis Framework for Large Language Models
    Song, Da
    Xie, Xuan
    Song, Jiayang
    Zhu, Derui
    Huang, Yuheng
    Felix, Juefei-Xu
    Ma, Lei
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2024, 50 (07) : 1921 - 1948
  • [2] Large language model-based evolutionary optimizer: Reasoning with elitism
    Brahmachary, Shuvayan
    Joshi, Subodh M.
    Panda, Aniruddha
    Koneripalli, Kaushik
    Sagotra, Arun Kumar
    Patel, Harshil
    Sharma, Ankush
    Jagtap, Ameya D.
    Kalyanaraman, Kaushic
    NEUROCOMPUTING, 2025, 622
  • [3] Large Language Model-based Test Case Generation for GP Agents
    Jorgensen, Steven
    Nadizar, Giorgia
    Pietropolli, Gloria
    Manzoni, Luca
    Medvet, Eric
    O'Reilly, Una-May
    Hemberg, Erik
    PROCEEDINGS OF THE 2024 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2024, 2024, : 914 - 923
  • [4] Hallucination Reduction and Optimization for Large Language Model-Based Autonomous Driving
    Wang, Jue
    SYMMETRY-BASEL, 2024, 16 (09):
  • [5] Effects of Large Language Model-Based Offerings on the Well-Being of Students: Qualitative Study
    Selim, Rania
    Basu, Arunima
    Anto, Ailin
    Foscht, Thomas
    Eisingerich, Andreas Benedikt
    JMIR FORMATIVE RESEARCH, 2024, 8
  • [6] Exploring the potential of large language model-based chatbots in challenges of ribosome profiling data analysis: a review
    Ding, Zheyu
    Wei, Rong
    Xia, Jianing
    Mu, Yonghao
    Wang, Jiahuan
    Lin, Yingying
    BRIEFINGS IN BIOINFORMATICS, 2024, 26 (01)
  • [7] Designing a Large Language Model-Based Coaching Intervention for Lifestyle Behavior Change
    Meywirth, Sophia
    DESIGN SCIENCE RESEARCH FOR A RESILIENT FUTURE, DESRIST 2024, 2024, 14621 : 81 - 94
  • [8] Ranked List Truncation for Large Language Model-based Re-Ranking
    Meng, Chuan
    Arabzadeh, Negar
    Askari, Arian
    Aliannejadi, Mohammad
    de Rijke, Maarten
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 141 - 151
  • [9] LAMARS: Large Language Model-Based Anticipation Mechanism Acceleration in Real-Time Robotic Systems
    Gao, Yifang
    Luo, Wei
    Wang, Xuye
    Zhang, Shunshun
    Goh, Patrick
    IEEE ACCESS, 2025, 13 : 3864 - 3880
  • [10] Fast computation of uncertainty lower bounds for state-space model-based operational modal analysis
    Shi, Yuanfeng
    Li, Binbin
    Au, Siu-Kui
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2022, 169