Time-Series Large Language Models: A Systematic Review of State-of-the-Art

Cited by: 0
Authors
Abdullahi, Shamsu [1, 2]
Danyaro, Kamaluddeen Usman [1]
Zakari, Abubakar [1, 3]
Aziz, Izzatdin Abdul [1]
Zawawi, Noor Amila Wan Abdullah [4]
Adamu, Shamsuddeen [1]
Affiliations
[1] Univ Teknol PETRONAS, Dept Comp & Informat Sci, Seri Iskandar 32610, Perak, Malaysia
[2] Hassan Usman Katsina Polytech, Dept Comp Sci, Katsina 820102, Katsina State, Nigeria
[3] Aliko Dangote Univ Sci & Technol, Dept Comp Sci, Kano 713101, Nigeria
[4] Univ Teknol PETRONAS, Civil & Environm Engn Dept, Seri Iskandar 32610, Perak, Malaysia
Source
IEEE ACCESS | 2025 / Vol. 13
Keywords
Time series analysis; Measurement; Tokenization; Systematic literature review; Systematics; Databases; Transformers; Search problems; Anomaly detection; Time-series; large language models; forecasting; tokenization; time-series LLMs; PREDICTION
DOI
10.1109/ACCESS.2025.3535782
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Large Language Models (LLMs) have transformed Natural Language Processing (NLP) and Software Engineering by fostering innovation, streamlining processes, and enabling data-driven decision-making. Recently, the adoption of LLMs in time-series analysis has catalyzed the emergence of time-series LLMs, a rapidly evolving research area. Existing reviews provide foundational insights into time-series LLMs but lack a comprehensive examination of recent advancements and do not adequately address critical challenges in this domain. This Systematic Literature Review (SLR) bridges these gaps by analyzing state-of-the-art contributions in time-series LLMs, focusing on architectural innovations, tokenization strategies, tasks, datasets, evaluation metrics, and unresolved challenges. Using a rigorous methodology based on PRISMA guidelines, over 700 studies from 2020 to 2024 were reviewed, with 59 relevant studies selected from journals, conferences, and workshops. Key findings reveal advancements in architectures and novel tokenization strategies tailored for temporal data. Forecasting dominates the identified tasks with 79.66% of the selected studies, while classification and anomaly detection remain underexplored. Furthermore, the analysis reveals a strong reliance on datasets from the energy and transportation domains, highlighting the need for more diverse datasets. Despite these advancements, significant challenges persist, including tokenization inefficiencies, prediction hallucinations, and difficulties in modeling long-term dependencies. These issues hinder the robustness, scalability, and adaptability of time-series LLMs across diverse applications. To address these challenges, this SLR outlines a research roadmap emphasizing the improvement of tokenization methods, the development of mechanisms for capturing long-term dependencies, the mitigation of hallucination effects, and the design of scalable, interpretable models for diverse time-series tasks.
Pages: 30235-30261
Number of pages: 27
Related papers
  • [1] Time-series forecasting in smart manufacturing systems: An experimental evaluation of the state-of-the-art algorithms
    Farahani, Mojtaba A.
    El Kalach, Fadi
    Harper, Austin
    Mccormick, M. R.
    Harik, Ramy
    Wuest, Thorsten
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2025, 95
  • [2] How Dense Autoencoders can still Achieve the State-of-the-art in Time-Series Anomaly Detection
    Jensen, Louis
    Fosa, Jayme
    Teitelbaum, Ben
    Chin, Peter
    20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021: 1272-1277
  • [3] Unsupervised anomaly detection in time-series: An extensive evaluation and analysis of state-of-the-art methods
    Mejri, Nesryne
    Lopez-Fuentes, Laura
    Roy, Kankana
    Chernakov, Pavel
    Ghorbel, Enjie
    Aouada, Djamila
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 256
  • [4] Can Large Language Models be Anomaly Detectors for Time Series?
    Alnegheimish, Sarah
    Nguyen, Linh
    Berti-Equille, Laure
    Veeramachaneni, Kalyan
    2024 IEEE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, DSAA 2024, 2024: 218-227
  • [5] Application of Large Language Models in Cybersecurity: A Systematic Literature Review
    Hasanov, Ismayil
    Virtanen, Seppo
    Hakkala, Antti
    Isoaho, Jouni
    IEEE ACCESS, 2024, 12: 176751-176778
  • [6] Time Series Classification With Large Language Models via Linguistic Scaffolding
    Jang, Hyeongwon
    Yong Yang, June
    Hwang, Jaeryong
    Yang, Eunho
    IEEE ACCESS, 2024, 12: 170387-170398
  • [7] Large Language Models for Intelligent Transportation: A Review of the State of the Art and Challenges
    Wandelt, Sebastian
    Zheng, Changhong
    Wang, Shuang
    Liu, Yucheng
    Sun, Xiaoqian
    APPLIED SCIENCES-BASEL, 2024, 14 (17)
  • [8] TIME-SERIES FORECASTING FOR SOME STATISTICAL MODELS
    Al Mutairi, Alya
    ADVANCES AND APPLICATIONS IN STATISTICS, 2022, 78: 83-92
  • [9] A comprehensive evaluation of state-of-the-art time-series deep learning models for activity-recognition in post-stroke rehabilitation assessment
    Boukhennoufa, Issam
    Zhai, Xiaojun
    Utti, Victor
    Jackson, Jo
    McDonald-Maier, Klaus D.
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021: 2242-2247
  • [10] The Drone Scheduling Problem: A Systematic State-of-the-Art Review
    Pasha, Junayed
    Elmi, Zeinab
    Purkayastha, Sumit
    Fathollahi-Fard, Amir M.
    Ge, Ying-En
    Lau, Yui-Yip
    Dulebenets, Maxim A.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (09): 14224-14247