BjTT: A Large-Scale Multimodal Dataset for Traffic Prediction

被引:4
|
作者
Zhang, Chengyang [1 ]
Zhang, Yong [1 ]
Shao, Qitan [1 ]
Feng, Jiangtao [1 ]
Li, Bo [1 ]
Lv, Yisheng [2 ]
Piao, Xinglin [1 ]
Yin, Baocai [1 ]
机构
[1] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Sch Informat Sci & Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Roads; Social networking (online); Transportation; Data collection; Task analysis; Blogs; Meteorology; Traffic prediction; large-scale; new dataset; FLOW; NETWORKS; MODELS;
D O I
10.1109/TITS.2024.3440650
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Traffic prediction plays a significant role in Intelligent Transportation Systems (ITS). Although many datasets have been introduced to support the study of traffic prediction, most of them only provide time-series traffic data. However, urban transportation systems are always susceptible to various factors, including unusual weather and traffic accidents. Therefore, relying solely on historical data for traffic prediction greatly limits the accuracy of the prediction. In this paper, we introduce Beijing Text-Traffic (BjTT), a large-scale multimodal dataset for traffic prediction. BjTT comprises over 32,000 time-series traffic records, capturing velocity and congestion levels on more than 1,200 roads within the 5th ring area of Beijing. Meanwhile, each piece of traffic data is coupled with a text describing the traffic system (including time, location, and events). We detail the data collection and processing procedures and present a statistical analysis of the BjTT dataset. Furthermore, we conduct comprehensive experiments on the dataset with state-of-the-art traffic prediction methods and text-guided generative models, which reveal the unique characteristics of the BjTT. The dataset is available at https://github.com/ChyaZhang/BjTT.
引用
收藏
页码:18992 / 19003
页数:12
相关论文
共 50 条
  • [31] Lagrangian Models for Controlling Large-Scale Heterogeneous Traffic
    Molnar, Tamas G.
    Upadhyay, Devesh
    Hopka, Michael
    Van Nieuwstadt, Michiel
    Orosz, Gabor
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 3152 - 3157
  • [32] Unraveling Complexity: An Exploration Into Large-Scale Multimodal Signal Processing
    Wen, Zhenyu
    Ye, Yuheng
    Su, Jie
    Li, Taotao
    Wan, Jinhao
    Zheng, Shilian
    Hong, Zhen
    He, Shibo
    Duan, Haoran
    Li, Yuexiang
    Huang, Yawen
    Zheng, Yefeng
    IEEE INTELLIGENT SYSTEMS, 2024, 39 (06) : 48 - 57
  • [33] Celeb-DF: A Large-scale Challenging Dataset for DeepFake Forensics
    Li, Yuezun
    Yang, Xin
    Sun, Pu
    Qi, Honggang
    Lyu, Siwei
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3204 - 3213
  • [34] A Large-Scale Diverse GNSS/SINS Dataset: Construction, Publication, and Application
    Zhu, Feng
    Chen, Xi
    Cai, Qinqing
    Zhang, Xiaohong
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [35] A Hybrid Deep Learning Model for Predicting Depression Symptoms From Large-Scale Textual Dataset
    Almutairi, Sulaiman
    Abohashrh, Mohammed
    Razzaq, Hasanain Hayder
    Zulqarnain, Muhammad
    Namoun, Abdallah
    Khan, Faheem
    IEEE ACCESS, 2024, 12 : 168477 - 168499
  • [36] Introduction and Analysis of a Large-Scale Benchmark Automatic Vehicle Identification Dataset
    He, Zhaocheng
    Chen, Kaiying
    Chen, Xinyu
    Sun, Weiwei
    INTERNATIONAL CONFERENCE ON TRANSPORTATION AND DEVELOPMENT 2018: CONNECTED AND AUTONOMOUS VEHICLES AND TRANSPORTATION SAFETY, 2018, : 35 - 43
  • [37] JHU-CROWD plus plus : Large-Scale Crowd Counting Dataset and A Benchmark Method
    Sindagi, Vishwanath A.
    Yasarla, Rajeev
    Patel, Vishal M.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (05) : 2594 - 2609
  • [38] MANNET: A LARGE-SCALE MANIPULATED IMAGE DETECTION DATASET AND BASELINE EVALUATIONS
    Singh, Aditya
    Chhabra, Saheb
    Majumdar, Puspita
    Singh, Richa
    Vatsa, Mayank
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1780 - 1784
  • [39] Electrical Thermal Image Semantic Segmentation: Large-Scale Dataset and Baseline
    Wang, Futian
    Guo, Yin
    Li, Chenglong
    Lu, Andong
    Ding, Zhongfeng
    Tang, Jin
    Luo, Bin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [40] Incidents1M: A Large-Scale Dataset of Images With Natural Disasters, Damage, and Incidents
    Weber, Ethan
    Papadopoulos, Dim P.
    Lapedriza, Agata
    Ofli, Ferda
    Imran, Muhammad
    Torralba, Antonio
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4768 - 4781