BjTT: A Large-Scale Multimodal Dataset for Traffic Prediction

被引:4
|
作者
Zhang, Chengyang [1 ]
Zhang, Yong [1 ]
Shao, Qitan [1 ]
Feng, Jiangtao [1 ]
Li, Bo [1 ]
Lv, Yisheng [2 ]
Piao, Xinglin [1 ]
Yin, Baocai [1 ]
机构
[1] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Sch Informat Sci & Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Roads; Social networking (online); Transportation; Data collection; Task analysis; Blogs; Meteorology; Traffic prediction; large-scale; new dataset; FLOW; NETWORKS; MODELS;
D O I
10.1109/TITS.2024.3440650
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Traffic prediction plays a significant role in Intelligent Transportation Systems (ITS). Although many datasets have been introduced to support the study of traffic prediction, most of them only provide time-series traffic data. However, urban transportation systems are always susceptible to various factors, including unusual weather and traffic accidents. Therefore, relying solely on historical data for traffic prediction greatly limits the accuracy of the prediction. In this paper, we introduce Beijing Text-Traffic (BjTT), a large-scale multimodal dataset for traffic prediction. BjTT comprises over 32,000 time-series traffic records, capturing velocity and congestion levels on more than 1,200 roads within the 5th ring area of Beijing. Meanwhile, each piece of traffic data is coupled with a text describing the traffic system (including time, location, and events). We detail the data collection and processing procedures and present a statistical analysis of the BjTT dataset. Furthermore, we conduct comprehensive experiments on the dataset with state-of-the-art traffic prediction methods and text-guided generative models, which reveal the unique characteristics of the BjTT. The dataset is available at https://github.com/ChyaZhang/BjTT.
引用
收藏
页码:18992 / 19003
页数:12
相关论文
共 50 条
  • [1] A Large-Scale Spatio-Temporal Multimodal Fusion Framework for Traffic Prediction
    Zhou, Bodong
    Liu, Jiahui
    Cui, Songyi
    Zhao, Yaping
    BIG DATA MINING AND ANALYTICS, 2024, 7 (03): : 621 - 636
  • [2] Large-Scale Traffic Prediction With Hierarchical Hypergraph Message Passing Networks
    Wang, Jingcheng
    Zhang, Yong
    Hu, Yongli
    Yin, Baocai
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (06): : 7103 - 7113
  • [3] Spatiotemporal Patterns in Large-Scale Traffic Speed Prediction
    Asif, Muhammad Tayyab
    Dauwels, Justin
    Goh, Chong Yang
    Oran, Ali
    Fathi, Esmail
    Xu, Muye
    Dhanya, Menoth Mohan
    Mitrovic, Nikola
    Jaillet, Patrick
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2014, 15 (02) : 794 - 804
  • [4] Large-Scale Measurements and Prediction of DC-WAN Traffic
    Wang, Zhaohua
    Li, Zhenyu
    Pan, Heng
    Liu, Guangming
    Chen, Yunfei
    Wu, Qinghua
    Tyson, Gareth
    Cheng, Gang
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (05) : 1390 - 1405
  • [5] Empirical analysis of large-scale multimodal traffic with multi-sensor data
    Fu, Hui
    Wang, Yefei
    Tang, Xianma
    Zheng, Nan
    Geroliminis, Nikolaos
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 118
  • [6] Metamodel-based calibration of large-scale multimodal microscopic traffic simulation
    Patwary, A. U. Z.
    Huang, Wei
    Lo, Hong K.
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2021, 124
  • [7] A Framework for Large-Scale Synthetic Graph Dataset Generation
    Darabi, Sajad
    Bigaj, Piotr
    Majchrowski, Dawid
    Kasymov, Artur
    Morkisz, Pawel
    Fit-Florea, Alex
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025,
  • [8] 100-Driver: A Large-Scale, Diverse Dataset for Distracted Driver Classification
    Wang, Jing
    Li, Wenjing
    Li, Fang
    Zhang, Jun
    Wu, Zhongcheng
    Zhong, Zhun
    Sebe, Nicu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (07) : 7061 - 7072
  • [9] SELMA: SEmantic Large-Scale Multimodal Acquisitions in Variable Weather, Daytime and Viewpoints
    Testolina, Paolo
    Barbato, Francesco
    Michieli, Umberto
    Giordani, Marco
    Zanuttigh, Pietro
    Zorzi, Michele
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (07) : 7012 - 7024
  • [10] DANEWSROOM: A Large-scale Danish Summarisation Dataset
    Varab, Daniel
    Schluter, Natalie
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6731 - 6739