BjTT: A Large-Scale Multimodal Dataset for Traffic Prediction

被引:4
|
作者
Zhang, Chengyang [1 ]
Zhang, Yong [1 ]
Shao, Qitan [1 ]
Feng, Jiangtao [1 ]
Li, Bo [1 ]
Lv, Yisheng [2 ]
Piao, Xinglin [1 ]
Yin, Baocai [1 ]
机构
[1] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Sch Informat Sci & Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Roads; Social networking (online); Transportation; Data collection; Task analysis; Blogs; Meteorology; Traffic prediction; large-scale; new dataset; FLOW; NETWORKS; MODELS;
D O I
10.1109/TITS.2024.3440650
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Traffic prediction plays a significant role in Intelligent Transportation Systems (ITS). Although many datasets have been introduced to support the study of traffic prediction, most of them only provide time-series traffic data. However, urban transportation systems are always susceptible to various factors, including unusual weather and traffic accidents. Therefore, relying solely on historical data for traffic prediction greatly limits the accuracy of the prediction. In this paper, we introduce Beijing Text-Traffic (BjTT), a large-scale multimodal dataset for traffic prediction. BjTT comprises over 32,000 time-series traffic records, capturing velocity and congestion levels on more than 1,200 roads within the 5th ring area of Beijing. Meanwhile, each piece of traffic data is coupled with a text describing the traffic system (including time, location, and events). We detail the data collection and processing procedures and present a statistical analysis of the BjTT dataset. Furthermore, we conduct comprehensive experiments on the dataset with state-of-the-art traffic prediction methods and text-guided generative models, which reveal the unique characteristics of the BjTT. The dataset is available at https://github.com/ChyaZhang/BjTT.
引用
收藏
页码:18992 / 19003
页数:12
相关论文
共 50 条
  • [41] Discovering Mobile Application Usage Patterns from a Large-Scale Dataset
    Silva, Fabricio A.
    Domingues, Augusto C. S. A.
    Braga Silva, Thais R. M.
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2018, 12 (05)
  • [42] TRAFFIC DESIGN METHOD FOR LARGE-SCALE SWITCHED TELECOMMUNICATION NETWORKS
    MASE, K
    KAWANO, W
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART I-COMMUNICATIONS, 1995, 78 (08): : 10 - 22
  • [43] Quality Prediction System for Large-Scale Digitisation Workflows
    Clausner, Christian
    Pletschacher, Stefan
    Antonacopoulos, Apostolos
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 138 - 143
  • [44] A dynamic spatial-temporal deep learning framework for traffic speed prediction on large-scale road networks
    Zheng, Ge
    Chai, Wei Koong
    Katos, Vasilis
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 195
  • [45] Evaluation of Large-Scale Proteomics for Prediction of Cardiovascular Events
    Helgason, Hannes
    Eiriksdottir, Thjodbjorg
    Ulfarsson, Magnus O.
    Choudhary, Abhishek
    Lund, Sigrun H.
    Ivarsdottir, Erna V.
    Eldjarn, Grimur Hjorleifsson
    Einarsson, Gudmundur
    Ferkingstad, Egil
    Moore, Kristjan H. S.
    Honarpour, Narimon
    Liu, Thomas
    Wang, Huei
    Hucko, Thomas
    Sabatine, Marc S.
    Morrow, David A.
    Giugliano, Robert P.
    Ostrowski, Sisse Rye
    Pedersen, Ole Birger
    Bundgaard, Henning
    Erikstrup, Christian
    Arnar, David O.
    Thorgeirsson, Gudmundur
    Masson, Gisli
    Magnusson, Olafur Th.
    Saemundsdottir, Jona
    Gretarsdottir, Solveig
    Steinthorsdottir, Valgerdur
    Thorleifsson, Gudmar
    Helgadottir, Anna
    Sulem, Patrick
    Thorsteinsdottir, Unnur
    Holm, Hilma
    Gudbjartsson, Daniel
    Stefansson, Kari
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2023, 330 (08): : 725 - 735
  • [46] FMFCC-V: An Asian Large-Scale Challenging Dataset for DeepFake Detection
    Li, Gen
    Zhao, Xianfeng
    Cao, Yun
    Pei, Pengfei
    Li, Jinchuan
    Zhang, Zeyu
    PROCEEDINGS OF THE 2022 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, IH-MMSEC 2022, 2022, : 7 - 18
  • [47] Hand Pose Understanding With Large-Scale Photo-Realistic Rendering Dataset
    Deng, Xiaoming
    Zhang, Yinda
    Shi, Jian
    Zhu, Yuying
    Cheng, Dachuan
    Zuo, Dexin
    Cui, Zhaopeng
    Tan, Ping
    Chang, Liang
    Wang, Hongan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4275 - 4290
  • [48] MultiScene: A Large-Scale Dataset and Benchmark for Multiscene Recognition in Single Aerial Images
    Hua, Yuansheng
    Mou, Lichao
    Jin, Pu
    Zhu, Xiao Xiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [49] Active Traffic Sensor Location Problem for the Uniqueness of Path Flow Identification in Large-Scale Networks
    Almutairi, Ahmed
    Owais, Mahmoud
    IEEE ACCESS, 2024, 12 : 180385 - 180403
  • [50] Deep Learning for Large-Scale Traffic-Sign Detection and Recognition
    Tabernik, Domen
    Skocaj, Danijel
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (04) : 1427 - 1440