Meta-Learning-Based Spatial-Temporal Adaption for Coldstart Air Pollution Prediction

Cited by: 1
Authors
Wu, Zhiyuan [1]
Liu, Ning [1]
Li, Guodong [2]
Liu, Xinyu [3]
Wang, Yue [1]
Zhang, Lin [3]
Affiliations
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
[2] Tsinghua Univ, Tsinghua Berkeley Shenzhen Inst, Shenzhen, Peoples R China
[3] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
Keywords
Compendex;
DOI
10.1155/2023/3734557
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Air pollution is a significant public concern worldwide, and accurate data-driven air pollution prediction is crucial for developing alerting systems and making urban decisions. As more and more cities establish their monitoring networks, there is a pressing need for coldstart model training with limited data accumulation in new cities. However, traditional spatial-temporal modeling and transfer learning schemes struggle in this scenario because they make insufficient use of the available source data and rely on suboptimal transfer strategies. To address these issues, we propose a meta-learning-based spatial-temporal adaptation solution for coldstart air pollution prediction. Our approach is a model-agnostic framework that equips a given backbone predictor with the ability to adapt across different spatial and temporal locations. Specifically, it learns a factorization of the available source data distribution and recognizes the target city as one of its components, greatly reducing the data accumulation requirement and providing coldstart capability. Furthermore, we design a novel bidirectional meta-learner that can simultaneously leverage task embeddings learned from data and features constructed from prior knowledge. We conduct comprehensive experiments on both synthetic and real-world air pollution datasets covering four distinct pollutants. The results show that our proposed method achieves a 5.2% lower 24-hour prediction mean absolute error (MAE) than pretraining and fine-tuning solutions when facing a new city with only 200 hours of data, which empirically verifies its effectiveness as a coldstart training solution.
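The headline result in the abstract is a relative reduction in mean absolute error (MAE). The following is a minimal sketch of how such a figure is typically computed, not the authors' evaluation code. The function names and all sample values are hypothetical and chosen only for illustration; the abstract does not report the underlying MAE values.

```python
from typing import Sequence


def mean_absolute_error(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Average absolute difference between observations and forecasts."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def relative_reduction(baseline_mae: float, candidate_mae: float) -> float:
    """Fraction by which candidate_mae is lower than baseline_mae."""
    if baseline_mae <= 0:
        raise ValueError("baseline MAE must be positive")
    return (baseline_mae - candidate_mae) / baseline_mae


# Hypothetical pollutant concentrations (not taken from the paper).
observed = [35.0, 40.0, 38.0, 50.0]
baseline_forecast = [30.0, 46.0, 33.0, 58.0]  # stand-in for a fine-tuned model
adapted_forecast = [33.0, 43.0, 36.0, 54.0]   # stand-in for an adapted model

baseline_mae = mean_absolute_error(observed, baseline_forecast)
adapted_mae = mean_absolute_error(observed, adapted_forecast)
reduction = relative_reduction(baseline_mae, adapted_mae)
print(f"baseline MAE={baseline_mae}, adapted MAE={adapted_mae}, reduction={reduction:.1%}")
```

Under this definition, a "5.2% lower MAE" means the relative reduction equals 0.052, for example a hypothetical baseline MAE of 10.0 against a candidate MAE of 9.48.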
Pages: 22