A Semi-supervised Framework for Simultaneous Classification and Regression of Zero-Inflated Time Series Data with Application to Precipitation Prediction

被引:8
作者
Abraham, Zubin [1 ]
Tan, Pang-Ning [1 ]
机构
[1] Michigan State Univ, Dept Comp Sci, E Lansing, MI 48824 USA
来源
2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009) | 2009年
关键词
Semi-supervised; Zero-Inflated Time Series Data; Simultaneous Classification and Regression;
D O I
10.1109/ICDMW.2009.80
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Time series data with abundant number of zeros are common in many applications, including climate and ecological modeling, disease monitoring, manufacturing defect detection, and traffic accident monitoring. Classical regression models are inappropriate to handle data with such skewed distribution because they tend to underestimate the frequency of zeros and the magnitude of non-zero values in the data. This paper presents a hybrid framework that simultaneously perform classification and regression to accurately predict future values of a zero-inflated time series. A classifier is initially used to determine whether the value at a given time step is zero while a regression model is invoked to estimate its magnitude only if the predicted value has been classified as non-zero. The proposed framework is extended to a semi-supervised learning setting via graph regularization. The effectiveness of the framework is demonstrated via its application to the precipitation prediction problem for climate impact assessment studies.
引用
收藏
页码:644 / 649
页数:6
相关论文
共 20 条
[1]   Modelling spatial zero-inflated continuous data with an exponentially compound Poisson process [J].
Ancelet, Sophie ;
Etienne, Marie-Pierre ;
Benoit, Hugues ;
Parent, Eric .
ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2010, 17 (03) :347-376
[2]  
[Anonymous], 1999, P 16 INT C MACH LEAR
[3]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[4]  
Blum A., 2001, P 18 INT C MACH LEAR, P19
[5]  
*CAN CLIM CHANG SC, ENV CAN
[6]  
Cheng H., 2008, P ACM SIGKDD INT C D
[7]   Semisupervised learning of classifiers: Theory, algorithms, and their application to human-computer interaction [J].
Cohen, I ;
Cozman, FG ;
Sebe, N ;
Cirelo, MC ;
Huang, TS .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (12) :1553-1567
[8]  
Cozman F. G., 2003, P 20 INT C MACH LEAR
[9]  
Cozman FG, 2002, P 15 INT FLOR ART IN, P327
[10]   Downscaling climate model outputs into local and regional weather elements by classification and regression [J].
Enke, W ;
Spekat, A .
CLIMATE RESEARCH, 1997, 8 (03) :195-207