Weighted Machine Learning for Spatial-Temporal Data

被引:13
作者
Hashemi, Mahdi [1 ]
Karimi, Hassan A. [2 ]
机构
[1] George Mason Univ, Dept Informat Sci & Technol, Fairfax, VA 22030 USA
[2] Univ Pittsburgh, Sch Comp & Informat, Pittsburgh, PA 15213 USA
关键词
Machine learning; Training; Correlation; Support vector machines; Kernel; Spatial databases; Bandwidth; Analytical learning; autocorrelation; inductive learning; machine learning; spatial data; temporal data; ALGORITHMS; CLASSIFICATION; MODEL; PATTERNS; SENSOR; TIME;
D O I
10.1109/JSTARS.2020.2995834
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Applying machine learning techniques to spatial-temporal data poses the question that how the recorded location and time for training samples should contribute to the training and testing process. The prior knowledge of how spatial-temporal phenomena are autocorrelated cannot be properly captured by machine learning techniques, which either ignore location and time altogether or consider them as input features. Not to mention that the latter approach leads to slightly increased sparseness of data in the feature space and more free parameters in the predictor; thus, demanding for larger training datasets. We use the prior knowledge about the spatial-temporal autocorrelation to determine how relevant each training sample would be, given its spatial and temporal distances to the irresponsive (unlabeled) sample. Weighted machine learning techniques use this prior knowledge by taking the relevance of training samples with regard to the irresponsive sample into account as training samples' weights. The proposed approach overcomes the aforementioned issues by enriching the training process with the prior knowledge about spatial-temporal autocorrelation. Because the spatial-temporal weight of training samples depends on the irresponsive sample's location and time, the machine needs to be trained separately for each irresponsive sample. However, we show that in practice using only a small subset of training samples with largest spatial-temporal weights not only mitigates the training time but also results in the best accuracy in most cases.
引用
收藏
页码:3066 / 3082
页数:17
相关论文
共 65 条
[21]   Temporal context in floristic classification [J].
Fitzgerald, RW ;
Lees, BG .
COMPUTERS & GEOSCIENCES, 1996, 22 (09) :981-994
[22]  
Flaxman S. R., 2015, THESIS
[23]  
Fotheringham A. S, 2002, Geographically Weighted Regression: the Analysis of Spatially Varying Relationship
[24]  
Fotheringham AS., 2000, Quantitative geography: perspectives on spatial data analysis
[26]  
Gahegan M., 1999, Journal of Geographical Systems, V1, P3
[27]  
Gilardi N., 2003, Mapping radioactivity in the environment: spatial interpolation comparison, V97, P222
[28]  
Gilardi N., 2000, Journal of Geographic Information and Decision Analysis, V4, P11, DOI 10.1.1.384.908
[29]   A Machine Learning Based Spatio-Temporal Data Mining Approach for Detection of Harmful Algal Blooms in the Gulf of Mexico [J].
Gokaraju, Balakrishna ;
Durbha, Surya S. ;
King, Roger L. ;
Younan, Nicolas H. .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2011, 4 (03) :710-720
[30]  
Hashemi M, 2018, Stat Optimization Inform Comput, V6, P497, DOI 10.19139/soic.v6i4.479