Regression applied to symbolic interval-spatial data

被引:1
|
作者
Freitas, Wanessa W. L. [1 ]
de Souza, Renata M. C. R. [1 ]
Amaral, Getulio J. A. [2 ]
de Moraes, Ronei M. [3 ]
机构
[1] Univ Fed Pernambuco, Ctr Informat, Ave Jornalista Anibal Fernandes S-N,Cidade Univ, BR-50740560 Recife, PE, Brazil
[2] Univ Fed Pernambuco, Dept Estat, Ave Jornalista Anibal Fernandes S-N,Cidade Univ, BR-50740560 Recife, PE, Brazil
[3] Univ Fed Paraiba, Dept Estat, BR-58050585 Joao Pessoa, PB, Brazil
关键词
Symbolic data analysis; Interval data; Spatial analysis; Regression; MODEL;
D O I
10.1007/s10489-023-05051-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Symbolic data analysis is a research area related to machine learning and statistics, which provides tools to describe geo-objects, and enables several types of variables to be dealt with, including interval type variables. Moreover, despite the recent progress in understanding symbolic data, there are no studies in the literature that address this type of data in the context of spatial data analysis. Thus, in this paper, we propose two different approaches of the spatial regression model for symbolic interval-valued data. The first fits a linear regression model on the minimum and maximum values of the interval values and the second fits a linear regression model on the center and range values of the interval. In order to evaluate the performance of these approaches, we have performed Monte Carlo simulations in which we calculated the mean value of the performance metric of the models analyzed. Furthermore, we also analyzed two applications involving real data. In the first, we examined the performance of the models in the Brazilian State of Pernambuco. In the second application, we analyzed the performance of the models for the Brazilian Northeastern region. Both applications were related to socioeconomic variables. We observed that in areas with less spatial variability, the interval spatial regression model performs better when compared with a usual method. When considering areas with a higher spatial variability, both ways presented similar results.
引用
收藏
页码:1545 / 1565
页数:21
相关论文
共 50 条
  • [21] Exploratory spatial analysis for interval data: A new autocorrelation index with COVID-19 and rent price applications
    Freitas, Wanessa W. L.
    de Souza, Renata M. C. R.
    Amaral, Getulio J. A.
    De Bastiani, Fernanda
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 195
  • [22] Principal component analysis of interval data: a symbolic data analysis approach
    Carlo N. Lauro
    Francesco Palumbo
    Computational Statistics, 2000, 15 : 73 - 87
  • [23] Principal component analysis of interval data: a symbolic data analysis approach
    Lauro, CN
    Palumbo, F
    COMPUTATIONAL STATISTICS, 2000, 15 (01) : 73 - 87
  • [24] Regression analysis of spatial data
    Beale, Colin M.
    Lennon, Jack J.
    Yearsley, Jon M.
    Brewer, Mark J.
    Elston, David A.
    ECOLOGY LETTERS, 2010, 13 (02) : 246 - 264
  • [25] Searching for Nonlinear Relationships in fMRI Data with Symbolic Regression
    Hughes, James Alexander
    Daley, Mark
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'17), 2017, : 1129 - 1136
  • [26] Data-driven discovery of formulas by symbolic regression
    Sun, Sheng
    Ouyang, Runhai
    Zhang, Bochao
    Zhang, Tong-Yi
    MRS BULLETIN, 2019, 44 (07) : 559 - 564
  • [27] Lasso-constrained regression analysis for interval-valued data
    Giordani, Paolo
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2015, 9 (01) : 5 - 19
  • [28] Nonparametric Regression with Interval-Censored Data
    Deng, Wen Li
    Zheng, Zu Kang
    Zhang, Ri Quan
    ACTA MATHEMATICA SINICA-ENGLISH SERIES, 2014, 30 (08) : 1422 - 1434
  • [29] Adaptive Prediction Interval for Data Stream Regression
    Sun, Yibin
    Pfahringer, Bernhard
    Gomes, Heitor Murilo
    Bifet, Albert
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT III, PAKDD 2024, 2024, 14647 : 130 - 141
  • [30] Linear regression with interval-valued data
    Sun, Yan
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2016, 8 (01): : 54 - 60