Bayesian maximum entropy and data fusion for processing qualitative data: theory and application for crowdsourced cropland occurrences in Ethiopia

被引:0
作者
Patrick Bogaert
Sarah Gengler
机构
[1] Université Catholique de Louvain,Earth and Life Institute, Environmental Sciences
来源
Stochastic Environmental Research and Risk Assessment | 2018年 / 32卷
关键词
Categorical variables; Spatial prediction; Bayesian maximum entropy; Bayesian data fusion; Minimum divergence; Soft information; Cropland; Crowdsourcing;
D O I
暂无
中图分类号
学科分类号
摘要
Categorical data play an important role in a wide variety of spatial applications, while modeling and predicting this type of statistical variable has proved to be complex in many cases. Among other possible approaches, the Bayesian maximum entropy methodology has been developed and advocated for this goal and has been successfully applied in various spatial prediction problems. This approach aims at building a multivariate probability table from bivariate probability functions used as constraints that need to be fulfilled, in order to compute a posterior conditional distribution that accounts for hard or soft information sources. In this paper, our goal is to generalize further the theoretical results in order to account for a much wider type of information source, such as probability inequalities. We first show how the maximum entropy principle can be implemented efficiently using a linear iterative approximation based on a minimum norm criterion, where the minimum norm solution is obtained at each step from simple matrix operations that converges to the requested maximum entropy solution. Based on this result, we show then how the maximum entropy problem can be related to the more general minimum divergence problem, which might involve equality and inequality constraints and which can be solved based on iterated minimum norm solutions. This allows us to account for a much larger panel of information types, where more qualitative information, such as probability inequalities can be used. When combined with a Bayesian data fusion approach, this approach deals with the case of potentially conflicting information that is available. Although the theoretical results presented in this paper can be applied to any study (spatial or non-spatial) involving categorical data in general, the results are illustrated in a spatial context where the goal is to predict at best the occurrence of cultivated land in Ethiopia based on crowdsourced information. The results emphasize the benefit of the methodology, which integrates conflicting information and provides a spatially exhaustive map of these occurrence classes over the whole country.
引用
收藏
页码:815 / 831
页数:16
相关论文
共 175 条
  • [1] Abramov R(2007)A practical computational framework for the multidimensional moment-constrained maximum entropy principle J Comput Phys 211 198-209
  • [2] Abramov R(2010)The multidimensional maximum entropy moment problem : a review on numerical methods Commun Math Sci 8 377-392
  • [3] Allard D(2011)An efficient maximum entropy approach for categorical variable prediction Eur J Soil Sci 62 381-393
  • [4] D’Or D(2005)Maximum entropy and the problem of moments: a stable algorithm Phys Rev E 71 057701-541
  • [5] Froidevaux R(2015)Identification of long-term annual pattern of meteorological drought based on spatiotemporal methods: evaluation of different geostatistical approaches Nat Hazards 76 515-368
  • [6] Bandyopadhyay K(1993)The indicator approach to categorical soil data, I. Theory Eur J Soil Sci 44 361-448
  • [7] Bhattacharya A(2002)Spatial prediction of categorical variables: the Bayesian maximum entropy approach Stoch Environ Res Risk Assess 16 425-177
  • [8] Biswas P(2008)Bayesian maximum entropy prediction of soil categories using a traditional soil map as soft information Eur J Soil Sci 59 166-617
  • [9] Drabold D(1995)Maximum entropy-minimum norm method for the determination of level densities Physica A 220 611-2086
  • [10] Bayat B(2011)A multinomial logistic mixed model for the prediction of categorical spatial data Int J Geogr Inf Sci 25 2017-1799