Model-based analysis of latent factors

被引:1
作者
Gregorius, Hans-Rolf [1 ,2 ]
机构
[1] Inst Populat & Okol Genet, Pfingstanger 58, D-37075 Gottingen, Germany
[2] Univ Gottingen, Abt Forstgenet & Forstpflanzenzuchtung, Busgenweg 2, D-37077 Gottingen, Germany
关键词
MULTILOCUS GENOTYPE DATA; DIFFERENTIATION; POPULATIONS; COMMUNITIES; INFERENCE; GENETICS;
D O I
10.5194/we-18-153-2018
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
The detection of community or population structure through analysis of explicit cause-effect modeling of given observations has received considerable attention. The complexity of the task is mirrored by the large number of existing approaches and methods, the applicability of which heavily depends on the design of efficient algorithms of data analysis. It is occasionally even difficult to disentangle concepts and algorithms. To add more clarity to this situation, the present paper focuses on elaborating the system analytic framework that probably encompasses most of the common concepts and approaches by classifying them as model-based analyses of latent factors. Problems concerning the efficiency of algorithms are not of primary concern here. In essence, the framework suggests an input-output model system in which the inputs are provided as latent model parameters and the output is specified by the observations. There are two types of model involved, one of which organizes the inputs by assigning combinations of potentially interacting factor levels to each observed object, while the other specifies the mechanisms by which these combinations are processed to yield the observations. It is demonstrated briefly how some of the most popular methods (Structure, BAPS, Geneland) fit into the framework and how they differ conceptually from each other. Attention is drawn to the need to formulate and assess qualification criteria by which the validity of the model can be judged. One probably indispensable criterion concerns the cause-effect character of the model-based approach and suggests that measures of association between assignments of factor levels and observations be considered together with maximization of their likelihoods (or posterior probabilities). In particular the likelihood criterion is difficult to realize with commonly used estimates based on Markov chain Monte Carlo (MCMC) algorithms. Generally applicable MCMC-based alternatives that allow for approximate employment of the primary qualification criterion and the implied model validation including further descriptors of model characteristics are suggested.
引用
收藏
页码:153 / 162
页数:10
相关论文
共 50 条
  • [41] The phylogeography debate and the epistemology of model-based evolutionary biology
    Arroyo-Santos, Alfonso
    Olson, Mark E.
    Vergara-Silva, Francisco
    BIOLOGY & PHILOSOPHY, 2014, 29 (06) : 833 - 850
  • [42] Model-based Inference of a Directed Network of Circadian Neurons
    McBride, David
    Petzold, Linda
    JOURNAL OF BIOLOGICAL RHYTHMS, 2018, 33 (05) : 515 - 522
  • [43] Design-based and model-based inference in surveys of freshwater mollusks
    Dorazio, RM
    JOURNAL OF THE NORTH AMERICAN BENTHOLOGICAL SOCIETY, 1999, 18 (01): : 118 - 131
  • [44] Frequentist Model-based Statistical Induction and the Replication Crisis
    Spanos, Aris
    JOURNAL OF QUANTITATIVE ECONOMICS, 2022, 20 (SUPPL 1) : 133 - 159
  • [45] A spatio-temporal model based on discrete latent variables for the analysis of COVID-19 incidence
    Bartolucci, Francesco
    Farcomeni, Alessio
    SPATIAL STATISTICS, 2022, 49
  • [46] A 'post-honeymoon' measles epidemic in Burundi: mathematical model-based analysis and implications for vaccination timing
    Corey, Katelyn C.
    Noymer, Andrew
    PEERJ, 2016, 4
  • [47] Model-Free and Model-Based Policy Evaluation when Causality is Uncertain
    Bruns-Smith, David
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [48] Metabolic model-based analysis of the emergence of bacterial cross-feeding via extensive gene loss
    McNally, Colin P.
    Borenstein, Elhanan
    BMC SYSTEMS BIOLOGY, 2018, 12
  • [49] Spanning latent and observable factors
    Andreou, E.
    Gagliardini, P.
    Ghysels, E.
    Rubin, M.
    JOURNAL OF ECONOMETRICS, 2025, 248
  • [50] Separate encoding of model-based and model-free valuations in the human brain
    Beierholm, Ulrik R.
    Anen, Cedric
    Quartz, Steven
    Bossaerts, Peter
    NEUROIMAGE, 2011, 58 (03) : 955 - 962