Nearly maximally predictive features and their dimensions

被引:19
|
作者
Marzen, Sarah E. [1 ,2 ]
Crutchfield, James P. [3 ]
机构
[1] MIT, Dept Phys, Phys Living Syst Grp, Cambridge, MA 02139 USA
[2] Univ Calif Berkeley, Dept Phys, Berkeley, CA 94720 USA
[3] Univ Calif Davis, Dept Phys, Complex Sci Ctr, One Shields Ave, Davis, CA 95616 USA
基金
美国国家科学基金会;
关键词
INFORMATION-THEORETIC APPROACH; RENEWAL PROCESSES; COMPLEXITY; ENTROPY; IDENTIFIABILITY; DISTRIBUTIONS; CONVERGENCE; COMPUTATION; MECHANICS;
D O I
10.1103/PhysRevE.95.051301
中图分类号
O35 [流体力学]; O53 [等离子体物理学];
学科分类号
070204 ; 080103 ; 080704 ;
摘要
Scientific explanation often requires inferring maximally predictive features from a given data set. Unfortunately, the collection of minimal maximally predictive features for most stochastic processes is uncountably infinite. In such cases, one compromises and instead seeks nearly maximally predictive features. Here, we derive upper bounds on the rates at which the number and the coding cost of nearly maximally predictive features scale with desired predictive power. The rates are determined by the fractal dimensions of a process' mixed-state distribution. These results, in turn, show how widely used finite-order Markov models can fail as predictors and that mixed-state predictive features can offer a substantial improvement.
引用
收藏
页数:6
相关论文
共 6 条
  • [1] Maximally predictive states: From partial observations to long timescales
    Costa, Antonio C.
    Ahamed, Tosif
    Jordan, David
    Stephens, Greg J.
    CHAOS, 2023, 33 (02)
  • [2] NEARLY OPTIMAL CENTRAL LIMIT THEOREM AND BOOTSTRAP APPROXIMATIONS IN HIGH DIMENSIONS
    Chernozhukov, Victor
    Chetverikov, Denis
    Koike, Yuta
    ANNALS OF APPLIED PROBABILITY, 2023, 33 (03) : 2374 - 2425
  • [3] Using Shapley Values and Variational Autoencoders to Explain Predictive Models with Dependent Mixed Features
    Olsen, Lars Henry Berge
    Glad, Ingrid Kristine
    Jullum, Martin
    Aas, Kjersti
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [4] Finding Predictive EEG Complexity Features for Classification of Epileptic and Psychogenic Nonepileptic Seizures using Imperialist Competitive Algorithm
    Ahmadi, Negar
    Carrette, Evelien
    Aldenkamp, Albert P.
    Pechenizkiy, Mykola
    2018 31ST IEEE INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS 2018), 2018, : 164 - 169
  • [5] MA-DG: Learning Features of Sequences in Different Dimensions for Min-Entropy Evaluation via 2D-CNN and Multi-Head Self-Attention
    Huang, Yilong
    Fan, Fan
    Huang, Chaofeng
    Yang, Haomiao
    Gu, Min
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 7879 - 7894
  • [6] Home is where my villa is: a machine learning-based predictive suitability map for Roman features in Northern Noricum (ca. 50-500 CE/Lower Austria/AUT)
    Hagmann, Dominik
    JOURNAL OF MAPS, 2025, 21 (01):