Structured Neural Networks for Density Estimation and Causal Inference

被引:0
作者
Chen, Asic [1 ]
Shi, Ruian [1 ]
Gao, Xiang [1 ]
Baptista, Ricardo [2 ]
Krishnan, Rahul G. [1 ]
机构
[1] Univ Toronto, Vector Inst, Toronto, ON, Canada
[2] CALTECH, Pasadena, CA USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Injecting structure into neural networks enables learning functions that satisfy invariances with respect to subsets of inputs. For instance, when learning generative models using neural networks, it is advantageous to encode the conditional independence structure of observed variables, often in the form of Bayesian networks. We propose the Structured Neural Network (StrNN), which injects structure through masking pathways in a neural network. The masks are designed via a novel relationship we explore between neural network architectures and binary matrix factorization, to ensure that the desired independencies are respected. We devise and study practical algorithms for this otherwise NP-hard design problem based on novel objectives that control the model architecture. We demonstrate the utility of StrNN in three applications: (1) binary and Gaussian density estimation with StrNN, (2) real-valued density estimation with Structured Autoregressive Flows (StrAFs), autoregressive normalizing flows that leverage StrNN as a conditioner, and (3) interventional and counterfactual analysis with StrAFs. Our work opens up new avenues for learning neural networks that enable data-efficient generative modeling and the use of normalizing flows for causal effect estimation.
引用
收藏
页数:13
相关论文
共 46 条
[1]  
[Anonymous], 2008, INT C MACH LEARN
[2]  
Balazadeh Vahid, 2022, arXiv
[3]  
Balgi S, 2022, AAAI CONF ARTIF INTE, P11810
[4]  
Balgi S, 2022, Arxiv, DOI [arXiv:2202.09391, DOI 10.48550/ARXIV.2202.09391]
[5]  
Chen Ricky T. Q., 2018, Advances in Neural Information Processing Systems, V31
[6]   DETERRENT: Knowledge Guided Graph Attention Network for Detecting Healthcare Misinformation [J].
Cui, Limeng ;
Seo, Haeseung ;
Tabar, Maryam ;
Ma, Fenglong ;
Wang, Suhang ;
Lee, Dongwon .
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, :492-502
[7]  
Dan C, 2017, Arxiv, DOI arXiv:1511.01699
[8]  
Ding Wenhao, 2023, PMLR, P812
[9]  
Drouin Alexandre, 2020, Advances in Neural Information Processing Systems, V33, P21865
[10]   Structure Learning in Graphical Modeling [J].
Drton, Mathias ;
Maathuis, Marloes H. .
ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 4, 2017, 4 :365-393