Learning Bayesian network parameters with soft-hard constraints

Cited by: 1

Authors:

Ru, Xinxin [1]

Gao, Xiaoguang [1]

Wang, Yangyang [1]

Liu, Xiaohan [1]

Affiliations:

[1] Northwestern Polytech Univ, Sch Elect & Informat, Dongxiang Rd, Xian 710072, Shaanxi, Peoples R China

Source:

NEURAL COMPUTING & APPLICATIONS | 2022, Vol. 34, Issue 20

Funding:

National Natural Science Foundation of China

Keywords:

Bayesian networks; Parameter learning; Overfitting; Underfitting; Maximum likelihood

DOI:

10.1007/s00521-022-07429-5

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In Bayesian network parameter learning, accurate parameters are difficult to obtain when the data are insufficient, and overfitting occurs easily. Conversely, underfitting tends to occur when the learned parameters adhere blindly to the constraints generated from expert knowledge. This paper proposes a soft-hard constraint parameter learning method to balance overfitting and underfitting. Constraints applied to the parameters are called hard constraints, while those applied to the prior are called soft constraints. For the soft constraint, a partial maximum entropy prior quantization method is proposed to obtain proper parameters. For the hard constraint, an equivalent sample threshold is proposed to limit the hyperparameters according to the actual data size. To combine soft and hard constraints effectively, a soft constraint maximization and hard constraint minimization model is proposed. Experimental results show that the method significantly improves the accuracy of parameter learning and balances overfitting and underfitting.
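The overfitting and underfitting tension described in the abstract can be illustrated with a generic sketch. This is not the paper's soft-hard constraint method: it assumes a standard Dirichlet prior on one column of a conditional probability table, and the function names, the example counts, and the expert prior are all hypothetical. The equivalent sample size `ess` controls how strongly the estimate is pulled toward the prior.

```python
import numpy as np

def mle(counts):
    """Maximum-likelihood estimate: normalized observed counts."""
    counts = np.asarray(counts, dtype=float)
    return counts / counts.sum()

def dirichlet_posterior_mean(counts, prior_mean, ess):
    """Posterior mean under a Dirichlet prior with hyperparameters
    alpha = ess * prior_mean (ess = equivalent sample size)."""
    counts = np.asarray(counts, dtype=float)
    alpha = ess * np.asarray(prior_mean, dtype=float)
    return (counts + alpha) / (counts.sum() + ess)

# Hypothetical data: four observations of a three-state variable.
counts = [3, 0, 1]
# Hypothetical expert prior over the same three states.
prior_mean = [0.5, 0.3, 0.2]

# With few samples, the MLE assigns probability 0 to the unseen state,
# which is a typical symptom of overfitting.
print("MLE:", mle(counts))

# A small ess stays close to the data; a very large ess reproduces the
# prior almost exactly and ignores the data (a form of underfitting).
for ess in (1.0, 4.0, 1000.0):
    print("ess =", ess, "->", dirichlet_posterior_mean(counts, prior_mean, ess))
```

Limiting `ess` relative to the number of observed samples is one simple way to keep the prior from dominating; the abstract's equivalent sample threshold serves a related purpose, but its exact definition is not given here.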

Pages: 18195-18209

Page count: 15
