Capturing the form of feature interactions in black-box models

被引:3
作者
Zhang, Hanying [1 ,2 ]
Zhang, Xiaohang [1 ,2 ]
Zhang, Tianbo [3 ]
Zhu, Ji [4 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Econ & Management, Beijing, Peoples R China
[2] Beijing Univ Posts & Telecommun, Key Lab Trustworthy Distributed Comp & Serv, Beijing, Peoples R China
[3] Univ Washington, Dept Math, Seattle, WA USA
[4] Univ Michigan, Dept Stat, Ann Arbor, MI USA
基金
中国国家自然科学基金;
关键词
Model interpretation; Feature interaction; Product separability; Black-box; PERFORMANCE; FIND;
D O I
10.1016/j.ipm.2023.103373
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Detecting feature interactions is an important post-hoc method to explain black-box models. The literature on feature interactions mainly focus on detecting their existence and calculating their strength. Little attention has been given to the form how the features interact. In this paper, we propose a novel method to capture the form of feature interactions. First, the feature interaction sets in black-box models are detected by the high dimensional model representation-based method. Second, the pairwise separability of the detected feature interactions is determined by a novel model which is verified theoretically. Third, the set separability of the feature interactions is inferred based on pairwise separability. Fourth, the interaction form of each feature in product separable sets is explored. The proposed method not only provides detailed information about the internal structure of black-box models but also improves the performance of linear models by incorporating the appropriate feature interactions. The experimental results show that the accuracy of recognizing product separability in synthetic models is 100%. Experiments on three regression and three classification tasks demonstrate that the proposed method can capture the product separable form of feature interactions effectively and improve the prediction accuracy greatly.
引用
收藏
页数:16
相关论文
共 41 条
[31]  
Sorokina D., 2008, P 25 INT C MACH LEAR, V307, P1000, DOI [10.1145/1390156.1390282, DOI 10.1145/1390156.1390282]
[32]  
Tarjan R., 1972, SIAM Journal on Computing, V1, P146, DOI 10.1137/0201010
[33]   FACTORIZABILITY CONDITIONS FOR MULTIDIMENSIONAL POLYNOMIALS [J].
THEODOROU, NJ ;
TZAFESTAS, SG .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1985, 30 (07) :697-700
[34]   Accurate quantitative estimation of energy performance of residential buildings using statistical machine learning tools [J].
Tsanas, Athanasios ;
Xifara, Angeliki .
ENERGY AND BUILDINGS, 2012, 49 :560-567
[35]  
Tsang M., 2018, INT C LEARN REPR, P1
[36]   Necessary and sufficient conditions for a function to be separable [J].
Viazminsky, C. P. .
APPLIED MATHEMATICS AND COMPUTATION, 2008, 204 (02) :658-670
[37]   Modeling of strength of high-performance concrete using artificial neural networks [J].
Yeh, IC .
CEMENT AND CONCRETE RESEARCH, 1998, 28 (12) :1797-1808
[38]   Graph convolutional network with sample and feature weights for Alzheimer's disease diagnosis [J].
Zeng, Lu ;
Li, Hengxin ;
Xiao, Tingsong ;
Shen, Fumin ;
Zhong, Zhi .
INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (04)
[39]   Revealing the structure of prediction models through feature interaction detection [J].
Zhang, Xiaohang ;
Zhang, Hanying ;
Zhu, Ji ;
Li, Zhengren .
KNOWLEDGE-BASED SYSTEMS, 2022, 236
[40]   A Survey on Neural Network Interpretability [J].
Zhang, Yu ;
Tino, Peter ;
Leonardis, Ales ;
Tang, Ke .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2021, 5 (05) :726-742