Decomposition-Based Classifier Chains for Multi-Dimensional Classification

被引:8
作者
Jia B.-B. [1 ,2 ]
Zhang M.-L. [1 ,3 ]
机构
[1] School of Computer Science and Engineering, Southeast University, Nanjing
[2] College of Electrical and Information Engineering, Lanzhou University of Technology, Lanzhou
[3] Key Laboratory of Computer Network and Information Integration, Ministry of Education, Southeast University, Nanjing
来源
IEEE Transactions on Artificial Intelligence | 2022年 / 3卷 / 02期
关键词
Class dependencies; classifier chains (CC); machine learning; multi-dimensional classification (MDC);
D O I
10.1109/TAI.2021.3110935
中图分类号
学科分类号
摘要
In multi-dimensional classification, the semantics of objects are characterized by multiple class variables from different dimensions. To model the dependencies among class variables, one natural strategy is to build a number of multiclass classifiers in a chaining structure, one per dimension, where the subsequent classifiers on the chain augment the feature space with all labeling information used by the preceding classifiers. However, it is shown that this strategy cannot compete with existing state-of-the-art approaches via comparative studies. One possible reason is that inaccurate predictions of preceding classifiers would degenerate the performance of subsequent ones. Besides, it is more difficult to learn a multiclass classifier than a binary one with the same accuracy, and better performance can be expected if the multi-dimensional classification problem can be solved by building multiple binary classifiers in a chaining structure. Based on these conjectures, this article proposes an approach, which builds a chain of binary classifiers to solve the multi-dimensional classification problem with the help of one-versus-one decomposition. To address the issue that different one-versus-one decomposed problems involve different training examples, the feature space is augmented with the binary predictions of preceding classifiers on the chain to train the subsequent ones. To alleviate the effect of the specified chaining order, the ensemble version of the proposed approach is further investigated. Comparative studies over 20 benchmark datasets clearly show the superiority of the proposed approach against the state-of-the-art multi-dimensional classification baselines. © 2020 IEEE.
引用
收藏
页码:176 / 191
页数:15
相关论文
共 36 条
[1]  
Al Muktadir A.H., Miyazawa T., Martinez-Julia P., Harai H., Kafle V.P., Multi-target classification based automatic virtual resource allocation scheme, IEICE Trans. Inf. Syst., 102, 5, pp. 898-909, (2019)
[2]  
Batal I., Hong C., Hauskrecht M., An efficient probabilistic framework for multi-dimensional classification, Proc. 22nd ACM Int. Conf. Inf. Knowl. Manage., pp. 2417-2422, (2013)
[3]  
Bielza C., Li G., Larranaga P., Multi-dimensional classification with Bayesian networks, Int. J. Approx. Reason., 52, 6, pp. 705-727, (2011)
[4]  
Borchani H., Bielza C., Toro C., Larranaga P., Predicting human immunodeficiency virus inhibitors usingmulti-dimensional Bayesian network classifiers, Artif. Intell. Med., 57, 3, pp. 219-229, (2013)
[5]  
Breiman L., Bagging predictors, Mach. Learn., 24, pp. 123-140, (1996)
[6]  
Demsar J., Statistical comparisons of classifiers over multiple data sets, J. Mach. Learn. Res., 7, pp. 1-30, (2006)
[7]  
Deng L., He X., Gao J., Deep stacking networks for information retrieval, Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., pp. 3153-3157, (2013)
[8]  
Dietterich T.G., Bakiri G., Solving multiclass learning problems via error-correcting output codes, J. Artif. Intell. Res., 2, pp. 263-286, (1995)
[9]  
Duan K.-B., Keerthi S.S., Which is the bestmulticlass SVM method? An empirical study, Proc. Int. Workshop Multiple Classifier Syst., pp. 278-285, (2005)
[10]  
Fan R., Chang K., Hsieh C., Wang X., Lin C.-J., LIBLINEAR: A library for large linear classification, J. Mach. Learn. Res., 9, pp. 1871-1874, (2008)