Rank Entropy-Based Decision Trees for Monotonic Classification

被引:148
作者
Hu, Qinghua [1 ]
Che, Xunjian [1 ]
Zhang, Lei [2 ]
Zhang, David [2 ,3 ]
Guo, Maozu [1 ]
Yu, Daren [1 ]
机构
[1] Harbin Inst Technol, Harbin 150001, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
[3] Hong Kong Polytech Univ, Biometr Technol Ctr UGC CRC, Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Monotonic classification; rank entropy; rank mutual information; decision tree; MULTIATTRIBUTE UTILITY-THEORY; ROUGH APPROXIMATION; METHODOLOGY; SETS;
D O I
10.1109/TKDE.2011.149
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many decision making tasks, values of features and decision are ordinal. Moreover, there is a monotonic constraint that the objects with better feature values should not be assigned to a worse decision class. Such problems are called ordinal classification with monotonicity constraint. Some learning algorithms have been developed to handle this kind of tasks in recent years. However, experiments show that these algorithms are sensitive to noisy samples and do not work well in real-world applications. In this work, we introduce a new measure of feature quality, called rank mutual information (RMI), which combines the advantage of robustness of Shannon's entropy with the ability of dominance rough sets in extracting ordinal structures from monotonic data sets. Then, we design a decision tree algorithm (REMT) based on rank mutual information. The theoretic and experimental analysis shows that the proposed algorithm can get monotonically consistent decision trees, if training samples are monotonically consistent. Its performance is still good when data are contaminated with noise.
引用
收藏
页码:2052 / 2064
页数:13
相关论文
共 41 条
[1]  
[Anonymous], 2014, C4. 5: programs for machine learning
[2]  
[Anonymous], 2002, 14 BELG DUTCH C ART
[3]  
[Anonymous], METRIKA
[4]  
Ben-David A., 1989, Computational Intelligence, V5, P45, DOI 10.1111/j.1467-8640.1989.tb00314.x
[5]   Adding monotonicity to learning algorithms may impair their accuracy [J].
Ben-David, Arie ;
Sterling, Leon ;
Tran, TriDat .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) :6627-6634
[6]   AUTOMATIC-GENERATION OF SYMBOLIC MULTIATTRIBUTE ORDINAL KNOWLEDGE-BASED DSSS - METHODOLOGY AND APPLICATIONS [J].
BENDAVID, A .
DECISION SCIENCES, 1992, 23 (06) :1357-1372
[7]   MONOTONICITY MAINTENANCE IN INFORMATION-THEORETIC MACHINE LEARNING ALGORITHMS [J].
BENDAVID, A .
MACHINE LEARNING, 1995, 19 (01) :29-43
[8]  
Bioch J.C., 2000, P 12 BELG DUTCH ART, P85
[9]  
Bioch J.C., 2002, P 12 BELG DUTCH C MA, P3
[10]   An Information-Theoretic Foundation for the Measurement of Discrimination Information [J].
Cai, Di .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (09) :1262-1273