On improving efficiency of SLIQ decision tree algorithm

Cited by: 4

Authors:

Chandra, B. [1]

Varghese, P. Paul [1]

Affiliation:

[1] Indian Inst Technol, Dept Math, New Delhi 110016, India

Source:

2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6 | 2007

DOI:

10.1109/IJCNN.2007.4370932

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Decision trees have been widely used for classification in data mining, and a number of decision tree algorithms have been developed in the past. The SLIQ algorithm [2] was developed with the aim of reducing the diversity of the decision tree at each split. However, the number of split points that need to be examined while building the decision tree becomes enormous, because the SLIQ algorithm evaluates the Gini index at every successive midpoint of attribute values. The paper proposes a novel approach that tackles this problem by greatly reducing the number of split points, in order to improve the performance of the SLIQ algorithm. The improved performance is shown on a large number of benchmark datasets taken from the UCI Machine Learning Repository.
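To make the cost described in the abstract concrete, the following is a minimal sketch of the baseline it refers to: for one numeric attribute, the weighted Gini index is evaluated at the midpoint between every pair of successive distinct sorted values. The function names `gini` and `best_midpoint_split` and the toy data are hypothetical, chosen only for illustration. The sketch does not implement the reduced-split-point method the paper proposes, since the abstract does not describe it.

```python
from collections import Counter


def gini(labels):
    """Gini index 1 - sum_c p_c^2 of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def best_midpoint_split(values, labels):
    """Exhaustive midpoint search for one numeric attribute (illustrative only).

    Returns (threshold, weighted_gini, n_evaluated), where n_evaluated is the
    number of candidate split points that were scored.
    """
    pairs = sorted(zip(values, labels))
    n = len(pairs)
    best_threshold, best_score = None, float("inf")
    evaluated = 0
    for i in range(1, n):
        low, high = pairs[i - 1][0], pairs[i][0]
        if low == high:
            continue  # equal values cannot be separated by a threshold
        threshold = (low + high) / 2
        left = [label for _, label in pairs[:i]]
        right = [label for _, label in pairs[i:]]
        score = (len(left) * gini(left) + len(right) * gini(right)) / n
        evaluated += 1
        if score < best_score:
            best_threshold, best_score = threshold, score
    return best_threshold, best_score, evaluated


# Hypothetical toy data: six records, two classes.
values = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
labels = ["a", "a", "a", "b", "b", "b"]
print(best_midpoint_split(values, labels))
```

With d distinct values of an attribute at a node, this search scores d - 1 candidate thresholds, and it is repeated for every numeric attribute at every node. That growth is the quantity the abstract says the paper aims to reduce.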

Pages: 66-71

Page count: 6

References (7 in total):

[1] Chandra B., 2002, WISE 2002: Proceedings of the Third International Conference on Web Information Systems Engineering (Workshops), P160

[2] Mehta M., 1996, EXTENDING DATABASE T, P18

[3] Quinlan J. R., 1986, Machine Learning, V1, P81, DOI 10.1023/A:1022643204877

[4] Quinlan J. R., 2014, C4.5: Programs for Machine Learning

[5] Quinlan J. R., 1996, Improved use of continuous attributes in C4.5, Journal of Artificial Intelligence Research, V4, P77-90

[6] Rastogi R., 1998, Proceedings of the Twenty-Fourth International Conference on Very-Large Databases, P404

[7] Shafer J. C., 1996, P 24 INT C VER LARG
