Rough Set based Attribute Clustering for Sample Classification of Gene Expression Data

被引：8

作者：

Nayak, Rudra Kalyan ^{[1
]}

Mishra, Debahuti ^{[1
]}

Shaw, Kailash ^{[2
]}

Mishra, Sashikala ^{[1
]}

机构：

[1] Siksha O Anusandhan Deemed Univ, Inst Tech Educ & Res, Bhubaneswar, Odisha, India

[2] Gandhi Engn Coll, Bhubaneswar, Orissa, India

来源：

INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING | 2012年 / 38卷

关键词：

Attribute clustering; Clustering; Rough set theory; k-means; k-medoids; DBSCAN;

D O I：

10.1016/j.proeng.2012.06.219

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Attribute clustering is one of the unsupervised data mining applications which have been previously used to identify statistical dependence between subsets of variables where the attributes within the same cluster have high similarity, but within different clusters have high dissimilarity. In this paper, we focus our discussion on the rough set theory for attribute clustering. Rough set theory is a theory adopted to deal with rough and uncertain knowledge, which analyzes the clusters and finds the data principles when previous knowledge is not available, providing a new method for data classification. Although there are numerous methods of rough set and cluster analysis, as the data objects are changing continuously, we have to improve these relevant technologies over time, and propose creative theory in response, meeting the demands of application. Lastly, the experimental result of our proposed algorithm Rough Set based attribute Clustering for Sample Classification (RSCSC) is compared with some of the traditional attribute clustering methods and it is proved to be efficient in finding the meaningful, feasible and compact patterns. (C) 2012 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of Noorul Islam Centre for Higher Education

引用

页码：1788 / 1792

页数：5

共 16 条

[1] ROUGH FUZZY-SETS AND FUZZY ROUGH SETS [J].

DUBOIS, D ;

PRADE, H .

INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 1990, 17 (2-3) :191-209

[2] A METHOD FOR COMPARING 2 HIERARCHICAL CLUSTERINGS [J].

FOWLKES, EB ;

MALLOWS, CL .

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1983, 78 (383) :553-569

[3] Information-preserving hybrid data reduction based on fuzzy-rough techniques [J].

Hu, QH ;

Yu, DR ;

Xie, ZX .

PATTERN RECOGNITION LETTERS, 2006, 27 (05) :414-423

[4] Extensions to the k-means algorithm for clustering large data sets with categorical values [J].

Huang, ZX .

DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (03) :283-304

[5] Fuzzy-rough sets assisted attribute selection [J].

Jensen, Richard ;

Shen, Qiang .

IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2007, 15 (01) :73-89

[6] Cluster analysis for gene expression data: A survey [J].

Jiang, DX ;

Tang, C ;

Zhang, AD .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (11) :1370-1386

[7] Density-based clustering [J].

Kriegel, Hans-Peter ;

Kroeger, Peer ;

Sander, Joerg ;

Zimek, Arthur .

WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (03) :231-240

[8]

Li D., 2010, JDCTA, V4, P96

[9]

Mishra D., 2011, P 2011 INT C COMM CO, P307

[10]

Ng R.T., 1994, Proceedings of the 20th International Conference on Very Large Data Bases, VLDB '94, P144

← 1 2 →