Classification of Microarray Gene Expression Data using Associative Classification

被引:0
作者
Alagukumar, S. [1 ]
Lawrance, R. [1 ]
机构
[1] Ayya Nadar Janaki Ammal Coll, Dept Comp Applicat, Sivakasi 626124, India
来源
2016 INTERNATIONAL CONFERENCE ON COMPUTING TECHNOLOGIES AND INTELLIGENT DATA ENGINEERING (ICCTIDE'16) | 2016年
关键词
Microarray; Data Mining; Associative Classification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data mining is one of the interdisciplinary fields on the research aspect. Recently, association rules have become an important concept for classification purposes, called Associative Classification. It is a data mining technique that combines association rule mining and classification technique to build classification models. The core objective of this work is to classify the gene expression using Associative Classification algorithm. Our research work has been formulated based on association rule and classification mining. The proposed algorithm contains four phases called as Statistical Gene filtering, Discretization, Class Association Rules and prediction or class assignment. The gene filtering phase is to find the differentially expressed genes and select the significant genes in the specific gene expression. The discretization phase is to convert the continuous values into discrete values and substitute the gene values into gene intervals. The role of the Class Association Rule phase is to generate the set of class association rules using closed frequent itemset and generate the classifier model. The last phase predicts the class from trained model using scoring function. The experimental results carried out by using breast cancer gene expression data which are available on the NCBI online biological database. It has been tested with two class and multi-class datasets and compared with the classical classification algorithms such as Linear Discriminant Analysis, SVM, and Decision Tree. The performance of the classifier model is evaluated using Leave-One-Out-Cross Validation method. The result of this work is used to the drug designer for the pathway analysis and disease treatment decisions.
引用
收藏
页数:8
相关论文
共 17 条
[1]  
Agrawal R, 2000, P 20 INT C VER LARG
[2]   Suite of decision tree-based classification algorithms on cancer gene expression data [J].
Al Snousy, Mohmad Badr ;
El-Deeb, Hesham Mohamed ;
Badran, Khaled ;
Al Khlil, Ibrahim Ali .
EGYPTIAN INFORMATICS JOURNAL, 2011, 12 (02) :73-82
[3]   A Selective Analysis of Microarray Data using Association Rule Mining [J].
Alagukumar, S. ;
Lawrance, R. .
GRAPH ALGORITHMS, HIGH PERFORMANCE IMPLEMENTATIONS AND ITS APPLICATIONS (ICGHIA 2014), 2015, 47 :3-12
[4]   Gene association analysis: a survey of frequent pattern mining from gene expression data [J].
Alves, Ronnie ;
Rodriguez-Baena, Domingo S. ;
Aguilar-Ruiz, Jesus S. .
BRIEFINGS IN BIOINFORMATICS, 2010, 11 (02) :210-224
[5]  
[Anonymous], 2015, INT J COMPUT APPL
[6]  
[Anonymous], 1998, BDA 1998 INT C ADV D
[7]  
[Anonymous], CLIN CANC RES
[8]   Mining gene expression databases for association rules [J].
Creighton, C ;
Hanash, S .
BIOINFORMATICS, 2003, 19 (01) :79-86
[9]  
Dash Rajashree., 2011, International Journal of Advances in Science and Technology, V2, P29
[10]   A Survey of Discretization Techniques: Taxonomy and Empirical Analysis in Supervised Learning [J].
Garcia, Salvador ;
Luengo, Julian ;
Antonio Saez, Jose ;
Lopez, Victoria ;
Herrera, Francisco .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (04) :734-750