An Incremental Linear Programming Based Tool for Analyzing Gene Expression Data

Cited by: 0
Authors
Panigrahi, Satish Chandra [1 ]
Alam, Md Shafiul [1 ]
Mukhopadhyay, Asish [1 ]
Affiliations
[1] Univ Windsor, Sch Comp Sci, Windsor, ON N9B 3P4, Canada
Source
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2013, PT V | 2013 / Vol. 7975
Keywords
Gene expression analysis; DNA microarrays; linear separation; tissue classification; TIME ALGORITHMS; CLASSIFICATION; CANCER; PREDICTION; TUMOR;
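DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract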
The availability of large volumes of gene expression data from microarray analysis (cDNA and oligonucleotide) has opened a new door to the diagnosis and treatment of various diseases based on gene expression profiling. In this paper, we discuss a new profiling tool based on linear programming. Given gene expression data from two subclasses of the same disease (e.g. leukemia), we are able to determine efficiently whether the samples are linearly separable with respect to triplets of genes. This was left as an open problem in an earlier study that considered only pairs of genes as linear separators. Our tool comes in two versions: offline and incremental. Tests show that the incremental version is markedly more efficient than the offline one. This paper also introduces a gene selection strategy that exploits the class distinction property of a gene through separability tests on pairs and triplets of genes. We applied our gene selection strategy to four publicly available gene expression data sets. Our experiments show that gene spaces generated by our method achieve similar or even better classification accuracy than the gene spaces generated by t-values, FCS (Fisher Criterion Score) and SAM (Significance Analysis of Microarrays).
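The separability question in the abstract can be read as a feasibility problem: two sample sets are strictly linearly separable with respect to a gene triplet if some weights w and offset b satisfy w·x + b >= 1 for every sample of one subclass and w·x + b <= -1 for every sample of the other. The sketch below illustrates only that formulation. It is not the authors' offline or incremental linear programming tool: as a stand-in it decides feasibility by Fourier-Motzkin elimination with exact rationals, which is practical only for very small inputs. All function names and the toy data are hypothetical.

```python
from fractions import Fraction
from itertools import combinations


def feasible(rows, n_vars):
    """Decide whether {x : coeffs . x >= rhs for every row} is non-empty.

    Stand-in solver: Fourier-Motzkin elimination with exact rationals.
    The number of rows can grow quickly, so keep inputs tiny.
    """
    rows = {(tuple(Fraction(c) for c in coeffs), Fraction(rhs))
            for coeffs, rhs in rows}
    for k in range(n_vars):
        pos = [r for r in rows if r[0][k] > 0]   # lower bounds on x_k
        neg = [r for r in rows if r[0][k] < 0]   # upper bounds on x_k
        rows = {r for r in rows if r[0][k] == 0}
        for pc, pr in pos:
            for nc, nr in neg:
                # Scale so the x_k coefficients are +1 and -1, then add.
                s, t = 1 / pc[k], 1 / (-nc[k])
                coeffs = tuple(s * a + t * b for a, b in zip(pc, nc))
                rows.add((coeffs, s * pr + t * nr))
    # Every remaining row reads 0 >= rhs.
    return all(rhs <= 0 for _, rhs in rows)


def linearly_separable(class_a, class_b):
    """True if some (w, b) gives w.x + b >= 1 on class_a and <= -1 on class_b."""
    dim = len(class_a[0])
    rows = [(list(p) + [1], 1) for p in class_a]
    rows += [([-v for v in p] + [-1], 1) for p in class_b]
    return feasible(rows, dim + 1)  # unknowns: w_1..w_dim and b


def separating_triplets(samples_a, samples_b):
    """Return every gene-index triplet whose 3-D projection separates the classes."""
    n_genes = len(samples_a[0])
    hits = []
    for genes in combinations(range(n_genes), 3):
        proj_a = [[s[g] for g in genes] for s in samples_a]
        proj_b = [[s[g] for g in genes] for s in samples_b]
        if linearly_separable(proj_a, proj_b):
            hits.append(genes)
    return hits


# Toy data (hypothetical): rows are samples, columns are four genes.
# Genes 0 and 1 form an XOR pattern, gene 2 differs by class, gene 3 is constant.
toy_a = [[0, 0, 0, 0], [1, 1, 0, 0]]
toy_b = [[0, 1, 1, 0], [1, 0, 1, 0]]
print(separating_triplets(toy_a, toy_b))  # [(0, 1, 2), (0, 2, 3), (1, 2, 3)]
```

In this toy data every triplet that contains gene 2 separates the classes, while (0, 1, 3) does not, because gene 3 is constant and genes 0 and 1 alone form an XOR pattern. An exhaustive scan like this tests all C(n, 3) triplets, which is why an efficient per-triplet test matters at microarray scale.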
Pages: 48 - 64
Number of pages: 17
Related papers
50 in total
  • [31] A tool for gene expression based PubMed search through combining data sources
    Korotkiy, M
    Middelburg, R
    Dekker, H
    van Harmelen, F
    Lankelma, J
    BIOINFORMATICS, 2004, 20 (12) : 1980 - 1982
  • [32] Gene Expression Programming based on simulated annealing
    Jiang, SW
    Cai, ZH
    Zeng, D
    Liu, YD
    Li, Q
    2005 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING PROCEEDINGS, VOLS 1 AND 2, 2005, : 1218 - 1221
  • [33] Multiobjective optimization based on gene expression programming
    Xiang, Yong
    Tang, Chang-Jie
    Zeng, Tao
    Liu, Yin-Tian
    Qiao, Shao-Jie
    Sichuan Daxue Xuebao (Gongcheng Kexue Ban)/Journal of Sichuan University (Engineering Science Edition), 2007, 39 (04): : 124 - 129
  • [34] Function Finding based on Gene Expression Programming
    Mo, Haifang
    Wang, Jiangqing
    Qin, Jun
    Kang, Lishan
    SECOND INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING: WGEC 2008, PROCEEDINGS, 2008, : 70 - +
  • [35] Linear Separability of Gene Expression Data Sets
    Unger, Giora
    Chor, Benny
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2010, 7 (02) : 375 - 381
  • [36] Biclustering of Linear Patterns In Gene Expression Data
    Gao, Qinghui
    Ho, Christine
    Jia, Yingmin
    Li, Jingyi Jessica
    Huang, Haiyan
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (06) : 619 - 631
  • [37] A relational data mining tool based on genetic programming
    Martin, L
    Moal, F
    Vrain, C
    PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 1510 : 130 - 138
  • [38] Linear programming for phylogenetic reconstruction based on gene rearrangements
    Tang, JJ
    Moret, BME
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2005, 3537 : 406 - 416
  • [39] A crowdsourcing approach for reusing and meta analyzing gene expression data
    Shah, Naisha
    Guo, Yongjian
    Wendelsdorf, Katherine V.
    Lu, Yong
    Sparks, Rachel
    Tsang, John S.
    NATURE BIOTECHNOLOGY, 2016, 34 (08) : 803 - 806
  • [40] Analyzing gene expression data in mice with the Neuro Behavior Ontology
    Hoehndorf, Robert
    Hancock, John M.
    Hardy, Nigel W.
    Mallon, Ann-Marie
    Schofield, Paul N.
    Gkoutos, Georgios V.
    MAMMALIAN GENOME, 2014, 25 (1-2) : 32 - 40