Mining Gene Expression Data Focusing Cancer Therapeutics: A Digest

被引:19
|
作者
Jauhari, Shaurya [1 ]
Rizvi, S. A. M. [1 ]
机构
[1] Jamia Millia Islamia, Dept Comp Sci, New Delhi 110025, India
关键词
Association rules; cancer; classification; clustering; data mining; gene expression data; gene therapy; epigenetics; next generation sequencing; clinicopathology; CLUSTER-ANALYSIS;
D O I
10.1109/TCBB.2014.2312002
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
An understanding towards genetics and epigenetics is essential to cope up with the paradigm shift which is underway. Personalized medicine and gene therapy will confluence the days to come. This review highlights traditional approaches as well as current advancements in the analysis of the gene expression data from cancer perspective. Due to improvements in biometric instrumentation and automation, it has become easier to collect a lot of experimental data in molecular biology. Analysis of such data is extremely important as it leads to knowledge discovery that can be validated by experiments. Previously, the diagnosis of complex genetic diseases has conventionally been done based on the non-molecular characteristics like kind of tumor tissue, pathological characteristics, and clinical phase. The microarray data can be well accounted for high dimensional space and noise. Same were the reasons for ineffective and imprecise results. Several machine learning and data mining techniques are presently applied for identifying cancer using gene expression data. While differences in efficiency do exist, none of the well-established approaches is uniformly superior to others. The quality of algorithm is important, but is not in itself a guarantee of the quality of a specific data analysis.
引用
收藏
页码:533 / 547
页数:15
相关论文
共 50 条
  • [41] Data Mining in Cancer Research
    Lisboa, Paulo J. G.
    Vellido, Alfredo
    Tagliaferri, Roberto
    Napolitano, Francesco
    Ceccarelli, Michele
    Martin-Guerrero, Jose D.
    Biganzoli, Elia
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2010, 5 (01) : 14 - 18
  • [42] Performance evaluation of Gene Expression Programming for hydraulic data mining
    Eldrandaly, Khalid
    Negm, Abdel-Azim
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2008, 5 (02) : 126 - 131
  • [43] Frequent Weighted Itemset Mining from Gene Expression Data
    Baralis, Elena
    Cagliero, Luca
    Cerquitelli, Tania
    Chiusano, Silvia
    Garza, Paolo
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2013,
  • [44] Evolving Multiobjective Cancer Subtype Diagnosis From Cancer Gene Expression Data
    Wang, Yunhe
    Ma, Zhiqiang
    Wong, Ka-Chun
    Li, Xiangtao
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (06) : 2431 - 2444
  • [45] A Comprehensive Survey of Recent Hybrid Feature Selection Methods in Cancer Microarray Gene Expression Data
    Almazrua, Halah
    Alshamlan, Hala
    IEEE ACCESS, 2022, 10 : 71427 - 71449
  • [46] P3M POSSIBILISTIC MULTI-STEP MAXMIN AND MERGING ALGORITHM WITH APPLICATION TO GENE EXPRESSION DATA MINING
    Romdhane, Lotfi Ben
    Shili, Hechmi
    Ayeb, Bechir
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2009, 18 (04) : 545 - 567
  • [47] Mining Frequent and Associated Gene Expression Patterns from Spatial Gene Expression Data: A Proposed Approach
    Anandhavalli, M.
    Ghose, M. K.
    Gauthaman, K.
    CONTEMPORARY COMPUTING, PT 1, 2010, 94 : 119 - +
  • [48] Clustering of high throughput gene expression data
    Pirim, Harun
    Eksioglu, Burak
    Perkins, Andy D.
    Yuceer, Cetin
    COMPUTERS & OPERATIONS RESEARCH, 2012, 39 (12) : 3046 - 3061
  • [49] Decision forest for classification of gene expression data
    Huang, Jianping
    Fang, Hong
    Fan, Xiaohui
    COMPUTERS IN BIOLOGY AND MEDICINE, 2010, 40 (08) : 698 - 704
  • [50] Vector Quantization of Microarray Gene Expression Data
    Prasad, T. V.
    Kohli, Maitrei
    WORLD CONGRESS ON ENGINEERING, WCE 2010, VOL I, 2010, : 231 - 235