Unveiling Fuzzy Associations Between Breast Cancer Prognostic Factors and Gene Expression Data

被引:2
作者
Javier Lopez, F. [1 ]
Cuadros, Marta [1 ]
Blanco, Armando [1 ]
Concha, Angel [2 ]
机构
[1] Univ Granada, Dept Comp Sci & AI, C Periodista Daniel Saucedo Aranda S-N, E-18071 Granada, Spain
[2] Hosp Univ Virgen De Las Nieves, Dept Pathol & Tissue & Tumor Bank, Infect Pathol Unit, Granada 18014, Spain
来源
PROCEEDINGS OF THE 20TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATION | 2009年
关键词
Breast cancer; prognostic factors; microarray; fuzzy; association rules; RULES; TUMOR; METASTASIS; BINDING; GREB1;
D O I
10.1109/DEXA.2009.36
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Breast cancer is the second most common cancer worldwide and the fifth most common cause of cancer death. There are many prognostic factors associated with breast cancer which are usually considered when determining how cancer will affect a patient. In addition, distinct molecular subtypes of breast tumors have been described by gene expression profiling. In this work we integrate information from the main prognostic factors in breast cancer with whole-genome microarray data to study the potential associations between these two types of data. The heterogeneity and noisy nature of the data along with its high dimensionality make necessary the use of data mining techniques to analyze the dataset. Fuzzy sets are particularly suitable to model imprecise and noisy data, while association rules are very appropriate to deal with heterogeneous and high dimensionality data. Thus, a fuzzy association rule mining algorithm was used to carry out this study. Many interesting associations have been obtained. Further studies and empirical evaluation of these associations are needed to obtain scientific evidence of such relations. Finally, a freely accessible web application has been developed which implements the fuzzy association rule mining algorithm used in this study (http://genome.ugr.es/biofar).
引用
收藏
页码:338 / +
页数:2
相关论文
共 37 条
  • [1] Abe O, 2005, LANCET, V366, P2087, DOI 10.1016/s0140-6736(05)66544-0
  • [2] Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
  • [3] [Anonymous], 2000, Genome Biol.
  • [4] PathFinder: mining signal transduction pathway segments from protein-protein interaction networks
    Bebek, Gurkan
    Yang, Jiong
    [J]. BMC BIOINFORMATICS, 2007, 8 (1)
  • [5] Integrated analysis of gene expression by association rules discovery
    Carmona-Saez, P
    Chagoyen, M
    Rodriguez, A
    Trelles, O
    Carazo, JM
    Pascual-Montano, A
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [6] Chan Man Kuok, 1998, SIGMOD Record, V27, P41, DOI 10.1145/273244.273257
  • [7] Fuzzy association rules:: General model and applications
    Delgado, M
    Marín, N
    Sánchez, D
    Vila, MA
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2003, 11 (02) : 214 - 225
  • [8] Delgado M, 2003, P BISC INT WORKSH SO
  • [9] DUBITZKY W, 2004, PRACTICAL APPROACH M, P91
  • [10] Cluster analysis and display of genome-wide expression patterns
    Eisen, MB
    Spellman, PT
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) : 14863 - 14868