Leveraging additional knowledge to support coherent bicluster discovery in gene expression data

被引:8
作者
Visconti, Alessia [1 ]
Cordero, Francesca [1 ]
Pensa, Ruggero G. [1 ]
机构
[1] Univ Turin, Dept Comp Sci, I-10149 Turin, Italy
关键词
Biclustering; constraint-based mining; gene expression data; SACCHAROMYCES-CEREVISIAE; CLUSTER-ANALYSIS; YEAST; ONTOLOGY; CANCER; ALGORITHMS; CYTOSCAPE; PROFILES; NETWORKS; PATTERNS;
D O I
10.3233/IDA-140671
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increasing availability of gene expression data has encouraged the development of purposely-built intelligent data analysis techniques. Grouping genes characterized by similar expression patterns is a widely accepted - and often mandatory - analysis step. Despite the fact that a number of biclustering methods have been developed to discover clusters of genes exhibiting a similar expression profile under a subgroup of experimental conditions, approaches driven by similarity measures based on expression profiles alone may lead to groups that are biologically meaningless. The integration of additional information, such as functional annotations, into biclustering algorithms can instead provide an effective support for identifying meaningful gene associations. In this paper we propose a new biclustering approach called Additional Information Driven Iterative Signature Algorithm, AID-ISA. It supports the extraction of biologically relevant biclusters by leveraging additional knowledge. We show that AID-ISA allows the discovery of coherent biclusters in baker's yeast and human gene expression data sets.
引用
收藏
页码:837 / 855
页数:19
相关论文
共 50 条
  • [31] A note on classification of gene expression data using support vector machines
    Fujarewicz, K
    Kimmel, M
    Rzeszowska-Wolny, J
    Swierniak, A
    JOURNAL OF BIOLOGICAL SYSTEMS, 2003, 11 (01) : 43 - 56
  • [32] Integrating gene expression and epidemiological data for the discovery of genetic interactions associated with cancer risk
    Bonifaci, Nuria
    Colas, Eva
    Serra-Musach, Jordi
    Karbalai, Nazanin
    Brunet, Joan
    Gomez, Antonio
    Esteller, Manel
    Fernandez-Taboada, Enrique
    Berenguer, Antoni
    Reventos, Jaume
    Mueller-Myhsok, Bertram
    Amundadottir, Laufey
    Duell, Eric J.
    Angel Pujana, Miquel
    CARCINOGENESIS, 2014, 35 (03) : 578 - 585
  • [33] Discovery and Preclinical Validation of Drug Indications Using Compendia of Public Gene Expression Data
    Sirota, Marina
    Dudley, Joel T.
    Kim, Jeewon
    Chiang, Annie P.
    Morgan, Alex A.
    Sweet-Cordero, Alejandro
    Sage, Julien
    Butte, Atul J.
    SCIENCE TRANSLATIONAL MEDICINE, 2011, 3 (96)
  • [34] CamurWeb: a classification software and a large knowledge base for gene expression data of cancer
    Emanuel Weitschek
    Silvia Di Lauro
    Eleonora Cappelli
    Paola Bertolazzi
    Giovanni Felici
    BMC Bioinformatics, 19
  • [35] Integrating Gene Expression Data and Pathway Knowledge for In Silico Hypothesis Generation with IMPRes
    Jiang, Yuexu
    Wang, Duolin
    Xu, Dong
    Joshi, Trupti
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 102 - 107
  • [36] CamurWeb: a classification software and a large knowledge base for gene expression data of cancer
    Weitschek, Emanuel
    Di Lauro, Silvia
    Cappelli, Eleonora
    Bertolazzi, Paola
    Felici, Giovanni
    BMC BIOINFORMATICS, 2018, 19 : 245 - 256
  • [37] Classification by integrating plant stress response gene expression data with biological knowledge
    Meng, Jun
    Li, Rui
    Luan, Yushi
    MATHEMATICAL BIOSCIENCES, 2015, 266 : 65 - 72
  • [38] Integrating biological knowledge based on functional annotations for biclustering of gene expression data
    Nepomuceno, Juan A.
    Troncoso, Alicia
    Nepomuceno-Chamorro, Isabel A.
    Aguilar-Ruiz, Jesus S.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2015, 119 (03) : 163 - 180
  • [39] A contiguous column coherent evolution biclustering algorithm for time-series gene expression data
    Yun Xue
    Meizhen Zhang
    Zhengling Liao
    Meihang Li
    Jie Luo
    Xiaohui Hu
    International Journal of Machine Learning and Cybernetics, 2018, 9 : 441 - 453
  • [40] A contiguous column coherent evolution biclustering algorithm for time-series gene expression data
    Xue, Yun
    Zhang, Meizhen
    Liao, Zhengling
    Li, Meihang
    Luo, Jie
    Hu, Xiaohui
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (03) : 441 - 453