Experimental correlation analysis of bicluster coherence measures and gene ontology information

被引:1
|
作者
Padilha, Victor Alexandre [1 ]
de Leon Ferreira de Carvalho, Andre Carlos Ponce [1 ]
机构
[1] Univ Sao Paulo, Inst Math & Comp Sci, Av Trabalhador Sao Carlense 400, BR-13566590 Sao Carlos, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
Biclustering; Coherence measures; Gene ontology; Gene expression data; MICROARRAY DATA; EXPRESSION DATA; SACCHAROMYCES-CEREVISIAE; VALIDATION; ALGORITHMS; PATTERNS; QUALITY; GENOME;
D O I
10.1016/j.asoc.2019.105688
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Biclustering algorithms have become popular tools for gene expression data analysis. They can identify local patterns defined by subsets of genes and subsets of samples, which cannot be detected by traditional clustering algorithms. In spite of being useful, biclustering is an NP-hard problem. Therefore, the majority of biclustering algorithms look for biclusters optimizing a pre-established coherence measure. Many heuristics and validation measures have been proposed for biclustering over the last 20 years. However, there is a lack of an extensive comparison of bicluster coherence measures on practical scenarios. To deal with this lack, this paper experimentally analyzes 17 bicluster coherence measures and external measures calculated from information obtained in the gene ontologies. In this analysis, results were produced by 10 algorithms from the literature in 19 gene expression datasets. According to the experimental results, a few pairs of strongly correlated coherence measures could be identified, which suggests redundancy. Moreover, the pairs of strongly correlated measures might change when dealing with normalized or non-normalized data and biclusters enriched by different ontologies. Finally, there was no clear relation between coherence measures and assessment using information from gene ontology.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Microarray data classification by multi-information based gene scoring integrated with Gene Ontology
    Tseng, Vincent S.
    Yu, Hsieh-Hui
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2011, 5 (04) : 402 - 416
  • [42] Automatic extraction of gene ontology annotation and its correlation with clusters in protein networks
    Nikolai Daraselia
    Anton Yuryev
    Sergei Egorov
    Ilya Mazo
    Iaroslav Ispolatov
    BMC Bioinformatics, 8
  • [43] A New Family of Similarity Measures for Scoring Confidence of Protein Interactions Using Gene Ontology
    Paul, Madhusudan
    Anand, Ashish
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (01) : 19 - 30
  • [44] Gene ontology analysis of arthrogryposis (multiple congenital contractures)
    Kiefer, Jeff
    Hall, Judith G.
    AMERICAN JOURNAL OF MEDICAL GENETICS PART C-SEMINARS IN MEDICAL GENETICS, 2019, 181 (03) : 310 - 326
  • [45] The language of gene ontology: a Zipf’s law analysis
    Leila Ranandeh Kalankesh
    Robert Stevens
    Andy Brass
    BMC Bioinformatics, 13
  • [46] Visualization and analysis of microarray and gene ontology data with treemaps
    Eric H Baehrecke
    Niem Dang
    Ketan Babaria
    Ben Shneiderman
    BMC Bioinformatics, 5
  • [47] An Efficient Functional Analysis Method for Micro-array Data Using Gene Ontology
    Hong, Dong-wan
    Lee, Jong-keun
    Park, Sung-soo
    Hong, Sang-kyoon
    Yoon, Jee-hee
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2007, 3 (01): : 38 - 42
  • [48] Predicting protein subcellular localization based on information content of gene ontology terms
    Zhang, Shu-Bo
    Tang, Qiang-Rong
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2016, 65 : 1 - 7
  • [49] Grouping miRNAs of similar functions via weighted information content of gene ontology
    Chaowang Lan
    Qingfeng Chen
    Jinyan Li
    BMC Bioinformatics, 17
  • [50] Grouping miRNAs of similar functions via weighted information content of gene ontology
    Lan, Chaowang
    Chen, Qingfeng
    Li, Jinyan
    BMC BIOINFORMATICS, 2016, 17