A New Method to Combine Heterogeneous Data Sources for Candidate Gene Prioritization

被引:0
作者
Li, Yongjin [1 ]
Patra, Jagdish C. [1 ]
Sun, Jiabao [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
来源
2009 9TH IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING | 2009年
关键词
GENOMIC DATA FUSION; DISEASE-GENES; NETWORK;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
How to effectively integrate heterogeneous data sources is becoming extremely challenging, because many useful but noisy data sources are available for the problem at hand. In this paper, for disease gene prioritization problem, we investigated multiple kernels learning (MKL) and N dimensional order statistics (NDOS) method, but found that neither could effectively extract useful information from noisy data. Especially, in MKL algorithm, ineffective data source may be given more weight, which downgrades the effectiveness of the combined kernel. We proposed an improved procedure based on NDOS. We first use cross validation to evaluate each individual data source, and only effective data sources are used in the prioritizations of candidate genes.
引用
收藏
页码:123 / 129
页数:7
相关论文
共 50 条
  • [21] Global Propagation Method for Predicting Protein Function by Integrating Multiple Data Sources
    Meng, Jun
    Zhang, Xin
    Luan, Yushi
    CURRENT BIOINFORMATICS, 2016, 11 (02) : 186 - 194
  • [22] Meta-Analysis of Heterogeneous Data Sources for Genome-Scale Identification of Risk Genes in Complex Phenotypes
    Pers, Tune H.
    Hansen, Niclas Tue
    Lage, Kasper
    Koefoed, Pernille
    Dworzynski, Piotr
    Miller, Martin Lee
    Flint, Tracey J.
    Mellerup, Erling
    Dam, Henrik
    Andreassen, Ole A.
    Djurovic, Srdjan
    Melle, Ingrid
    Borglum, Anders D.
    Werge, Thomas
    Purcell, Shaun
    Ferreira, Manuel A.
    Kouskoumvekaki, Irene
    Workman, Christopher T.
    Hansen, Torben
    Mors, Ole
    Brunak, Soren
    GENETIC EPIDEMIOLOGY, 2011, 35 (05) : 318 - 332
  • [23] A structured multi-head attention prediction method based on heterogeneous financial data
    Zhao, Cheng
    Li, Fangyong
    Peng, Zhe
    Zhou, Xiao
    Yan, Zhuge
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [24] A multi-source heterogeneous data fusion method for intelligent systems in the Internet of Things
    Sun, Rongrong
    Ren, Yuemei
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2024, 23
  • [25] Integrative enrichment analysis: a new computational method to detect dysregulated pathways in heterogeneous samples
    Yu, Xiangtian
    Zeng, Tao
    Li, Guojun
    BMC GENOMICS, 2015, 16
  • [26] A Bayesian Method for Dynamic Origin-Destination Demand Estimation Synthesizing Multiple Sources of Data
    Yu, Hang
    Zhu, Senlai
    Yang, Jie
    Guo, Yuntao
    Tang, Tianpei
    SENSORS, 2021, 21 (15)
  • [27] Carbon Emission Evaluation Method and Comparison Study of Transformer Substations Using Different Data Sources
    Liu, Xigang
    Zhang, Jian
    Hu, Yiqi
    Liu, Jiao
    Ding, Shijun
    Zhao, Gaowen
    Zhang, Yang
    Li, Jiawei
    Nie, Zhibao
    BUILDINGS, 2023, 13 (04)
  • [28] A new method to measure the divergence in evidential sensor data fusion
    Song, Yutong
    Deng, Yong
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2019, 15 (04)
  • [29] An incremental mixed data clustering method using a new distance measure
    Noorbehbahani, Fakhroddin
    Mousavi, Sayyed Rasoul
    Mirzaei, Abdolreza
    SOFT COMPUTING, 2015, 19 (03) : 731 - 743
  • [30] A Heterogeneous Network Based Method for Identifying GBM-Related Genes by Integrating Multi-Dimensional Data
    Peng, Chen
    Li, Ao
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (03) : 713 - 720