Integrative Gene Network Construction to Analyze Cancer Recurrence Using Semi-Supervised Learning

Cited by: 30
Authors
Park, Chihyun [1]
Ahn, Jaegyoon [1]
Kim, Hyunjin [1]
Park, Sanghyun [1]
Affiliation
[1] Yonsei Univ, Dept Comp Sci, Seoul 120749, South Korea
Source
PLOS ONE | 2014 / Vol. 9 / Issue 01
Funding
National Research Foundation of Singapore;
Keywords
EXPRESSION; METASTASIS; PREDICTION; CYTOSCAPE;
DOI
10.1371/journal.pone.0086309
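Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code
07; 0710; 09;
Abstract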
Background: Predicting cancer recurrence is an important research problem in bioinformatics, and it is challenging because sample sizes are small relative to the vast number of genes. There have been several attempts to predict cancer recurrence. Most studies employed a supervised approach, which uses only a few labeled samples. Semi-supervised learning, which also exploits unlabeled samples, can be a strong alternative. Few attempts based on manifold assumptions have been made to reveal the detailed roles of identified cancer genes in recurrence.
Results: To predict cancer recurrence, we proposed a novel semi-supervised learning algorithm based on graph regularization. We transformed the gene expression data into a graph structure for semi-supervised learning and integrated protein interaction data with the gene expression data to select functionally related gene pairs. We then predicted cancer recurrence by applying a regularization approach to the constructed graph, which contains both labeled and unlabeled nodes.
Conclusions: Across three different cancer datasets, accuracy improved by an average of 24.9% compared to existing supervised and semi-supervised methods. We performed functional enrichment analysis on the gene networks used for learning and found that those networks are significantly associated with biological functions related to cancer recurrence. Our algorithm was implemented in standard C++ using the STL and is available for Linux and MS Windows. The executable program is freely available at: http://embio.yonsei.ac.kr/Park/ssl.php.
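
The abstract describes prediction by regularization over a graph that mixes labeled and unlabeled nodes. As a rough illustration of that general idea only, the sketch below implements a generic label-spreading scheme on a toy graph. It is not the authors' algorithm: the function names, the `alpha` parameter, the edge weights, and the six-node graph are all assumptions made up for this example, and the abstract does not specify how the actual graph or regularizer is defined.

```python
import math


def spread_labels(weights, labels, alpha=0.9, iterations=200):
    """Generic graph-regularized label spreading (illustrative assumption,
    not the method from the paper).

    weights: symmetric n x n list of lists of non-negative edge weights.
    labels:  list of n values, +1 or -1 for labeled nodes, 0 for unlabeled.
    Returns real-valued scores; the sign is the predicted class.
    """
    n = len(labels)
    degree = [sum(row) for row in weights]

    # Symmetric normalization: S[i][j] = W[i][j] / sqrt(d_i * d_j).
    norm = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if weights[i][j] > 0 and degree[i] > 0 and degree[j] > 0:
                norm[i][j] = weights[i][j] / math.sqrt(degree[i] * degree[j])

    # Iterate F <- alpha * S * F + (1 - alpha) * Y.
    # alpha trades smoothness over the graph against fidelity to known labels.
    scores = [float(y) for y in labels]
    for _ in range(iterations):
        scores = [
            alpha * sum(norm[i][j] * scores[j] for j in range(n))
            + (1 - alpha) * labels[i]
            for i in range(n)
        ]
    return scores


def predict_classes(scores):
    """Threshold the scores at zero: +1 or -1 per node."""
    return [1 if s > 0 else -1 for s in scores]


# Hypothetical toy graph: two tight triangles joined by one weak edge (2-3).
toy_weights = [[0.0] * 6 for _ in range(6)]
for a, b, w in [(0, 1, 1.0), (0, 2, 1.0), (1, 2, 1.0),
                (3, 4, 1.0), (3, 5, 1.0), (4, 5, 1.0),
                (2, 3, 0.1)]:
    toy_weights[a][b] = toy_weights[b][a] = w

# Only node 0 (+1) and node 5 (-1) are labeled; the rest are unlabeled.
toy_labels = [1, 0, 0, 0, 0, -1]

toy_scores = spread_labels(toy_weights, toy_labels)
print(predict_classes(toy_scores))
```

In this toy setup the unlabeled nodes 1 and 2 end up with positive scores and nodes 3 and 4 with negative scores, because each one is strongly connected to a labeled node of that class and only weakly connected to the other cluster.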
Pages: 9