Integrative Gene Network Construction to Analyze Cancer Recurrence Using Semi-Supervised Learning

被引:30
|
作者
Park, Chihyun [1 ]
Ahn, Jaegyoon [1 ]
Kim, Hyunjin [1 ]
Park, Sanghyun [1 ]
机构
[1] Yonsei Univ, Dept Comp Sci, Seoul 120749, South Korea
来源
PLOS ONE | 2014年 / 9卷 / 01期
基金
新加坡国家研究基金会;
关键词
EXPRESSION; METASTASIS; PREDICTION; CYTOSCAPE;
D O I
10.1371/journal.pone.0086309
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The prognosis of cancer recurrence is an important research area in bioinformatics and is challenging due to the small sample sizes compared to the vast number of genes. There have been several attempts to predict cancer recurrence. Most studies employed a supervised approach, which uses only a few labeled samples. Semi-supervised learning can be a great alternative to solve this problem. There have been few attempts based on manifold assumptions to reveal the detailed roles of identified cancer genes in recurrence. Results: In order to predict cancer recurrence, we proposed a novel semi-supervised learning algorithm based on a graph regularization approach. We transformed the gene expression data into a graph structure for semi-supervised learning and integrated protein interaction data with the gene expression data to select functionally-related gene pairs. Then, we predicted the recurrence of cancer by applying a regularization approach to the constructed graph containing both labeled and unlabeled nodes. Conclusions: The average improvement rate of accuracy for three different cancer datasets was 24.9% compared to existing supervised and semi-supervised methods. We performed functional enrichment on the gene networks used for learning. We identified that those gene networks are significantly associated with cancer-recurrence-related biological functions. Our algorithm was developed with standard C++ and is available in Linux and MS Windows formats in the STL library. The executable program is freely available at: http://embio.yonsei.ac.kr/Park/ssl.php.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] SICAGO: Semi-supervised cluster analysis using semantic distance between gene pairs in Gene Ontology
    Kang, Bo-Yeong
    Ko, Song
    Kim, Dae-Won
    BIOINFORMATICS, 2010, 26 (10) : 1384 - 1385
  • [32] Semi-supervised fuzzy-rough extreme learning machine for classification of cancer from microRNA
    Kumar, Ansuman
    Marak, Dikme Chisil B.
    Halder, Anindya
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (10) : 4537 - 4548
  • [33] Semi-Supervised Soft Computing for Ammonia Nitrogen Using a Self-Constructing Fuzzy Neural Network with an Active Learning Mechanism
    Zhou, Hongbiao
    Huang, Yang
    Yang, Dan
    Chen, Lianghai
    Wang, Le
    WATER, 2024, 16 (20)
  • [34] Developing Sustainable Classification of Diseases via Deep Learning and Semi-Supervised Learning
    Yin, Chunwu
    Chen, Zhanbo
    HEALTHCARE, 2020, 8 (03)
  • [35] Determining Effects of Non-synonymous SNPs on Protein-Protein Interactions using Supervised and Semi-supervised Learning
    Zhao, Nan
    Han, Jing Ginger
    Shyu, Chi-Ren
    Korkin, Dmitry
    PLOS COMPUTATIONAL BIOLOGY, 2014, 10 (05)
  • [36] Accurate in silico identification of protein succinylation sites using an iterative semi-supervised learning technique
    Zhao, Xiaowei
    Ning, Qiao
    Chai, Haiting
    Ma, Zhiqiang
    JOURNAL OF THEORETICAL BIOLOGY, 2015, 374 : 60 - 65
  • [37] Less Is More: Unlocking Semi-Supervised Deep Learning for Vulnerability Detection
    Yu, Xiao
    Lin, Guancheng
    Hu, Xing
    Keung, Jacky wai
    Xia, Xin
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2025, 34 (03)
  • [38] Semi-supervised learning of Hidden Markov Models for biological sequence analysis
    Tamposis, Ioannis A.
    Tsirigos, Konstantinos D.
    Theodoropoulou, Margarita C.
    Kontou, Panagiota, I
    Bagos, Pantelis G.
    BIOINFORMATICS, 2019, 35 (13) : 2208 - 2215
  • [39] Iterative processes: a review of semi-supervised machine learning in rehabilitation science
    Kringle, Emily A.
    Knutson, Evan C.
    Engstrom, Collin
    Terhorst, Lauren
    DISABILITY AND REHABILITATION-ASSISTIVE TECHNOLOGY, 2020, 15 (05) : 515 - 520
  • [40] Semi-supervised fuzzy K-NN for cancer classification from microarray gene expression data
    Halder, Anindya
    Misra, Subhashis
    2014 FIRST INTERNATIONAL CONFERENCE ON AUTOMATION, CONTROL, ENERGY & SYSTEMS (ACES-14), 2014, : 266 - 270