Integrative Gene Network Construction to Analyze Cancer Recurrence Using Semi-Supervised Learning

Cited by: 30
Authors
Park, Chihyun [1]
Ahn, Jaegyoon [1]
Kim, Hyunjin [1]
Park, Sanghyun [1]
Affiliations
[1] Yonsei Univ, Dept Comp Sci, Seoul 120749, South Korea
Source
PLOS ONE | 2014 / Vol. 9 / Issue 01
Funding
National Research Foundation, Singapore;
Keywords
EXPRESSION; METASTASIS; PREDICTION; CYTOSCAPE;
DOI
10.1371/journal.pone.0086309
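Chinese Library Classification number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code
07; 0710; 09;
Abstract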
Background: Predicting cancer recurrence is an important research area in bioinformatics and is challenging because sample sizes are small relative to the vast number of genes. There have been several attempts to predict cancer recurrence. Most studies employed a supervised approach, which uses only a few labeled samples. Semi-supervised learning, which also exploits unlabeled samples, can be a strong alternative. However, there have been few attempts based on manifold assumptions to reveal the detailed roles of identified cancer genes in recurrence.
Results: To predict cancer recurrence, we propose a novel semi-supervised learning algorithm based on a graph regularization approach. We transformed the gene expression data into a graph structure for semi-supervised learning and integrated protein interaction data with the gene expression data to select functionally related gene pairs. We then predicted the recurrence of cancer by applying a regularization approach to the constructed graph, which contains both labeled and unlabeled nodes.
Conclusions: The average improvement in accuracy across three different cancer datasets was 24.9% compared to existing supervised and semi-supervised methods. We performed functional enrichment analysis on the gene networks used for learning and found that they are significantly associated with biological functions related to cancer recurrence. Our algorithm was implemented in standard C++ using the STL and is available for Linux and MS Windows. The executable program is freely available at: http://embio.yonsei.ac.kr/Park/ssl.php.
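The abstract does not spell out the regularization itself, so the following is only a generic illustration of graph-based semi-supervised learning, not the authors' algorithm. It assumes that each node is a patient sample, that edge weights come from a Gaussian similarity over the expression values of already selected genes, and that labels are spread with the widely used update f <- alpha * S f + (1 - alpha) * y. All function names, the toy data, and the parameters `sigma`, `alpha`, and `iterations` are hypothetical.

```python
import math


def gaussian_similarity(x, y, sigma=1.0):
    """Edge weight between two samples, computed from their expression vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))


def build_graph(samples, sigma=1.0):
    """Dense symmetric weight matrix with a zero diagonal (no self-loops)."""
    n = len(samples)
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            w[i][j] = w[j][i] = gaussian_similarity(samples[i], samples[j], sigma)
    return w


def propagate_labels(w, labels, alpha=0.9, iterations=200):
    """Iterate f <- alpha * S f + (1 - alpha) * y, with S = D^-1/2 W D^-1/2.

    labels: +1 (recurrence), -1 (no recurrence), 0 (unlabeled).
    Returns one real-valued score per node; its sign is the predicted class.
    """
    n = len(w)
    degree = [sum(row) for row in w]
    # A positive weight implies both degrees are positive, so no division by zero.
    s = [[w[i][j] / math.sqrt(degree[i] * degree[j]) if w[i][j] > 0 else 0.0
          for j in range(n)] for i in range(n)]
    y = [float(v) for v in labels]
    f = y[:]
    for _ in range(iterations):
        f = [alpha * sum(s[i][j] * f[j] for j in range(n)) + (1 - alpha) * y[i]
             for i in range(n)]
    return f


# Toy data (made up): six samples, two genes each, forming two loose groups.
samples = [[0.1, 0.2], [0.2, 0.1], [0.15, 0.25],
           [2.0, 2.1], [2.1, 1.9], [1.9, 2.0]]
labels = [1, 0, 0, -1, 0, 0]  # only one labeled sample per group

scores = propagate_labels(build_graph(samples), labels)
predicted = [1 if score >= 0 else -1 for score in scores]
```

In this sketch the unlabeled samples take the label of the group they sit closest to in the graph, which is the intuition behind using both labeled and unlabeled nodes. The step that integrates protein interaction data to select gene pairs is not modeled here.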
Pages: 9