Integrative Gene Network Construction to Analyze Cancer Recurrence Using Semi-Supervised Learning
Cited by: 30
Authors:
Park, Chihyun [1]
Ahn, Jaegyoon [1]
Kim, Hyunjin [1]
Park, Sanghyun [1]
Affiliations:
[1] Yonsei Univ, Dept Comp Sci, Seoul 120749, South Korea
Source:
PLOS ONE | 2014 / Volume 9 / Issue 01
Funding:
National Research Foundation, Singapore;
Keywords:
EXPRESSION;
METASTASIS;
PREDICTION;
CYTOSCAPE;
DOI:
10.1371/journal.pone.0086309
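Chinese Library Classification:
O [Mathematical sciences and chemistry];
P [Astronomy and earth sciences];
Q [Biological sciences];
N [General natural sciences];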
Discipline classification codes:
07 ;
0710 ;
09 ;
Abstract:
Background: The prognosis of cancer recurrence is an important research area in bioinformatics and is challenging because sample sizes are small compared to the vast number of genes. There have been several attempts to predict cancer recurrence. Most studies employed a supervised approach, which uses only a few labeled samples. Semi-supervised learning can be a great alternative to solve this problem. There have been few attempts based on manifold assumptions to reveal the detailed roles of identified cancer genes in recurrence.

Results: In order to predict cancer recurrence, we proposed a novel semi-supervised learning algorithm based on a graph regularization approach. We transformed the gene expression data into a graph structure for semi-supervised learning and integrated protein interaction data with the gene expression data to select functionally related gene pairs. Then, we predicted the recurrence of cancer by applying a regularization approach to the constructed graph containing both labeled and unlabeled nodes.

Conclusions: The average improvement rate of accuracy for three different cancer datasets was 24.9% compared to existing supervised and semi-supervised methods. We performed functional enrichment on the gene networks used for learning and found that those gene networks are significantly associated with cancer-recurrence-related biological functions. Our algorithm was developed in standard C++ with the STL and is available for Linux and MS Windows. The executable program is freely available at: http://embio.yonsei.ac.kr/Park/ssl.php.
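To make the idea of graph regularization over labeled and unlabeled nodes concrete, here is a minimal sketch. It is not the authors' C++ implementation and does not reproduce their graph construction from expression and protein interaction data. As a stand-in it uses the well-known label spreading iteration on a symmetrically normalized graph. The function name `propagate_labels`, the toy edge weights, `alpha`, and the iteration count are all assumptions chosen for illustration.

```python
import math


def propagate_labels(weights, labels, alpha=0.9, iterations=200):
    """Label spreading on a weighted graph (illustrative stand-in only).

    weights: symmetric n x n list of non-negative edge weights, zero diagonal.
    labels:  +1 or -1 for labeled nodes, 0 for unlabeled nodes.
    Returns real-valued scores; the sign of a score is the predicted class.
    """
    n = len(weights)
    # Node degrees; fall back to 1.0 so isolated nodes do not divide by zero.
    degree = [sum(row) or 1.0 for row in weights]
    # Symmetrically normalized affinity S = D^{-1/2} W D^{-1/2}.
    s = [
        [weights[i][j] / math.sqrt(degree[i] * degree[j]) for j in range(n)]
        for i in range(n)
    ]
    scores = [float(y) for y in labels]
    for _ in range(iterations):
        # Each node mixes its neighbors' scores with its own initial label.
        scores = [
            alpha * sum(s[i][j] * scores[j] for j in range(n))
            + (1 - alpha) * labels[i]
            for i in range(n)
        ]
    return scores


# Hypothetical toy graph: two tight clusters (nodes 0-2 and 3-5)
# joined by one weak edge between nodes 2 and 3.
W = [
    [0.0, 1.0, 1.0, 0.0, 0.0, 0.0],
    [1.0, 0.0, 1.0, 0.0, 0.0, 0.0],
    [1.0, 1.0, 0.0, 0.1, 0.0, 0.0],
    [0.0, 0.0, 0.1, 0.0, 1.0, 1.0],
    [0.0, 0.0, 0.0, 1.0, 0.0, 1.0],
    [0.0, 0.0, 0.0, 1.0, 1.0, 0.0],
]
# Only node 0 (class +1) and node 5 (class -1) are labeled.
y = [1, 0, 0, 0, 0, -1]

scores = propagate_labels(W, y)
predicted = [1 if value > 0 else -1 for value in scores]
print(predicted)
```

In this toy setting the unlabeled nodes take the class of the labeled node in their own cluster, because the weak bridge edge carries little influence. In the setting the abstract describes, the labels would be the recurrence status of the labeled samples, and the edge weights would come from the constructed graph.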