SEMI-SUPERVISED LEARNING OF ALTERNATIVELY SPLICED EXONS USING EXPECTATION MAXIMIZATION TYPE APPROACHES

被引:2
|
作者
Stanescu, Ana [1 ]
Caragea, Doina [1 ]
机构
[1] Kansas State Univ, Manhattan, KS 66506 USA
关键词
Semi-supervised learning; Expectation maximization; Alternative splicing;
D O I
10.5220/0003791802400245
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Successful advances in DNA sequencing technologies have made it possible to obtain tremendous amounts of data fast and inexpensively. As a consequence, the afferent genome annotation has become the bottleneck in our understanding of genes and their functions. Traditionally, data from biological domains have been analyzed using supervised learning techniques. However, given the large amounts of unlabeled genomics data available, together with small amounts of labeled data, the use of semi-supervised learning algorithms is desirable. Our purpose is to study the applicability of semi-supervised learning frameworks to DNA prediction problems, with focus on alternative splicing, a natural biological process that contributes to protein diversity. More specifically, we address the problem of predicting alternatively spliced exons. To utilize the unlabeled data, we train classifiers via the Expectation Maximization method and variants of this method. The experiments conducted show an increase in the quality of the prediction models when unlabeled data is used in the training phase, as compared to supervised prediction models which do not make use of the unlabeled data.
引用
收藏
页码:240 / 245
页数:6
相关论文
共 50 条
  • [1] Predicting alternatively spliced exons using semi-supervised learning
    Stanescu, Ana
    Tangirala, Karthik
    Caragea, Doina
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 14 (01) : 1 - 21
  • [2] Semi-Supervised Learning of Alternatively Spliced Exons Using Co-Training
    Tangirala, Karthik
    Caragea, Doina
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, : 243 - 246
  • [3] Rumor detection in Arabic tweets using semi-supervised and unsupervised expectation-maximization
    Alzanin, Samah M.
    Azmi, Aqil M.
    KNOWLEDGE-BASED SYSTEMS, 2019, 185
  • [4] Optimization approaches for semi-supervised learning
    Yajima, Y
    Hoshiba, T
    ICMLA 2005: FOURTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2005, : 247 - 252
  • [5] Optimization approaches to semi-supervised learning
    Demiriz, A
    Bennett, KP
    COMPLEMENTARITY: APPLICATIONS, ALGORITHMS AND EXTENSIONS, 2001, 50 : 121 - 141
  • [6] Bayesian Pseudo Labels: Expectation Maximization for Robust and Efficient Semi-supervised Segmentation
    Xu, Mou-Cheng
    Zhou, Yukun
    Jin, Chen
    de Groot, Marius
    Alexander, Daniel C.
    Oxtoby, Neil P.
    Hu, Yipeng
    Jacob, Joseph
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 580 - 590
  • [7] Greedy approaches to semi-supervised subspace learning
    Kim, Minyoung
    PATTERN RECOGNITION, 2015, 48 (04) : 1563 - 1570
  • [8] Spectral Transformation Approaches To Semi-supervised Learning
    Hu, Chonghai
    Wang, Chengqun
    Liu, Kangsheng
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 207 - +
  • [9] Approaches to semi-supervised learning of fuzzy classifiers
    Klose, A
    KI 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2003, 2821 : 436 - 449
  • [10] Alternatively spliced internal exons prediction using SVM
    Qiao, Yuanhua
    Jia, Erze
    Zeng, Yanjun
    PROGRESS ON POST-GENOME TECHNOLOGIES, 2007, : 115 - 118