Class-Incremental Learning With Deep Generative Feature Replay for DNA Methylation-Based Cancer Classification

被引:4
作者
Batbaatar, Erdenebileg [1 ]
Park, Kwang Ho [1 ]
Amarbayasgalan, Tsatsral [1 ]
Davagdorj, Khishigsuren [1 ]
Munkhdalai, Lkhagvadorj [1 ]
Pham, Van-Huy [2 ]
Ryu, Keun Ho [2 ]
机构
[1] Chungbuk Natl Univ, Sch Elect & Comp Engn, Database Bioinformat Lab, Cheongju 28644, South Korea
[2] Ton Duc Thang Univ, Fac Informat Technol, Ho Chi Minh City 700000, Vietnam
基金
新加坡国家研究基金会;
关键词
Cancer; DNA; Task analysis; Data models; Computational modeling; Feature extraction; Biological system modeling; Computational biology; deep learning; class-incremental learning; continual learning; deep generative model; variational autoencoder; DNA methylation; cancer classification; EPIGENETICS;
D O I
10.1109/ACCESS.2020.3039624
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Developing lifelong learning algorithms are mandatory for computational systems biology. Recently, many studies have shown how to extract biologically relevant information from high-dimensional data to understand the complexity of cancer by taking the benefit of deep learning (DL). Unfortunately, new cancer growing up into the hundred types that make systems difficult to classify them efficiently. In contrast, the current state-of-the-art continual learning (CL) methods are not designed for the dynamic characteristics of high-dimensional data. And data security and privacy are some of the main issues in the biomedical field. This article addresses three practical challenges for class-incremental learning (Class-IL) such as data privacy, high-dimensionality, and incremental learning problems. To solve this, we propose a novel continual learning approach, called Deep Generative Feature Replay (DGFR), for cancer classification tasks. DGFR consists of an incremental feature selection (IFS) and a scholar network (SN). IFS is used for selecting the most significant CpG sites from high-dimensional data. We investigate different dimensions to find an optimal number of selected CpG sites. SN employs a deep generative model for generating pseudo data without accessing past samples and a neural network classifier for predicting cancer types. We use a variational autoencoder (VAE), which has been successfully applied to this research field in previous works. All networks are sequentially trained on multiple tasks in the Class-IL setting. We evaluated the proposed method on the publicly available DNA methylation data. The experimental results show that the proposed DGFR achieves a significantly superior quality of cancer classification tasks with various state-of-the-art methods in terms of accuracy.
引用
收藏
页码:210800 / 210815
页数:16
相关论文
共 85 条
[1]   Deep Learning Applications for Predicting Pharmacological Properties of Drugs and Drug Repurposing Using Transcriptomic Data [J].
Aliper, Alexander ;
Plis, Sergey ;
Artemov, Artem ;
Ulloa, Alvaro ;
Mamoshina, Polina ;
Zhavoronkov, Alex .
MOLECULAR PHARMACEUTICS, 2016, 13 (07) :2524-2530
[2]  
Aljundi R., 2019, Advances in Neural Information Processing Systems (NeurIPS), Vvol 32
[3]   Expert Gate: Lifelong Learning with a Network of Experts [J].
Aljundi, Rahaf ;
Chakravarty, Punarjay ;
Tuytelaars, Tinne .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :7120-7129
[4]   DeepCpG: accurate prediction of single-cell DNA methylation states using deep learning [J].
Angermueller, Christof ;
Lee, Heather J. ;
Reik, Wolf ;
Stegle, Oliver .
GENOME BIOLOGY, 2017, 18
[5]  
[Anonymous], ARXIV171010628
[6]  
[Anonymous], 2019, ARXIV190908383
[7]   DNA methylation and gene silencing in cancer [J].
Baylin S.B. .
Nature Clinical Practice Oncology, 2005, 2 (Suppl 1) :S4-S11
[8]   IL2M: Class Incremental Learning With Dual Memory [J].
Belouadah, Eden ;
Popescu, Adrian .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :583-592
[9]  
Belouadah Eden, 2018, COMPUTER VISION EC 2, P151
[10]   Perceptions of epigenetics [J].
Bird, Adrian .
NATURE, 2007, 447 (7143) :396-398