DeepEBV: a deep learning model to predict Epstein-Barr virus (EBV) integration sites

被引:5
作者
Liang, Jiuxing [1 ,2 ]
Cui, Zifeng [3 ]
Wu, Canbiao [1 ]
Yu, Yao [4 ,5 ]
Tian, Rui [6 ]
Xie, Hongxian [7 ]
Jin, Zhuang [3 ]
Fan, Weiwen [3 ]
Xie, Weiling [3 ]
Huang, Zhaoyue [3 ]
Xu, Wei [3 ]
Zhu, Jingjing [3 ]
You, Zeshan [3 ]
Guo, Xiaofang [8 ]
Qiu, Xiaofan [1 ]
Ye, Jiahao [1 ,9 ]
Lang, Bin [10 ]
Li, Mengyuan [3 ]
Tan, Songwei [11 ]
Hu, Zheng [3 ,12 ]
机构
[1] Minist Educ, Key Lab Brain Cognit & Educ Sci, Guangzhou, Peoples R China
[2] South China Normal Univ, Inst Brain Res & Rehabil, Guangzhou 510631, Peoples R China
[3] Sun Yat Sen Univ, Affiliated Hosp 1, Dept Gynaecol Oncol, Guangzhou 510080, Guangdong, Peoples R China
[4] Chinese Peoples Liberat Army Gen Hosp, Med Ctr 1, Dept Urol, Beijing 100853, Peoples R China
[5] Nankai Univ, Sch Med, Tianjin 300071, Peoples R China
[6] Sun Yat Sen Univ, Affiliated Hosp 1, Ctr Translat Med, Guangzhou 510080, Guangdong, Peoples R China
[7] Generulor Co Bio X Lab, Guangzhou 510006, Guangdong, Peoples R China
[8] Sun Yat Sen Univ, Affiliated Hosp 1, Eastern Hosp, Dept Med Oncol, Guangzhou 510700, Peoples R China
[9] South China Normal Univ, Sch Comp Sci, Guangzhou 510631, Peoples R China
[10] Macao Polytech Inst, Sch Hlth Sci & Sports, Macau, Peoples R China
[11] Huazhong Univ Sci & Technol, Tongji Med Coll, Sch Pharm, Wuhan 430030, Peoples R China
[12] Huazhong Univ Sci & Technol, Tongji Med Coll, Tongji Hosp, Dept Obstet & Gynaecol, Wuhan 430030, Hubei, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
GENE; EXPRESSION;
D O I
10.1093/bioinformatics/btab388
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Epstein-Barr virus (EBV) is one of the most prevalent DNA oncogenic viruses. The integration of EBV into the host genome has been reported to play an important role in cancer development. The preference of EBV integration showed strong dependence on the local genomic environment, which enables the prediction of EBV integration sites. Results: An attention-based deep learning model, DeepEBV, was developed to predict EBV integration sites by learning local genomic features automatically. First, DeepEBV was trained and tested using the data from the dsVIS database. The results showed that DeepEBV with EBV integration sequences plus Repeat peaks and 2-fold data augmentation performed the best on the training dataset. Furthermore, the performance of the model was validated in an independent dataset. In addition, the motifs of DNA-binding proteins could influence the selection preference of viral insertional mutagenesis. Furthermore, the results showed that DeepEBV can predict EBV integration hotspot genes accurately. In summary, DeepEBV is a robust, accurate and explainable deep learning model, providing novel insights into EBV integration preferences and mechanisms.
引用
收藏
页码:3405 / 3411
页数:7
相关论文
共 35 条
  • [1] Aghdam H.H., 2017, GUIDE CONVOLUTIONAL, DOI [10.5555/3122908#, DOI 10.5555/3122908#]
  • [2] An Atlas of the Epstein-Barr Virus Transcriptome and Epigenome Reveals Host-Virus Regulatory Interactions
    Arvey, Aaron
    Tempera, Italo
    Tsai, Kevin
    Chen, Horng-Shen
    Tikhmyanova, Nadezhda
    Klichinsky, Michael
    Leslie, Christina
    Lieberman, Paul M.
    [J]. CELL HOST & MICROBE, 2012, 12 (02) : 233 - 245
  • [3] Brouillette M., 2020, DEEP LEARNING IS BLA
  • [4] High-Throughput RNA Sequencing-Based Virome Analysis of 50 Lymphoma Cell Lines from the Cancer Cell Line Encyclopedia Project
    Cao, Subing
    Strong, Michael J.
    Wang, Xia
    Moss, Walter N.
    Concha, Monica
    Lin, Zhen
    O'Grady, Tina
    Baddoo, Melody
    Fewell, Claire
    Renne, Rolf
    Flemington, Erik K.
    [J]. JOURNAL OF VIROLOGY, 2015, 89 (01) : 713 - 729
  • [5] Integrated Pan-Cancer Map of EBV-Associated Neoplasms Reveals Functional Host-Virus Interactions
    Chakravorty, Srishti
    Yan, Bingyu
    Wang, Chong
    Wang, Luopin
    Quaid, Joseph Taylor
    Lin, Chin Fang
    Briggs, Scott D.
    Majumder, Joydeb
    Canaria, D. Alejandro
    Chauss, Daniel
    Chopra, Gaurav
    Olson, Matthew R.
    Zhao, Bo
    Afzali, Behdad
    Kazemian, Majid
    [J]. CANCER RESEARCH, 2019, 79 (23) : 6010 - 6023
  • [6] Linkage between STAT regulation and Epstein-Barr virus gene expression in tumors
    Chen, HL
    Lee, JM
    Zong, YS
    Borowitz, M
    Ng, MH
    Ambinder, RF
    Hayward, SD
    [J]. JOURNAL OF VIROLOGY, 2001, 75 (06) : 2929 - 2937
  • [7] Chollet F., 2015, KERAS 20 COMPUTER SO
  • [8] Cruz JA, 2006, CANCER INFORM, V2, P59
  • [9] Deeplearning.net, 2020, CONVOLUTIONAL NEURAL
  • [10] GUIDOTTI R, 2018, ARXIV180201933CSCY, DOI DOI 10.1145/3236009