DeePhafier: a phage lifestyle classifier using a multilayer self-attention neural network combining protein information

被引:1
作者
Miao, Yan [1 ]
Sun, Zhenyuan [1 ]
Lin, Chen [2 ]
Gu, Haoran [1 ]
Ma, Chenjing [1 ]
Liang, Yingjian [3 ]
Wang, Guohua [1 ]
机构
[1] Northeast Forestry Univ, Coll Comp & Control Engn, 26 Hexing Rd, Harbin 150040, Heilongjiang, Peoples R China
[2] Xiamen Univ, Natl Inst Data Sci Hlth & Med, 4221 Xiangannan Rd, Xiamen 361102, Fujian, Peoples R China
[3] Harbin Med Univ, Affiliated Hosp 1, Dept Gen Surg, Key Lab Hepatosplen Surg,Minist Educ, 23 Postal St, Harbin 150007, Heilongjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
metagenome; phage lifestyle classification; self-attention network; PSSM matrix; METAGENOMIC DATA; BACTERIA; HOST;
D O I
10.1093/bib/bbae377
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Bacteriophages are the viruses that infect bacterial cells. They are the most diverse biological entities on earth and play important roles in microbiome. According to the phage lifestyle, phages can be divided into the virulent phages and the temperate phages. Classifying virulent and temperate phages is crucial for further understanding of the phage-host interactions. Although there are several methods designed for phage lifestyle classification, they merely either consider sequence features or gene features, leading to low accuracy. A new computational method, DeePhafier, is proposed to improve classification performance on phage lifestyle. Built by several multilayer self-attention neural networks, a global self-attention neural network, and being combined by protein features of the Position Specific Scoring Matrix matrix, DeePhafier improves the classification accuracy and outperforms two benchmark methods. The accuracy of DeePhafier on five-fold cross-validation is as high as 87.54% for sequences with length >2000bp.
引用
收藏
页数:10
相关论文
共 36 条
  • [1] A State-of-the-Art Survey on Deep Learning Theory and Architectures
    Alom, Md Zahangir
    Taha, Tarek M.
    Yakopcic, Chris
    Westberg, Stefan
    Sidike, Paheding
    Nasrin, Mst Shamima
    Hasan, Mahmudul
    Van Essen, Brian C.
    Awwal, Abdul A. S.
    Asari, Vijayan K.
    [J]. ELECTRONICS, 2019, 8 (03)
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [4] EFFECTS OF GROWTH-MEDIUM ON PHAGE PRODUCTION AND INDUCTION IN ESCHERICHIA-COLI K-12 LAMBDA LYSOGENS
    CLARK, DW
    MEYER, HP
    LEIST, C
    FIECHTER, A
    [J]. JOURNAL OF BIOTECHNOLOGY, 1986, 3 (5-6) : 271 - 280
  • [5] Clarke KJ, 1998, FEMS MICROBIOL LETT, V166, P177, DOI 10.1111/j.1574-6968.1998.tb13887.x
  • [6] Predicting TF Proteins by Incorporating Evolution Information Through PSSM
    Du, Zhihua
    Huang, Tianyou
    Uversky, Vladimir N.
    Li, Jianqiang
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (02) : 1319 - 1326
  • [7] Dynamic Viral Populations in Hypersaline Systems as Revealed by Metagenomic Assembly
    Emerson, Joanne B.
    Thomas, Brian C.
    Andrade, Karen
    Allen, Eric E.
    Heidelberg, Karla B.
    Banfield, Jillian F.
    [J]. APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2012, 78 (17) : 6309 - 6320
  • [9] A comparison of alternative tests of significance for the problem of m rankings
    Friedman, M
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1940, 11 : 86 - 92
  • [10] Novel Virulent Bacteriophage ΦSG005, Which Infects Streptococcus gordonii, Forms a Distinct Clade among Streptococcus Viruses
    Fujiki, Jumpei
    Yoshida, Shin-ichi
    Nakamura, Tomohiro
    Nakamura, Keisuke
    Amano, Yurika
    Nishida, Keita
    Nishi, Keitaro
    Sasaki, Michihito
    Iwasaki, Tomohito
    Sawa, Hirofumi
    Komatsuzawa, Hitoshi
    Hijioka, Hiroshi
    Iwano, Hidetomo
    [J]. VIRUSES-BASEL, 2021, 13 (10):