The peptide therapeutics market is creating new opportunities for the biotechnology and pharmaceutical industries, so identifying therapeutic peptides and characterizing their properties is important. Although several studies have proposed machine learning methods for predicting whether a peptide is therapeutic, most do not explain in detail the factors behind the model's decisions. In this work, we developed an Interpretable Therapeutic Peptide Prediction (ITP-Pred) model based on efficient feature fusion. First, we proposed three kinds of feature descriptors based on sequence and physicochemical property encodings, namely amino acid composition (AAC), group AAC, and coding autocorrelation, and concatenated them to obtain the feature representation of a therapeutic peptide. We then fed this representation into a CNN and bidirectional long short-term memory (CNN-BiLSTM) model, which automatically learns to recognize therapeutic peptides. Cross-validation and independent validation experiments indicated that ITP-Pred achieves higher prediction performance on the benchmark dataset than the methods it was compared with. Finally, we analyzed the model's output from two perspectives, sequence order and physicochemical properties, and mined important features to guide the design of better models that can complement existing methods.
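The feature-fusion step described above can be illustrated with a minimal sketch. It is not the ITP-Pred implementation: the five-class residue grouping, the use of the Kyte-Doolittle hydropathy scale as the physicochemical property, the lag-based autocovariance standing in for "coding autocorrelation", and all function names are assumptions made for illustration only.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Assumed five-class physicochemical grouping (a common convention,
# not taken from the paper).
GROUPS = {
    "aliphatic": "GAVLMI",
    "aromatic": "FYW",
    "positive": "KRH",
    "negative": "DE",
    "uncharged": "STCPNQ",
}

# Kyte-Doolittle hydropathy scale, used here as an assumed property encoding.
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aac(seq):
    """Amino acid composition: fraction of each of the 20 residues."""
    counts = Counter(seq)
    return [counts[a] / len(seq) for a in AMINO_ACIDS]


def group_aac(seq):
    """Grouped AAC: fraction of residues falling in each assumed group."""
    counts = Counter(seq)
    return [sum(counts[a] for a in members) / len(seq)
            for members in GROUPS.values()]


def autocorrelation(seq, max_lag=3):
    """Mean-centered autocovariance of the property values at lags 1..max_lag."""
    values = [HYDROPATHY[a] for a in seq]
    mean = sum(values) / len(values)
    centered = [v - mean for v in values]
    n = len(centered)
    return [
        sum(centered[i] * centered[i + lag] for i in range(n - lag)) / (n - lag)
        for lag in range(1, max_lag + 1)
    ]


def encode(seq, max_lag=3):
    """Concatenate the three descriptors into one feature vector."""
    seq = seq.upper()
    if len(seq) <= max_lag:
        raise ValueError("sequence must be longer than max_lag")
    if any(a not in HYDROPATHY for a in seq):
        raise ValueError("sequence contains a non-standard residue")
    return aac(seq) + group_aac(seq) + autocorrelation(seq, max_lag)


# Hypothetical peptide used only to show the call.
features = encode("GLFDIVKKVVGALGSL")
print(len(features))  # 20 AAC + 5 group AAC + 3 autocorrelation lags = 28
```

In a pipeline like the one described, a vector of this kind would be computed per peptide and passed to the CNN-BiLSTM classifier; the descriptor dimensions and the network input shape used by the authors are not stated in the abstract.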