Machine Learning Based Prediction of Enzymatic Degradation of Plastics Using Encoded Protein Sequence and Effective Feature Representation

被引:14
|
作者
Jiang, Renjing [1 ]
Shang, Lanyu [2 ]
Wang, Ruohan [1 ]
Wang, Dong [2 ]
Wei, Na [1 ]
机构
[1] Univ Illinois, Dept Civil & Environm Engn, Urbana, IL 61801 USA
[2] Univ Illinois, Sch Informat Sci, Champaign, IL 61820 USA
基金
美国国家科学基金会;
关键词
Machine learning; plastic waste; enzymaticdegradation; enzyme function; sequence representation; HEAT-CAPACITY; PORE-SIZE; TECHNOLOGIES; DEPOLYMERASE; HYDROLYSIS; DIFFUSION; SUBSTRATE;
D O I
10.1021/acs.estlett.3c00293
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Enzyme biocatalysis for plastic treatment and recyclingis an emergingfield of growing interest. However, it is challenging and time-consumingto identify plastic-degrading enzymes with desirable functionality,given the large number of putative enzyme sequences. There is a criticalneed to develop an effective approach to accurately predict the enzymeactivity in degrading different types of plastics. In this study,we developed a machine-learning-based plastic enzymatic degradation(PED) framework to predict the ability of an enzyme to degrade plasticsof interest by exploring and recognizing hidden patterns in proteinsequences. A data set integrating information from a wide range ofexperimentally verified enzymes and various common plastic substrateswas created. A new context-aware enzyme sequence representation (CESR)mechanism was developed to learn the abundant contextual informationin enzyme sequences, and feature extraction was performed for enzymesat both the amino acid level and global sequence level. Thirteen machinelearning classification algorithms were compared, and XGBoost wasidentified as the best-performing algorithm. PED achieved an overallaccuracy of 90.2% and outperformed sequence-based protein classificationmodels from the existing literature. Furthermore, important enzymefeatures in plastic degradation were identified and comprehensivelyinterpreted. This study demonstrated a new tool for the predictionand discovery of plastic-degrading enzymes.
引用
收藏
页码:557 / 564
页数:8
相关论文
共 50 条
  • [21] Sequence-Based Prediction of Cysteine Reactivity Using Machine Learning
    Wang, Haobo
    Chen, Xuemin
    Li, Can
    Liu, Yuan
    Yang, Fan
    Wang, Chu
    BIOCHEMISTRY, 2018, 57 (04) : 451 - 460
  • [22] Comparative analysis and prediction of quorum-sensing peptides using feature representation learning and machine learning algorithms
    Wei, Leyi
    Hu, Jie
    Li, Fuyi
    Song, Jiangning
    Su, Ran
    Zou, Quan
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (01) : 106 - 119
  • [23] Prediction of Protein-Protein Interactions Using An Effective Sequence Based Combined Method
    Goktepe, Yunus Emre
    Kodaz, Halife
    NEUROCOMPUTING, 2018, 303 : 68 - 74
  • [24] Effective Crack Damage Detection Using Multilayer Sparse Feature Representation and Incremental Extreme Learning Machine
    Wang, Baoxian
    Li, Yiqiang
    Zhao, Weigang
    Zhang, Zhaoxi
    Zhang, Yufeng
    Wang, Zhe
    APPLIED SCIENCES-BASEL, 2019, 9 (03):
  • [25] Prediction of Protein Structural Class Using a Combined Representation of Protein-sequence Information and Support Vector Machine
    Wu, Li
    Dai, Qi
    Han, Bin
    Zhu, Lei
    Li, Lihua
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2010, : 101 - 106
  • [26] Meta-iAVP: A Sequence-Based Meta-Predictor for Improving the Prediction of Antiviral Peptides Using Effective Feature Representation
    Schaduangrat, Nalini
    Nantasenamat, Chanin
    Prachayasittikul, Virapong
    Shoombuatong, Watshara
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (22)
  • [27] Feature recognition for graph-based assembly product representation using machine learning
    Woerner, Jonathan M.
    Brovkina, Daniella
    Riedel, Oliver
    2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 629 - 635
  • [28] Effective Feature Engineering Technique for Heart Disease Prediction With Machine Learning
    Qadri, Azam Mehmood
    Raza, Ali
    Munir, Kashif
    Almutairi, Mubarak S.
    IEEE ACCESS, 2023, 11 : 56214 - 56224
  • [29] Tax Default Prediction Using Feature Transformation-Based Machine Learning
    Abedin, Mohammad Zoynul
    Chi, Guotai
    Uddin, Mohammed Mohi
    Satu, Md Shahriare
    Khan, Imran
    Hajek, Petr
    IEEE ACCESS, 2021, 9 : 19864 - 19881
  • [30] ACPred-FL: a sequence-based predictor using effective feature representation to improve the prediction of anti-cancer peptides
    Wei, Leyi
    Zhou, Chen
    Chen, Huangrong
    Song, Jiangning
    Su, Ran
    BIOINFORMATICS, 2018, 34 (23) : 4007 - 4016