Protein classification by autofluorescence spectral shape analysis using machine learning

被引:0
作者
Mukunda, Darshan Chikkanayakanahalli [1 ]
Rodrigues, Jackson [1 ]
Chandra, Subhash [1 ]
Mazumder, Nirmal [1 ]
Vitkin, Alex [2 ]
Mahato, Krishna Kishore [1 ]
机构
[1] Manipal Acad Higher Educ, Manipal Sch Life Sci, Dept Biophys, Manipal 576104, Karnataka, India
[2] Univ Toronto, Dept Med Biophys, Toronto, ON M5G 1L7, Canada
关键词
Proteins; Machine learning; Support vector machine; Autofluorescence; Autofluorescence library; TRYPTOPHAN FLUORESCENCE-SPECTRA; RESONANCE ENERGY-TRANSFER; LOG-NORMAL COMPONENTS; INFRARED-SPECTROSCOPY; DECOMPOSITION; GLYCATION; HSA;
D O I
10.1016/j.talanta.2023.125167
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Depending on the relative numbers and spatial arrangement of Tryptophan (Trp; W) and Tyrosine (Tyr; Y) residues, different proteins produce distinct autofluorescence (AF) spectral shapes when excited at similar to 280 nm. Yet, considering the vast number and heterogeneous forms in nature, visual analysis and precise identification of proteins based on their AF spectra is challenging and further compounded in cases when different proteins produce substantially similar AF spectral shapes. There is, thus, a serious need to develop a methodology to address this problem. The current study proposes a practical technology to quickly identify proteins using machine learning (ML) algorithms based on their AF spectra. Specifically, AF spectra of fifteen different standard proteins of varying origin with distinct structural and Trp/Tyr compositions were recorded; based on the spectral features selected by the Minimum-Redundancy-Maximum-Relevance (mRMR) algorithm, a multiclass Support Vector Machine (SVM) learning model with Radial Basis Function (RBF), Polynomial, and Linear kernels classified the proteins with high accuracy of 99.06%, 99.03%, and 98.29% respectively. Since protein identification is the key to understand biological functions and disease diagnosis, the proposed methodology could offer a viable alternative to and improve the existing protein identification techniques.
引用
收藏
页数:10
相关论文
共 49 条
[1]   Methylglyoxal induced glycation and aggregation of human serum albumin: Biochemical and biophysical approach [J].
Ahmed, Azaj ;
Shamsi, Anas ;
Khan, Mohd Shahnawaz ;
Husain, Fohad Mabood ;
Bano, Bilqees .
INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2018, 113 :269-276
[2]   Review: Glycation of human serum albumin [J].
Anguizola, Jeanethe ;
Matsuda, Ryan ;
Barnaby, Omar S. ;
Hoy, K. S. ;
Wa, Chunling ;
DeBolt, Erin ;
Koke, Michelle ;
Hage, David S. .
CLINICA CHIMICA ACTA, 2013, 425 :64-76
[3]   A comparison of Raman and FT-IR spectroscopy for the prediction of meat spoilage [J].
Argyri, Anthoula A. ;
Jarvis, Roger M. ;
Wedge, David ;
Xu, Yun ;
Panagou, Efstathios Z. ;
Goodacre, Royston ;
Nychas, George-John E. .
FOOD CONTROL, 2013, 29 (02) :461-470
[4]   Proteomics: Technologies and Their Applications [J].
Aslam, Bilal ;
Basit, Madiha ;
Nisar, Muhammad Atif ;
Khurshid, Mohsin ;
Rasool, Muhammad Hidayat .
JOURNAL OF CHROMATOGRAPHIC SCIENCE, 2017, 55 (02) :182-196
[5]   Support vector machine and principal component analysis for microarray data classification [J].
Astuti, Widi ;
Adiwijaya .
INTERNATIONAL CONFERENCE ON DATA AND INFORMATION SCIENCE (ICODIS), 2018, 971
[6]  
Bakheet S, 2017, COMPUTATION, V5, DOI 10.3390/computation5010004
[7]   Label-free SERS detection of proteins based on machine learning classification of chemo-structural determinants [J].
Barucci, Andrea ;
D'Andrea, Cristiano ;
Farnesi, Edoardo ;
Banchelli, Martina ;
Amicucci, Chiara ;
de Angelis, Marella ;
Hwang, Byungil ;
Matteini, Paolo .
ANALYST, 2021, 146 (02) :674-682
[8]   Deep Ultraviolet Plasmonic Enhancement of Single Protein Autofluorescence in Zero-Mode Waveguides [J].
Barulin, Aleksandr ;
Claude, Jean-Benoit ;
Patra, Satyajit ;
Bonod, Nicolas ;
Wenger, Jerome .
NANO LETTERS, 2019, 19 (10) :7434-7442
[9]   Support vector machines in tandem with infrared spectroscopy for geographical classification of green arabica coffee [J].
Bona, Evandro ;
Marquetti, Izabele ;
Link, Jade Varaschim ;
Figueiredo Makimori, Gustavo Yasuo ;
Arca, Vinicius da Costa ;
Guimardes Lemes, Andre Luis ;
Garcia Ferreira, Juliana Mendes ;
dos Santos Scholz, Maria Brigida ;
Valderrama, Patricia ;
Poppi, Ronei Jesus .
LWT-FOOD SCIENCE AND TECHNOLOGY, 2017, 76 :330-336
[10]   Prediction of lysine ubiquitination with mRMR feature selection and analysis [J].
Cai, Yudong ;
Huang, Tao ;
Hu, Lele ;
Shi, Xiaohe ;
Xie, Lu ;
Li, Yixue .
AMINO ACIDS, 2012, 42 (04) :1387-1395