SpectraTr: A novel deep learning model for qualitative analysis of drug spectroscopy based on transformer structure

被引:22
作者
Fu, Pengyou [1 ,2 ]
Wen, Yue [2 ]
Zhang, Yuke [3 ]
Li, Lingqiao [1 ]
Feng, Yanchun [4 ]
Yin, Lihui [4 ]
Yang, Huihua [1 ,2 ]
机构
[1] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, 1 Jinji Rd, Guilin 541004, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Articial Intelligence, 10 Xitucheng Rd, Beijing 100876, Peoples R China
[3] Beijing Univ Posts & Telecommun, Sch Int, 10 Xitucheng Rd, Beijing 100876, Peoples R China
[4] Natl Inst Food & Drug Control, 10 Tiantanxili Rd, Beijing 100050, Peoples R China
基金
中国国家自然科学基金;
关键词
Near-infrared spectroscopy analysis; drug supervision; transformer structure; deep learning; chemometrics; NETWORK; AUTOENCODER;
D O I
10.1142/S1793545822500213
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
The drug supervision methods based on near-infrared spectroscopy analysis are heavily dependent on the chemometrics model which characterizes the relationship between spectral data and drug categories. The preliminary application of convolution neural network in spectral analysis demonstrates excellent end-to-end prediction ability, but it is sensitive to the hyper-parameters of the network. The transformer is a deep-learning model based on self-attention mechanism that compares convolutional neural networks (CNNs) in predictive performance and has an easy-to-design model structure. Hence, a novel calibration model named SpectraTr, based on the transformer structure, is proposed and used for the qualitative analysis of drug spectrum. The experimental results of seven classes of drug and 18 classes of drug show that the proposed SpectraTr model can automatically extract features from a huge number of spectra, is not dependent on pre-processing algorithms, and is insensitive to model hyperparameters. When the ratio of the training set to test set is 8:2, the prediction accuracy of the SpectraTr model reaches 100% and 99.52%, respectively, which outperforms PLS_DA, SVM, SAE, and CNN. The model is also tested on a public drug data set, and achieved classification accuracy of 96.97% without pre-processing algorithm, which is 34.85%, 28.28%, 5.05%, and 2.73% higher than PLS_DA, SVM, SAE, and CNN, respectively. The research shows that the SpectraTr model performs exceptionally well in spectral analysis and is expected to be a novel deep calibration model after Autoencoder networks (AEs) and CNN.
引用
收藏
页数:11
相关论文
共 31 条
[21]   Effective prediction of soil organic matter by deep SVD concatenation using FT-NIR spectroscopy [J].
Qiao, Hanli ;
Shi, Xiubo ;
Chen, Huazhou ;
Lyu, Jingyi ;
Hong, Shaoyong .
SOIL & TILLAGE RESEARCH, 2022, 215
[22]   Anomaly detection during milk processing by autoencoder neural network based on near-infrared spectroscopy [J].
Vasafi, Pegah Sadeghi ;
Paquet-Durand, Olivier ;
Brettschneider, Kim ;
Hinrichs, Joerg ;
Hitzmann, Bernd .
JOURNAL OF FOOD ENGINEERING, 2021, 299
[23]  
Vaswani A, 2017, ADV NEUR IN, V30
[24]   Artificial intelligence in reaction prediction and chemical synthesis [J].
Venkatasubramanian, Venkat ;
Mann, Vipul .
CURRENT OPINION IN CHEMICAL ENGINEERING, 2022, 36
[25]   Spectra selection methods: A novel optimization way for treating dynamic spectra and in-line near infrared modeling [J].
Wang, Haiyan ;
Liu, Ronghua ;
Nie, Lei ;
Xu, Dongbo ;
Yin, Wenping ;
Li, Lian ;
Zang, Hengchang .
JOURNAL OF INNOVATIVE OPTICAL HEALTH SCIENCES, 2020, 13 (04)
[26]   Multi-constraint molecular generation based on conditional transformer, knowledge distillation and reinforcement learning [J].
Wang, Jike ;
Hsieh, Chang-Yu ;
Wang, Mingyang ;
Wang, Xiaorui ;
Wu, Zhenxing ;
Jiang, Dejun ;
Liao, Benben ;
Zhang, Xujun ;
Yang, Bo ;
He, Qiaojun ;
Cao, Dongsheng ;
Chen, Xi ;
Hou, Tingjun .
NATURE MACHINE INTELLIGENCE, 2021, 3 (10) :914-922
[27]   Determination of the immunoglobulin G precipitation end-point by an intelligent near-infrared spectroscopy system [J].
Yu, Chen ;
Quan, Shuang ;
Yang, Cui ;
Zhang, Chengliang ;
Fan, Jiajin ;
Li, Lian ;
Zang, Hengchang .
JOURNAL OF INNOVATIVE OPTICAL HEALTH SCIENCES, 2021, 14 (03)
[28]  
Yun B., 2021, ARXIV
[29]  
Yun L., 2021, CHEMOMETR INTELL LAB, V214
[30]   Food and agro-product quality evaluation based on spectroscopy and deep learning: A review [J].
Zhang, Xiaolei ;
Yang, Jie ;
Lin, Tao ;
Ying, Yibin .
TRENDS IN FOOD SCIENCE & TECHNOLOGY, 2021, 112 :431-441