An Efficient Lightweight Hybrid Model with Attention Mechanism for Enhancer Sequence Recognition

Times cited: 10
Authors
Aladhadh, Suliman [1]
Almatroodi, Saleh A. [2]
Habib, Shabana [1]
Alabdulatif, Abdulatif [3]
Khattak, Saeed Ullah [4]
Islam, Muhammad [5]
Affiliations
[1] Qassim Univ, Coll Comp, Dept Informat Technol, Buraydah 51452, Saudi Arabia
[2] Qassim Univ, Coll Appl Med Sci, Dept Med Labs, Buraydah 51452, Saudi Arabia
[3] Qassim Univ, Coll Comp, Dept Comp Sci, Buraydah 51452, Saudi Arabia
[4] Univ Peshawar, Ctr Biotechnol & Microbiol, Peshawar 25120, Pakistan
[5] Onaizah Coll, Coll Engn & Informat Technol, Dept Elect Engn, Onaizah 56447, Saudi Arabia
Keywords
deep learning; enhancer sequence; convolution neural network; sequential learning models; temporal attention mechanism; DEEP; PREDICTION; STRENGTH; CNN;
DOI
10.3390/biom13010070
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
Enhancers are sequences with short motifs that exhibit high positional variability and free scattering properties. Identifying these noncoding DNA fragments and their strength is extremely important because they play a key role in controlling gene regulation at the cellular level. The identification of enhancers is more complex than that of other factors in the genome because they are freely scattered and their location varies widely. In recent years, bioinformatics tools have enabled significant progress on this biological problem. However, cell line-specific screening is not possible with existing computational methods that rely solely on DNA sequences. The chromatin accessibility of a DNA segment may provide useful information about its potential regulatory function, making it possible to identify regulatory elements on that basis. In chromatin, the entanglement structure allows positions far apart in the sequence to encounter each other, regardless of their proximity to the gene to be acted upon. Thus, identifying enhancers and assessing their strength is difficult and time-consuming. The goal of our work was to overcome these limitations by presenting a deep learning model that combines a convolutional neural network (CNN) with attention-gated recurrent units (AttGRU). The model uses a CNN and one-hot encoding, primarily to identify enhancers and secondarily to classify their strength. To test its performance, the proposed model, enhancer-CNNAttGRU, was compared with existing state-of-the-art methods. The proposed model performed best at predicting stage one and stage two enhancer sequences, as well as their strengths, in a cross-species analysis, achieving best accuracy values of 87.39% and 84.46%, respectively. Overall, the proposed model provided results comparable to those of state-of-the-art models, highlighting its usefulness.
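The abstract mentions one-hot coding of DNA sequences as the input to the CNN. As a rough illustration only, the sketch below shows one common way to one-hot encode a nucleotide string. The helper name `one_hot_encode`, the A/C/G/T channel order, and the all-zero row for ambiguous bases such as N are assumptions made for this example, not details taken from the paper.

```python
# Hypothetical helper, not taken from the paper: one-hot encode a DNA string.
# Assumed channel order is A, C, G, T.
ALPHABET = "ACGT"


def one_hot_encode(sequence):
    """Return one 4-element row per base.

    Bases outside ACGT (for example N) become an all-zero row; this is an
    assumption of the sketch, other conventions exist.
    """
    rows = []
    for base in sequence.upper():
        row = [0, 0, 0, 0]
        index = ALPHABET.find(base)  # -1 when the base is not A, C, G, or T
        if index >= 0:
            row[index] = 1
        rows.append(row)
    return rows


if __name__ == "__main__":
    for base, row in zip("ACGTN", one_hot_encode("ACGTN")):
        print(base, row)
```

The resulting matrix has one row per sequence position and four columns, which is the shape a 1D convolution over the sequence would typically consume.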
Pages: 15