A pattern recognition model to distinguish cancerous DNA sequences via signal processing methods

被引:10
作者
Khodaei, Amin [1 ]
Feizi-Derakhshi, Mohammad-Reza [1 ]
Mozaffari-Tazehkand, Behzad [1 ]
机构
[1] Tabriz Univ, Fac Elect & Comp Engn, 29 Bahman Blvd, Tabriz, Iran
关键词
Cancer; DNA sequence; Anti-notch filter; Discrete Fourier transform; Pattern recognition; SUPPORT VECTOR MACHINES; PROTEIN-CODING REGIONS; PREDICTION; CLASSIFICATION; IDENTIFICATION; SOFTWARE; HEALTHY; NETWORK; GENES;
D O I
10.1007/s00500-020-04942-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cancer is one of the life-threatening diseases caused by changes in the structure of genetic components of the cell. DNA sequences are one of the most important factors in the formation and spread of this disease. The signal processing approach is one of the scientific fields that has been developed in the last two decades in the analysis of DNA sequences. In this research, a hybrid model of discrete Fourier transform and anti-notch digital filter has been used for this purpose. The aim of using these techniques is to model an approach that can distinguish cancerous samples from non-cancerous ones. In other words, a pattern recognition model is designed to discriminate cancerous cell samples based on the features of protein coding regions of DNA sequences. Some computational and statistical techniques have been used in feature extraction and feature selection stages. Despite the proposed model simplicity, it doesn't face conventional challenges such as high computational complexity or memory dissipation. Case studies have been tested with the least possible feature, depending on the nature of the features. Experimental results and features relationship led to the proposal of the SVM classifier to discriminate two categories. The output features and classification show good discrimination results among the cancerous and non-cancerous samples. One of the main advantages of the proposed model is the independence of its performance over the data length. Evaluation and validation results indicate the high accuracy and precision of the proposed method which emphasizes the biological genetic mutation nature of cancer.
引用
收藏
页码:16315 / 16334
页数:20
相关论文
共 46 条
[41]   N-Gram Pattern Recognition using Multivariate-Bernoulli Model with Smoothing Methods for Text Classification [J].
Kilimci, Zeynep Hilal ;
Akyokus, Selim .
2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, :597-600
[42]   Deep Learning Techniques for Pattern Recognition in EEG Audio Signal-Processing-Based Eye-Closed and Eye-Open Cases [J].
Almukhtar, Firas Husham ;
Ajwad, Asmaa Abbas ;
Kamil, Amna Shibib ;
Jaleel, Refed Adnan ;
Kamil, Raya Adil ;
Mosa, Sarah Jalal .
ELECTRONICS, 2022, 11 (23)
[43]   Pattern Recognition Methods of Multi-source Partial Discharge Based on the Improved Deformable DETR Model and its Application [J].
Lei, Zhipeng ;
Peng, Chuan ;
Xu, Zihan ;
Jiang, Wanting ;
Li, Chuanyang ;
Lin, Lingyan ;
Peng, Bangfa .
Zhongguo Dianji Gongcheng Xuebao/Proceedings of the Chinese Society of Electrical Engineering, 2024, 44 (15) :6248-6260
[44]   Intelligent and robust computational prediction model for DNA N4-methylcytosine sites via natural language processing [J].
Tahir, Muhammd ;
Tayara, Hilal ;
Hayat, Maqsood ;
Chong, Kil To .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 217
[45]   Structural damage identification via physics-guided machine learning: a methodology integrating pattern recognition with finite element model updating [J].
Zhang, Zhiming ;
Sun, Chao .
STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2021, 20 (04) :1675-1688
[46]   Enhanced recognition of insulator defects on power transmission lines via proposal-based detection model with integrated improvement methods [J].
Wang, Qinglong ;
Li, Yongjian ;
Cui, Shihao ;
Li, Na ;
Zhang, Xinchun ;
Jiang, Wenqiang ;
Peng, Wen ;
Sun, Jie .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136