共 37 条
Enabling Artificial Intelligence for Genome Sequence Analysis of COVID-19 and Alike Viruses
被引:40
作者:
Ahmed, Imran
[1
]
Jeon, Gwanggil
[2
]
机构:
[1] Inst Management Sci, Ctr Excellence IT, Peshawar 25000, Khyber Pakhtunk, Pakistan
[2] Incheon Natl Univ, Dept Embedded Syst Engn, Incheon, South Korea
关键词:
Genome sequence analysis;
Artificial intelligence;
Machine learning;
SVM;
COVID-19;
PERSON DETECTOR;
FRAMEWORK;
IOT;
D O I:
10.1007/s12539-021-00465-0
中图分类号:
Q [生物科学];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
Recent pandemic of COVID-19 (Coronavirus) caused by severe acute respiratory syndrome Coronavirus 2 (SARS-CoV-2) has been growing lethally with unusual speed. It has infected millions of people and continues a mortifying influence on the global population's health and well-being. In this situation, genome sequence analysis and advanced artificial intelligence techniques may help researchers and medical experts to understand the genetic variants of COVID-19 or SARS-CoV-2. Genome sequence analysis of COVID-19 is crucial to understand the virus's origin, behavior, and structure, which might help produce/develop vaccines, antiviral drugs, and efficient preventive strategies. This paper introduces an artificial intelligence based system to perform genome sequence analysis of COVID-19 and alike viruses, e.g., SARS, middle east respiratory syndrome, and Ebola. The system helps to get important information from the genome sequences of different viruses. We perform comparative data analysis by extracting basic information of COVID-19 and other genome sequences, including information of nucleotides composition and their frequency, tri-nucleotide compositions, count of amino acids, alignment between genome sequences, and their DNA similarity information. We use different visualization methods to analyze these viruses' genome sequences and, finally, apply machine learning based classifier support vector machine to classify different genome sequences. The data set of different virus genome sequences are obtained from an online publicly accessible data center repository. The system achieves good classification results with an accuracy of 97% for COVID-19, 96%, SARS, and 95% for MERS and Ebola genome sequences, respectively.
引用
收藏
页码:504 / 519
页数:16
相关论文