Application of K-means clustering based on artificial intelligence in gene statistics of biological information engineering

被引：3

作者：

He, Zheng ^{[1
]}

Shen, Xinyu ^{[2
]}

Zhou, Yanlin ^{[3
]}

Wang, Yong ^{[4
]}

机构：

[1] Columbia Univ, Appl Analyt, New York, NY 10027 USA

[2] Columbia Univ, Biostat, New York, NY USA

[3] Johns Hopkins Univ, Comp Sci, Baltimore, MD USA

[4] Univ Aberdeen, Informat Technol, Aberdeen, Scotland

来源：

PROCEEDINGS OF 2024 4TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND INTELLIGENT COMPUTING, BIC 2024 | 2024年

关键词：

K_ mean clustering; Disease detection; Single nucleotide polymorphism; Bioinformation process;

D O I：

10.1145/3665689.3665767

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

In the field of bioinformatics, the algorithm of biological gene sequence has always been one of the hot problems in scientific research. With the combination of biology and artificial intelligence technology, genetic algorithm data is increasing. At the same time, the emergence of a new generation of sequencing technology, the decrease in sequencing time and cost, and the high sequencing throughput have significantly increased the sequence data, showing an exponential growth trend, and there are still new biological gene sequence data found and recorded every day, and the speed of data generation is much faster than the speed of data processing, so the processing of large-scale DNA sequencing data needs more efficient methods. Therefore, AI bioinformatics engineering, especially genetic algorithms and K-means cluster analysis, has become an important tool in the field of bioinformatics statistics, with particular impact on the detection and diagnosis of single nucleotide polymorphisms (SNPS) associated with contact dermatitis. These advanced AI models efficiently process large amounts of genomic data, including SNP datasets, and have the ability to analyze patient genotype information. By doing so, they can identify SNPS that are strongly associated with contact dermatitis and establish meaningful associations between these genetic variants and the disease.The adoption of this personalized medicine approach not only addresses the specific needs of individual patients but also significantly enhances the success rate of treatment. Through the analysis of convolution algorithm and gene sequence in biological information engineering, this paper focuses on the relevant experiments of gene K-cluster analysis model, demonstrating the advantages and reference significance of gene algorithm under K-cluster analysis for current gene information statistics.

引用

页码：468 / 473

页数：6

共 18 条

[1] Building an Intrusion Detection System Using a Filter-Based Feature Selection Algorithm
Ambusaidi, Mohammed A.
He, Xiangjian
Nanda, Priyadarsi
Tan, Zhiyuan
[J]. IEEE TRANSACTIONS ON COMPUTERS, 2016, 65 (10) : 2986 - 2998
[2] Cai Jiangfeng, 2023, 2023 3 INT C CONS EL, P834, DOI 10.1109/mce.2022.3206678
[3] Che C., 2023, Journal of Theory and Practice of Engineering Science, V3, P36
[4] A multi-level intrusion detection method for abnormal network behaviors
Ji, Soo-Yeon
Jeong, Bong-Keun
Choi, Seonho
Jeong, Dong Hyun
[J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2016, 62 : 9 - 17
[5] SomaticSniper: identification of somatic point mutations in whole genome sequencing data
Larson, David E.
Harris, Christopher C.
Chen, Ken
Koboldt, Daniel C.
Abbott, Travis E.
Dooling, David J.
Ley, Timothy J.
Mardis, Elaine R.
Wilson, Richard K.
Ding, Li
[J]. BIOINFORMATICS, 2012, 28 (03) : 311 - 317
[6] Li Linxiao, 2020, ADV NEUR IN, V33
[7] Liu B., 2023, Journal of Theory and Practice of Engineering Science, V3, P36, DOI [DOI 10.53469/JTPES.2023.03(12).06, 10.53469/jtpes.2023.03(12).06]
[8] Liu B, 2023, Arxiv, DOI arXiv:2312.12872
[9] Liu Yuxiang, 2023, Journal of Theory and Practice of Engineering Science, V3, P22
[10] Moustafa N., 2015, MIL COMM INF SYST C, P1, DOI DOI 10.1109/MILCIS.2015.7348942

← 1 2 →