Comparative analysis of machine learning models for shortlisting SNPs to facilitate detection of marginal epistasis in GWAS

被引:0
作者
Dasmandal, Tanwy [1 ,2 ]
Sinha, Dipro [4 ]
Rai, Anil [3 ]
Mishra, Dwijesh Chandra [4 ]
Archak, Sunil [5 ]
机构
[1] ICAR Indian Agr Res Inst, Grad Sch, New Delhi, India
[2] ICAR Natl Bur Fish Genet Resources, Lucknow, Uttar Pradesh, India
[3] Indian Council Agr Res, New Delhi, India
[4] ICAR Indian Agr Stat Res Inst, New Delhi, India
[5] ICAR Natl Bur Plant Genet Resources, New Delhi 110012, India
关键词
Marginal epistasis; Machine learning; GWAS; Feature selection; SNP-SNP interactions; GENOME-WIDE ASSOCIATION; GENETIC ARCHITECTURE; COMPLEX TRAITS; COMMON; SELECTION;
D O I
10.1007/s41060-024-00647-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Epistasis, an essential genetic element causing phenotypic diversity, is frequently characterized as the interaction between two or more genes. Previous models could identify marginal epistatic interactions by mapping variants that have nonzero marginal epistatic effects. However, these models fail short of identifying individual interaction partners. To reduce the computational burden of the existing epistasis detection algorithms without compromising the detection of exact epistatic partners, strengths of various machine learning algorithms were exploited as a filtering strategy. Seven machine learning strategies were compared for shortlisting marginally associated SNPs that includes AdaBoost, artificial neural network, 3 random forest, stepwise regression, ridge regression, lasso and elastic net. Datasets were simulated for different combinations of heritability and minor allele frequencies, and performances of different algorithms were evaluated using power and precision measures. We found that ridge regression model outperformed the other models in shortlisting marginal epistasis-related SNPs. Thus, it is expected that epistasis detection tools will benefit by adding a filtering stage using ridge regression for efficient detection of marginal epistasis in large genomic datasets.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Comparative analysis of machine learning models for anomaly detection in manufacturing
    Kharitonov, Andrey
    Nahhas, Abdulrahman
    Pohl, Matthias
    Turowski, Klaus
    3RD INTERNATIONAL CONFERENCE ON INDUSTRY 4.0 AND SMART MANUFACTURING, 2022, 200 : 1288 - 1297
  • [2] A Comparative Analysis of Machine Learning Models for the Detection of Undiagnosed Diabetes Patients
    Cichosz, Simon Lebech
    Bender, Clara
    Hejlesen, Ole
    DIABETOLOGY, 2024, 5 (01): : 1 - 11
  • [3] Comparative Analysis of Machine Learning Models in Computer Network Intrusion Detection
    Osa, Edosa
    Oghenevbaire, Ogodo Efevberha
    2022 IEEE NIGERIA 4TH INTERNATIONAL CONFERENCE ON DISRUPTIVE TECHNOLOGIES FOR SUSTAINABLE DEVELOPMENT (IEEE NIGERCON), 2022, : 648 - 652
  • [4] SNPs-based Hypertension Disease Detection via Machine Learning Techniques
    Alzubi, Raid
    Ramzan, Naeem
    Alzoubi, Hadeel
    Katsigiannis, Stamos
    2018 24TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC' 18), 2018, : 44 - 49
  • [5] A Comparative Study of Machine Learning Models for Parkinson's Disease Detection
    Bunterngchit, Chayut
    Bunterngchit, Yuthachai
    2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 465 - 469
  • [6] Comparative Analysis of Multiclass Classification Machine Learning Models for Cybersecurity Intrusion Detection
    Loughmari, Mohamed
    El Affar, Anass
    DIGITAL TECHNOLOGIES AND APPLICATIONS, ICDTA 2024, VOL 2, 2024, 1099 : 97 - 108
  • [7] A Comparative Analysis of various Machine Learning and Deep Learning Models for Gene Expression
    Thakur, Tanima
    Batra, Isha
    Malik, Arun
    2021 INTERNATIONAL CONFERENCE ON COMPUTING SCIENCES (ICCS 2021), 2021, : 139 - 142
  • [8] Comparative Analysis of Machine Learning Models for Predictive Analysis of Machine Failures
    Baldovino, Renann G.
    Camacho, Ken Sammuel I.
    Chua-Unsu, Megan Victoria Hillary Y.
    Go, Jed Leonard C.
    Munsayac, Francisco Emmanuel T. Jr, III
    Bugtai, Nilo T.
    9TH INTERNATIONAL CONFERENCE ON MECHATRONICS ENGINEERING, ICOM 2024, 2024, : 288 - 293
  • [9] Novel Alzheimer's disease genes and epistasis identified using machine learning GWAS platform
    Lundberg, Mischa
    Sng, Letitia M. F.
    Szul, Piotr
    Dunne, Rob
    Bayat, Arash
    Burnham, Samantha C.
    Bauer, Denis C.
    Twine, Natalie A.
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [10] Comparative Analysis of Machine Learning Models for Performance Prediction of the SPEC Benchmarks
    Tousi, Ashkan
    Lujan, Mikel
    IEEE ACCESS, 2022, 10 : 11994 - 12011