Hepatitis C Prediction Using Machine Learning and Deep Learning-Based Hybrid Approach with Biomarker and Clinical Data

被引:0
作者
Rokiya Ripa [1 ]
Khandaker Mohammad Mohi Uddin [2 ]
Mir Jafikul Alam [1 ]
Md. Mahbubur Rahman [3 ]
机构
[1] Department of Computer Science and Engineering, Dhaka International University, Dhaka
[2] Department of Computer Science and Engineering, Southeast University, Dhaka
[3] Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka
来源
Biomedical Materials & Devices | 2025年 / 3卷 / 1期
关键词
Classification; Deep learning; Hepatitis C virus; Machine learning; Predictions;
D O I
10.1007/s44174-024-00197-x
中图分类号
学科分类号
摘要
Chronic liver damage is believed to be mostly caused by the Hepatitis C virus (HCV). About 90% of hepatitis C infections progress to chronic hepatitis. Acute HCV infection is a condition that frequently progresses to liver cirrhosis and eventually liver cancer; therefore, understanding this stage of the virus is essential. Molecular and serological testing approaches are often expensive and difficult to perform for diagnosing HCV infection. Machine learning technology can be effectively employed to identify patterns or associations for diagnosing HCV infection. The study utilized machine learning techniques to develop classification models for hepatitis C illness, aiming to anticipate the virus responsible for the infection. Our research integrates various machine learning algorithms, including Random Forest, Cat Boost, Bagging Classifier, SGD Classifier, Gaussian NB, Bernoulli NB, Multinomial NB, Linear Discriminant Analysis, ANN, and MLP. Prior to training the models, preprocessing methods, including normalization, filtering, and SMOTE, were used to improve the dataset’s attributes. The classification accuracy scores show encouraging results, with ANN scoring 79.63%, MLP scoring 46.29%, Bernoulli NB scoring 79.62%, Cat Boost scoring 98.14%, and Random Forest scoring 99.53%. Among classification methods, random forest demonstrates the highest accuracy in diagnosing HCV infection. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
引用
收藏
页码:558 / 575
页数:17
相关论文
共 54 条
[41]  
Rigg J., Doyle O., McDonogh N., Leavitt N., Ali R., Son A., Kreter B., Finding undiagnosed patients with hepatitis C virus: an application of machine learning to US ambulatory electronic medical records, BMJ Health Care Inform, 30, 1, (2023)
[42]  
HCV Data Data Set. UCI Machine Learning Repository, (2020)
[43]  
Hoffmann G., Bietenbeck A., Lichtinghagen R., Klawonn F., Using machine learning techniques to generate laboratory diagnostic pathways—a case study, J. Lab. Precis. Med, 3, 6, (2018)
[44]  
Hall M.A., Correlation-based feature selection for machine learning (Doctoral dissertation, (1999)
[45]  
Uddin K.M.M., Dey S.K., Babu H.M.H., A Voice assistive mobile application tool to detect cardiovascular disease using machine learning approach, Biomed. Mater. Dev, (2024)
[46]  
Natekin A., Knoll A., Gradient boosting machines, a tutorial, Front. Neurorobot, 7, (2013)
[47]  
Le N.Q.K., Ho Q.T., Nguyen V.N., Chang J.S., BERT-promoter: an improved sequence-based predictor of DNA promoter using BERT pre-trained model and SHAP feature selection, Comput. Biol. Chem, 99, (2022)
[48]  
Rahman M.M., A web-based heart disease prediction system using machine learning algorithms, Netw. Biol, 12, 2, (2022)
[49]  
Langley P., Selection of relevant features in machine learning, Proceedings of the AAAI Fall symposium on relevance, 184, pp. 245-271, (1994)
[50]  
Ganapathy S., Kulothungan K., Muthurajkumar S., Vijayalakshmi M., Yogesh P., Kannan A., Intelligent feature selection and classification techniques for intrusion detection in networks: a survey, EURASIP J. Wirel. Commun. Netw, 2013, pp. 1-16, (2013)