Logistic Regression and Logistic Regression-Genetic Algorithm for Classification of Liver Cancer Data

Cited by: 2
Authors
Wibowo, Velery Virgina Putri [1]
Rustam, Zuherman [1]
Laeli, Afifah Rofi [1]
Said, Alva Andhika [1]
Affiliation
[1] Univ Indonesia, Dept Math, Depok, Indonesia
Source
2021 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATION (DASA) | 2021
Keywords
Hepatocellular Carcinoma; Logistic Regression; Genetic Algorithm; Machine Learning; Feature Selection;
DOI
10.1109/DASA53625.2021.9682242
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Cancer is a potentially fatal condition in which abnormal cells arise and proliferate in the body. It has a high mortality rate worldwide, and the number of cases is expected to keep rising every year. One of its many types is Hepatocellular Carcinoma (HCC), a common and malignant form of primary liver cancer. HCC is aggressive, so it can spread and develop rapidly. It is often diagnosed at a late stage because most patients show no distinctive signs, and patients diagnosed at an advanced stage have a low chance of survival because their liver has already been damaged. Early diagnosis is therefore needed to increase the survival rate and to provide patients with the best treatment. Machine learning can be applied in the medical sector to diagnose diseases with high accuracy, so this study proposes the Logistic Regression (LR) method to classify HCC data. The data contain several features, some of which may not be relevant, so feature selection is needed to increase accuracy and to determine which features are important. A Genetic Algorithm (GA) was applied as the feature selection tool, and Logistic Regression without feature selection (LR) was compared with Logistic Regression with Genetic Algorithm (LR-GA) to determine which method classifies HCC better. Based on the results, LR-GA outperforms LR, achieving 93.18% accuracy, 90.91% recall, 95.45% precision, and a 93.12% F1-score.
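The abstract does not state the GA settings, the fitness function, or the data split, so the following is only a minimal sketch of the general LR-GA idea, not the authors' implementation. It assumes a binary feature mask as the chromosome, validation accuracy as the fitness, tournament selection, one-point crossover, bit-flip mutation, and a small synthetic dataset in place of the HCC data. All function names and parameter values are hypothetical.

```python
import math
import random

def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def fit_lr(X, y, epochs=60, lr=0.3):
    # Batch gradient descent on the logistic loss; the bias is the last weight.
    n, d = len(X), len(X[0])
    w = [0.0] * (d + 1)
    for _ in range(epochs):
        grad = [0.0] * (d + 1)
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + w[d]) - yi
            for j in range(d):
                grad[j] += err * xi[j]
            grad[d] += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad)]
    return w

def predict(w, X):
    d = len(w) - 1
    return [1 if sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + w[d]) >= 0.5 else 0
            for xi in X]

def scores(y_true, y_pred):
    # Returns (accuracy, recall, precision, F1) for binary labels.
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    acc = (tp + tn) / len(pairs)
    rec = tp / (tp + fn) if tp + fn else 0.0
    prec = tp / (tp + fp) if tp + fp else 0.0
    f1 = 2 * prec * rec / (prec + rec) if prec + rec else 0.0
    return acc, rec, prec, f1

def select(X, mask):
    # Keep only the columns whose mask bit is 1.
    return [[x for x, keep in zip(row, mask) if keep] for row in X]

def fitness(mask, Xtr, ytr, Xva, yva):
    # Assumed fitness: validation accuracy of LR trained on the masked features.
    if not any(mask):
        return 0.0
    w = fit_lr(select(Xtr, mask), ytr)
    return scores(yva, predict(w, select(Xva, mask)))[0]

def ga_select(Xtr, ytr, Xva, yva, pop_size=10, generations=8, p_mut=0.1, seed=0):
    rng = random.Random(seed)
    d = len(Xtr[0])
    cache = {}
    def fit(mask):
        key = tuple(mask)
        if key not in cache:
            cache[key] = fitness(mask, Xtr, ytr, Xva, yva)
        return cache[key]
    # Seed the population with the all-features mask so the GA starts from plain LR.
    pop = [[1] * d] + [[rng.randint(0, 1) for _ in range(d)] for _ in range(pop_size - 1)]
    for _ in range(generations):
        pop.sort(key=fit, reverse=True)
        nxt = [pop[0][:], pop[1][:]]  # elitism: carry over the two best masks
        while len(nxt) < pop_size:
            a, b = (max(rng.sample(pop, 3), key=fit) for _ in range(2))  # tournaments
            cut = rng.randrange(1, d)                                    # one-point crossover
            child = [1 - g if rng.random() < p_mut else g for g in a[:cut] + b[cut:]]
            nxt.append(child)
        pop = nxt
    best = max(pop, key=fit)
    return best, fit(best)

def make_data(n, d, informative, rng):
    # Synthetic stand-in data: only the first `informative` features depend on the label.
    X, y = [], []
    for _ in range(n):
        label = rng.randint(0, 1)
        X.append([rng.gauss(1.5 * label if j < informative else 0.0, 1.0) for j in range(d)])
        y.append(label)
    return X, y

X, y = make_data(120, 6, 2, random.Random(42))
Xtr, ytr, Xva, yva = X[:80], y[:80], X[80:], y[80:]
base_acc = fitness([1] * 6, Xtr, ytr, Xva, yva)       # LR on all features
best_mask, best_acc = ga_select(Xtr, ytr, Xva, yva)   # LR-GA
print("LR accuracy:", base_acc)
print("LR-GA mask:", best_mask, "accuracy:", best_acc)
```

Because the all-features mask is in the initial population and the best masks are carried over unchanged, the sketch's LR-GA validation accuracy can never fall below that of plain LR. The accuracy is measured on the same split that guided the search, so it is optimistic; a fair comparison would report all four metrics on a separate held-out test set.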
Pages: 5