Group penalized logistic regression differentiates between benign and malignant ovarian tumors

被引：0

作者：

Xuemei Hu

Ying Xie

Yanlin Yang

Huifeng Jiang

机构：

[1] Chongqing Technology and Business University,School of Mathematics and Statistics

[2] Chongqing Technology and Business University,Chongqing Key Laboratory of Social Economy and Applied Statistics

[3] Chongqing Vocational College of Science and Technology,General National Defense Education College

[4] Chongqing University of Eduaction,School of Economics and Business Administration

[5] Chongqing Technology and Business University,Research Center for Economy of Upper Reaches of the Yangtse River

来源：

Soft Computing | 2023年 / 27卷

关键词：

Ovarian cancer; GCD algorithm; GLASSO/GSCAD/GMCP penalty; Machine learning methods; Deep learning methods;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Ovarian cancer is one of the most common types of cancer in women. Correct differentiation between benign and malignant ovarian tumors is of immense importance in medical fields. In this paper, we introduce group penalized logistic regressions to enhance diagnosis accuracy. Firstly, we divide 349 ovarian cancer patients into two sets: one for learning model parameters, and the other for assessing prediction performance, and select 46 variables from 49 traits as the predictor vector to construct GLASSO/GSCAD/GMCP penalized logistic regressions with 11 groups. Secondly, we develop group coordinate descent (GCD) algorithm and its specific pseudo code to simultaneously complete group selection and group estimation, introduce the tenfold cross validation (CV) procedure to select the relatively optimal tuning parameter, and apply the testing set and Youden index to obtain class probability estimator and class index information. Finally, we compute the accuracy, precision, specificity, sensitivity, F1-score and the area under ROC curve (AUC) to assess the prediction performance to the proposed group penalized methods, and found that GLASSO/GSCAD/GMCP penalized logistic regressions outperform three machine learning methods (ANN, artificial neural network; SVM, support vector machine; XGBoost, eXtreme gradient boosting) and three deep learning methods (CNN, convolutional neural network; DNN, deep neural network; RNN, recurrent neural network) in terms of accuracy, F1-score and AUC.

引用

页码：18565 / 18584

页数：19

共 143 条

[1] Alam TM(2022)An efficient deep learning-based skin cancer classifier for an imbalanced dataset Diagnostics (Basel) 12 2115-441
[2] Shaukat K(2022)Melanoma detection using deep learning-based classifications Healthcare (Basel) 10 2481-253
[3] Khan WA(2012)A comparison of CA125, HE4, risk ovarian malignancy algorithm (ROMA), and risk malignancy index (RMI) for the classification of ovarian masses Clinics (Sao Paulo) 67 437-187
[4] Hameed IA(2022)Automatic malignant and benign skin cancer classification using a hybrid deep learning approach Diagnostics (Basel) 12 2472-684
[5] Almuqren LA(2011)Coordinate descent algorithms for nonconvex penalized regression, with applications to biological feature selection Ann Appl Stat 5 232-94
[6] Raza MA(2015)Group descent algorithms for nonconvex penalized linear and logistic regression models with grouped predictors Stat Comput 25 173-10074
[7] Aslam M(2017)The study of credit scoring model based on group LASSO Procedia Comput Sci 122 677-150
[8] Luo S(2021)A novel graph convolutional feature based convolutional neural network for stock trend prediction Inf Sci 556 67-30
[9] Alwakid G(2023)A co-evolutionary genetic algorithm for robust and balanced controller placement in software-defined networks J Netw Comput Appl 212 10063-20
[10] Gouda W(2022)A genetic programming-based approach for classifying pancreatic adenocarcinoma: the SICED experience Soft Comput 26 138-1360

← 1 2 3 4 5 6 7 8 9 10 →