Application and impact of Lasso regression in gastroenterology: A systematic review

被引:18
作者
Ali, Hassam [1 ]
Shahzad, Maria [2 ]
Sarfraz, Shiza [2 ]
Sewell, Kerry B. B. [3 ]
Alqalyoobi, Shehabaldin [4 ,5 ]
Mohan, Babu P. P. [6 ]
机构
[1] East Carolina Univ, Dept Gastroenterol & Hepatol, Greenville, NC USA
[2] Univ Hlth Sci, Dept Internal Med, Lahore, Punjab, Pakistan
[3] East Carolina Univ, Laupus Hlth Sci Lib, Greenville, NC USA
[4] East Carolina Univ, Dept Pulm & Crit Care Med, Greenville, NC USA
[5] Univ Louisville, Sch Publ Hlth & Informat Sci, Dept Bioinformat & Biostat, Louisville, KY USA
[6] Orlando Gastroenterol PA, Gastroenterol & Hepatol, 1507 S Hiawassee Rd,Ste 105, Orlando, FL 32835 USA
关键词
Clinical decision-making; Diagnostic accuracy; Esophageal cancer; High-dimensional data; Inflammatory bowel disease; Gastroenterology; Lasso regression; Liver disease; Machine learning; Prediction modeling; Regularization; Variable selection; FATTY LIVER-DISEASE; VALIDATION; ALGORITHM; NOMOGRAM; ADULTS;
D O I
10.1007/s12664-023-01426-9
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
Least absolute shrinkage and selection operator (Lasso) regression is a statistical technique that can be used to study the effects of clinical variables in outcome prediction. In this study, we aimed at systematically reviewing the application of Lasso regression in gastroenterology for developing predictive models and providing a method of performing Lasso regression. A comprehensive search strategy was conducted in PubMed, Embase and Cochrane CENTRAL databases (Keywords: lasso regression; gastrointestinal tract/diseases) following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. Studies were screened for eligibility based on pre-defined selection criteria and the data was extracted using a standardized form. Total 16 studies were included, comprising a diverse range of gastroenterological disease-related outcomes. Sample sizes ranged from 134 to 8861 subjects. Eleven studies reported liver disease-related prediction models, while five focused on non-hepatic etiology models. Lasso regression was applied for variable selection, risk prediction and model development, with various validation methods and performance metrics used. Model performance metrics included Area Under the Receiver Operating Characteristics (AUROC), C-index and calibration plots. In gastroenterology, Lasso regression has been used in various diseases such as inflammatory bowel disease, liver disease and esophageal cancer. It is valuable for complex scenarios with many predictors. However, its effectiveness depends on high-quality and complete data. While it identifies important variables, it doesn't provide causal interpretations. Therefore, cautious interpretation is necessary considering the study design and data quality.
引用
收藏
页码:780 / 790
页数:11
相关论文
共 23 条
[1]   Predicting the individualized risk of poor adherence to ART medication among adolescents living with HIV in Uganda: the Suubi plus Adherence study [J].
Brathwaite, Rachel ;
Ssewamala, Fred M. ;
Neilands, Torsten B. ;
Okumu, Moses ;
Mutumba, Massy ;
Damulira, Christopher ;
Nabunya, Proscovia ;
Kizito, Samuel ;
Bahar, Ozge Sensoy ;
Mellins, Claude A. ;
McKay, Mary M. .
JOURNAL OF THE INTERNATIONAL AIDS SOCIETY, 2021, 24 (06)
[2]   A Nomogram Model Based on Noninvasive Bioindicators to Predict 3-Year Risk of Nonalcoholic Fatty Liver in Nonobese Mainland Chinese: A Prospective Cohort Study [J].
Cai, Xintian ;
Aierken, Xiayire ;
Ahmat, Ayguzal ;
Cao, Yuanyuan ;
Zhu, Qing ;
Wu, Ting ;
Li, Nanfang .
BIOMED RESEARCH INTERNATIONAL, 2020, 2020
[3]   Development and Validation of an Algorithm to Identify Nonalcoholic Fatty Liver Disease in the Electronic Medical Record [J].
Corey, Kathleen E. ;
Kartoun, Uri ;
Zheng, Hui ;
Shaw, Stanley Y. .
DIGESTIVE DISEASES AND SCIENCES, 2016, 61 (03) :913-919
[4]   Dynamic Nomogram for Predicting Thrombocytopenia in Adults with Acute Pancreatitis [J].
Deng, Hongbin ;
Yu, Xianqiang ;
Gao, Kun ;
Liu, Yang ;
Tong, Zhihui ;
Liu, Yuxiu ;
Li, Weiqin .
JOURNAL OF INFLAMMATION RESEARCH, 2021, 14 :6657-6667
[5]   Machine learning algorithm outperforms fibrosis markers in predicting significant fibrosis in biopsy-confirmed NAFLD [J].
Feng, Gong ;
Zheng, Kenneth I. ;
Li, Yang-Yang ;
Rios, Rafael S. ;
Zhu, Pei-Wu ;
Pan, Xiao-Yan ;
Li, Gang ;
Ma, Hong-Lei ;
Tang, Liang-Jie ;
Byrne, Christopher D. ;
Targher, Giovanni ;
He, Na ;
Mi, Man ;
Chen, Yong-Ping ;
Zheng, Ming-Hua .
JOURNAL OF HEPATO-BILIARY-PANCREATIC SCIENCES, 2021, 28 (07) :593-603
[6]  
Hastie T., 2009, The elements of statistical learning: Data mining, inference, and prediction, DOI DOI 10.1007/978-0-387-84858-7
[7]  
James G, 2013, SPRINGER TEXTS STAT, V103, P1, DOI [10.1007/978-1-4614-7138-7, 10.1007/978-1-4614-7138-7_1]
[8]   A K-fold averaging cross-validation procedure [J].
Jung, Yoonsuh ;
Hu, Jianhua .
JOURNAL OF NONPARAMETRIC STATISTICS, 2015, 27 (02) :167-179
[9]   Predicting Low Risk for Sustained Alcohol Use After Early Liver Transplant for Acute Alcoholic Hepatitis: The Sustained Alcohol Use Post-Liver Transplant Score [J].
Lee, Brian P. ;
Vittinghoff, Eric ;
Hsu, Christine ;
Han, Hyosun ;
Therapondos, George ;
Fix, Oren K. ;
Victor, David W. ;
Dronamraju, Deepti ;
Im, Gene Y. ;
Voigt, Michael D. ;
Rice, John P. ;
Lucey, Michael R. ;
Eswaran, Sheila ;
Chen, Po-Hung ;
Li, Zhiping ;
Maddur, Haripriya ;
Terrault, Norah A. .
HEPATOLOGY, 2019, 69 (04) :1477-1487
[10]   Risk factors and prediction model for nonalcoholic fatty liver disease in northwest China [J].
Li, Danting ;
Zhang, Meiyu ;
Wu, Shengli ;
Tan, Huiwen ;
Li, Nong .
SCIENTIFIC REPORTS, 2022, 12 (01)