Classification Methods Based on Fitting Logistic Regression to Positive and Unlabeled Data

被引:1
作者
Furmanczyk, Konrad [1 ]
Paczutkowski, Kacper [1 ]
Dudzinski, Marcin [1 ]
Dziewa-Dawidczyk, Diana [1 ]
机构
[1] Warsaw Univ Life Sci, Inst Informat Technol, Warsaw, Poland
来源
COMPUTATIONAL SCIENCE - ICCS 2022, PT I | 2022年
关键词
Positive unlabeled learning; Logistic regression; Empirical risk minimization; Thresholded lasso;
D O I
10.1007/978-3-031-08751-6_3
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In our work, we consider the classification methods based on the model of logistic regression for positive and unlabeled data. We examine the following four methods of the posterior probability estimation, where the risk of logistic loss function is optimized, namely: the naive approach, the weighted likelihood approach, as well as the quite recently proposed methods - the joint approach, and the LassoJoint method. The objective of our study is to evaluate the accuracy, the recall, the precision and the F1-score of the considered classification methods. The corresponding assessments have been carried out on 13 machine learning model schemes by conducting some numerical experiments on selected real datasets.
引用
收藏
页码:31 / 45
页数:15
相关论文
共 21 条
[1]  
[Anonymous], 2003, P INT C MACH LEARN I
[2]   Learning from positive and unlabeled data: a survey [J].
Bekker, Jessa ;
Davis, Jesse .
MACHINE LEARNING, 2020, 109 (04) :719-760
[3]  
Dua D, 2019, UCI MACHINE LEARNING
[4]  
Friedman J., 2009, glmnet: lasso and elastic-net regularized generalized linear models
[5]   Regularization Paths for Generalized Linear Models via Coordinate Descent [J].
Friedman, Jerome ;
Hastie, Trevor ;
Tibshirani, Rob .
JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01) :1-22
[6]  
Furmanczyk Konrad, 2021, Computational Science - ICCS 2021. 21st International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12744), P18, DOI 10.1007/978-3-030-77967-2_2
[7]  
Guo T, 2020, CVPR, P8385
[8]   Inference from presence-only data; the ongoing controversy [J].
Hastie, Trevor ;
Fithian, Will .
ECOGRAPHY, 2013, 36 (08) :864-867
[9]  
Hou M., 2018, P 27 INT JOINT C ART, P4864
[10]  
Kuhn M., 2022, caret: Classification and Regression Training (Version 6.0-86)