Handling Imbalanced Data With Weighted Logistic Regression and Propensity Score Matching methods: The Case of P2P Money Transfers

Citations: 0
Authors
Agrawal, Lavlin [1]
Mulgund, Pavankumar [2]
Sharman, Raj [3]
Affiliations
[1] North Carolina Agr & Tech State Univ, Greensboro, NC 27411 USA
[2] Univ Memphis, Memphis, TN USA
[3] Univ Buffalo, Buffalo, NY USA
Keywords
Adoption and Use; Bank-backed P2P; Imbalanced Data; Methodological Decisions; Propensity Match; Rare Event; Weighted Logistic Regression; MOBILE BANKING ADOPTION; TECHNOLOGY ACCEPTANCE; INFORMATION-TECHNOLOGY; CONSUMER ACCEPTANCE; USER ACCEPTANCE; ONLINE; MODEL; CLASSIFICATION; DECISIONS; EXTENSION;
DOI
10.4018/JDM.335888
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
The adoption of empirical methods for secondary data analysis has surged in IS research. However, secondary data is often incomplete, skewed, and imbalanced. Consequently, there is growing recognition of the importance of the empirical techniques and methodological decisions used to navigate such issues. Yet methodological guidance remains scarce, especially in the form of a worked case study that demonstrates the challenges of imbalanced datasets and offers prescriptive guidance on how to deal with them. Using data on P2P money transfer services, this article presents a running example that analyzes the same dataset with several different methods. It then compares the outcomes of these choices and explicates the rationale behind decisions such as the inclusion and categorization of variables, parameter setting, and model selection. Finally, the article discusses regression approaches such as weighted logistic regression and propensity score matching, and when they should be used.
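As a rough illustration of the weighted logistic regression idea named in the abstract, the sketch below fits a one-feature logistic model twice on synthetic rare-event data, once unweighted and once with inverse-frequency class weights. It is not the article's procedure or data: the sample sizes, the feature distributions, the weighting scheme, and every function and variable name here are assumptions chosen only for illustration.

```python
import math
import random

random.seed(0)

# Synthetic rare-event data (assumed for illustration): 380 non-events, 20 events.
xs = [random.gauss(0.0, 1.0) for _ in range(380)] + \
     [random.gauss(1.5, 1.0) for _ in range(20)]
ys = [0] * 380 + [1] * 20


def class_weights(labels):
    """Inverse-frequency weights, so each class carries equal total weight."""
    n = len(labels)
    pos = sum(labels)
    neg = n - pos
    return {0: n / (2.0 * neg), 1: n / (2.0 * pos)}


def fit_logit(xs, ys, weights, lr=0.5, epochs=1500):
    """Gradient descent on the (class-)weighted log-loss; returns (b0, b1)."""
    b0, b1 = 0.0, 0.0
    total = sum(weights[y] for y in ys)
    for _ in range(epochs):
        g0 = g1 = 0.0
        for x, y in zip(xs, ys):
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * x)))
            w = weights[y]
            g0 += w * (p - y)        # gradient w.r.t. intercept
            g1 += w * (p - y) * x    # gradient w.r.t. slope
        b0 -= lr * g0 / total
        b1 -= lr * g1 / total
    return b0, b1


def recall(xs, ys, b0, b1):
    """Share of true events predicted as events at a 0.5 probability cutoff."""
    hits = sum(1 for x, y in zip(xs, ys) if y == 1 and b0 + b1 * x > 0)
    return hits / sum(ys)


b0_u, b1_u = fit_logit(xs, ys, {0: 1.0, 1: 1.0})     # unweighted baseline
b0_w, b1_w = fit_logit(xs, ys, class_weights(ys))    # class-weighted fit
recall_u = recall(xs, ys, b0_u, b1_u)
recall_w = recall(xs, ys, b0_w, b1_w)
print("unweighted:", b0_u, b1_u, recall_u)
print("weighted:  ", b0_w, b1_w, recall_w)
```

Under these assumptions, weighting mainly shifts the intercept upward, so more observations cross the 0.5 cutoff and more rare events are recovered, typically at the cost of more false positives.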
Pages: 1 / 37
Page count: 37