Reputation Transferability in Online Labor Markets

被引:100
作者
Kokkodis, Marios [1 ]
Ipeirotis, Panagiotis G. [1 ]
机构
[1] NYU, Dept Informat Operat & Management Sci, Leonard N Stern Sch Business, 550 1St Ave, New York, NY 10012 USA
关键词
online labor markets; reputation; Bayesian modeling; linear dynamical systems; COMMERCE; EBAY;
D O I
10.1287/mnsc.2015.2217
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Online workplaces such as oDesk, Amazon Mechanical Turk, and TaskRabbit have been growing in importance over the last few years. In such markets, employers post tasks on which remote contractors work and deliver the product of their work online. As in most online marketplaces, reputation mechanisms play a very important role in facilitating transactions, since they instill trust and are often predictive of the employer's future satisfaction. However, labor markets are usually highly heterogeneous in terms of available task categories; in such scenarios, past performance may not be an accurate signal of future performance. To account for this natural heterogeneity, in this work, we build models that predict the performance of a worker based on prior, category-specific feedback. Our models assume that each worker has a category- specific quality, which is latent and not directly observable; what is observable, though, is the set of feedback ratings of the worker and of other contractors with similar work histories. Based on this information, we provide a series of models of increasing complexity that successfully estimate the worker's quality. We start by building a binomial model and a multinomial model under the implicit assumption that the latent qualities of the workers are static. Next, we remove this assumption, and we build linear dynamic systems that capture the evolution of these latent qualities over time. We evaluate our models on a large corpus of over a million transactions ( completed tasks) from oDesk, an online labor market with hundreds of millions of dollars in transaction volume. Our results show an improved accuracy of up to 25% compared to feedback baselines and significant improvement over the commonly used collaborative filtering approach. Our study clearly illustrates that reputation systems should present different reputation scores, depending on the context in which the worker has been previously evaluated and the job for which the worker is applying.
引用
收藏
页码:1687 / 1706
页数:20
相关论文
共 55 条
[41]  
OMahony M. P., 2010, RIAO 2010 9 INT C AD, P164
[42]  
Otterbacher J, 2009, CHI2009: PROCEEDINGS OF THE 27TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, VOLS 1-4, P955
[43]   Inefficient Hiring in Entry-Level Labor Markets [J].
Pallais, Amanda .
AMERICAN ECONOMIC REVIEW, 2014, 104 (11) :3565-3599
[44]   Robust classification for imprecise environments [J].
Provost, F ;
Fawcett, T .
MACHINE LEARNING, 2001, 42 (03) :203-231
[45]   The promise of Mechanical Turk: How online labor markets can help theorists run behavioral experiments [J].
Rand, David G. .
JOURNAL OF THEORETICAL BIOLOGY, 2012, 299 :172-179
[46]   The value of reputation on eBay: A controlled experiment [J].
Resnick, Paul ;
Zeckhauser, Richard ;
Swanson, John ;
Lockwood, Kate .
EXPERIMENTAL ECONOMICS, 2006, 9 (02) :79-101
[47]  
Ricci F, 2011, RECOMMENDER SYSTEMS HANDBOOK, P1, DOI 10.1007/978-0-387-85820-3_1
[48]  
Shah C, 2010, SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, P411
[49]  
Sharara H, 2011, P INT C WEBL SOC MED
[50]  
Shaw A.D., 2011, C COMP SUPP COOP WOR, DOI [DOI 10.1145/1958824.1958865, 10.1145/1958824.1958865]