Improve Reputation Evaluation of Crowdsourcing Participants Using Multidimensional Index and Machine Learning Techniques

被引:17
作者
Huang, Yanrong [1 ]
Chen, Min [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, State Key Lab Software Engn, Wuhan 430072, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Crowdsourcing participants; reputation evaluation; machine learning; random forest; data dimension reduction; MODEL; TRUST;
D O I
10.1109/ACCESS.2019.2933147
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Building a scientific and reasonable reputation evaluation mechanism for crowdsourcing participants is an effective way to solve the problem of transaction fraud, to establish the trust of traders and ensure the quality of task completion. Under the big data environment, machine learning methods have been applied in the domain of e-commerce of physical goods to improve the traditional reputation evaluation methods, and achieved good results. However, fewstudies have applied machine learning methods to crowdsourcing, a form of service e-commerce, to evaluate the reputation of participants. This paper proposes a reputation evaluation model (i.e. LDA-RF) for crowdsourcing participants of Random Forest based on Linear Discriminant Analysis. The model consists of five steps: firstly, building a multidimensional reputation evaluation index system for crowdsourcing participants, collecting real data sets, and preprocessing data; secondly, data dimensionality reduction methods, including Linear Discriminant Analysis, Principal Component Analysis, Mean Impact Value method and ReliefF feature selection method, are used to eliminate redundant variables; thirdly, data normalization; fourthly, with selected feature subset, five machine learning techniques, Random Forest, Decision Tree, Back propagation Neural Network, Radial Basis Function Neural Network and Support Vector Machine are used to train the model; Fifthly, the validity of the model is tested by four evaluation measures: 10 fold cross validation, confusion matrix, Kruskal-wallis test and dispersion degree. The results show that the LDA-RF model on accuracy, F1-measure, generalization ability and robustness are better than those of other models, and it has better performance and effectiveness. This study represents a new contribution to establish reputation evaluation of crowdsourcing participants under big data environment.
引用
收藏
页码:118055 / 118067
页数:13
相关论文
共 33 条
[1]   A comparative study on base classifiers in ensemble methods for credit scoring [J].
Abelian, Joaquin ;
Castellano, Javier G. .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 73 :1-10
[2]  
Bhattacharjee S, 2017, IEEE CONF COMM NETW, P200
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]  
Broomhead D. S., 1988, Complex Systems, V2, P321
[5]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[6]  
Fu Y. G., 2016, J CENTRAL U FINANCE, V8, P74
[7]   Boomerang: Rebounding the Consequences of Reputation Feedback on Crowdsourcing Platforms [J].
Gaikwad, Snehalkumar S. ;
Morina, Durim ;
Ginzberg, Adam ;
Mullings, Catherine ;
Goyal, Shirish ;
Gamage, Dilrukshi ;
Diemert, Christopher ;
Burton, Mathias ;
Zhou, Sharon ;
Whiting, Mark ;
Ziulkoski, Karolina ;
Ballav, Alipta ;
Gilbee, Aaron ;
Niranga, Senadhipathige S. ;
Sehgal, Vibhor ;
Lin, Jasmine ;
Kristianto, Leonardy ;
Richmond-Fuller, Angela ;
Regino, Jeff ;
Chhibber, Nalin ;
Majeti, Dinesh ;
Sharma, Sachin ;
Mananova, Kamila ;
Dhaka, Dinesh ;
Dai, William ;
Purynova, Victoria ;
Sandeep, Samarth ;
Chandrakanthan, Varshine ;
Sarma, Tejas ;
Matin, Sekandar ;
Nasser, Ahmed ;
Nistala, Rohit ;
Stolzoff, Alexander ;
Milland, Kristy ;
Mathur, Vinayak ;
Vaish, Rajan ;
Bernstein, Michael S. .
UIST 2016: PROCEEDINGS OF THE 29TH ANNUAL SYMPOSIUM ON USER INTERFACE SOFTWARE AND TECHNOLOGY, 2016, :625-637
[8]  
Howe J., 2006, Wired Mag, V14, P1
[9]  
Howe J., 2008, Crowdsourcing: How the power of the crowd is driving the future of business
[10]  
Hsueh Pei-Yun, 2009, P NAACL HLT WORKSH A, P27