From crowdsourcing to crowdmining: using implicit human intelligence for better understanding of crowdsourced data

被引:8
作者
Guo, Bin [1 ]
Chen, Huihui [1 ]
Liu, Yan [1 ]
Chen, Chao [2 ]
Han, Qi [3 ]
Yu, Zhiwen [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
[2] Chongqing Univ, Dept Comp Sci, Chongqing 400044, Peoples R China
[3] Colorado Sch Mines, Golden, CO 80401 USA
来源
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2020年 / 23卷 / 02期
基金
美国国家科学基金会; 中国国家自然科学基金; 国家重点研发计划;
关键词
Data-centric crowdsourcing; Crowd mining; Implicit human intelligence; Mobile crowd sensing; Social media; SIMILARITY;
D O I
10.1007/s11280-019-00718-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of mobile social networks, more and more crowdsourced data are generated on the Web or collected from real-world sensing. The fragment, heterogeneous, and noisy nature of online/offline crowdsourced data, however, makes it difficult to be understood. Traditional content-based analyzing methods suffer from potential issues such as computational intensiveness and poor performance. To address them, this paper presents CrowdMining. In particular, we observe that the knowledge hidden in the process of data generation, regarding individual/crowd behavior patterns (e.g., mobility patterns, community contexts such as social ties and structure) and crowd-object interaction patterns (flickering or tweeting patterns) are neglected in crowdsourced data mining. Therefore, a novel approach that leverages implicit human intelligence (implicit HI) for crowdsourced data mining and understanding is proposed. Two studies titled CrowdEvent and CrowdRoute are presented to showcase its usage, where implicit HIs are extracted either from online or offline crowdsourced data. A generic model for CrowdMining is further proposed based on a set of existing studies. Experiments based on real-world datasets demonstrate the effectiveness of CrowdMining.
引用
收藏
页码:1101 / 1125
页数:25
相关论文
共 46 条
  • [1] Analyzing how travelers choose scenic routes using route choice models
    Alivand, Majid
    Hochmair, Hartwig
    Srinivasan, Sivaramakrishnan
    [J]. COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2015, 50 : 41 - 52
  • [2] [Anonymous], 2013, P 1 AAAI C HUM COMP
  • [3] [Anonymous], 2010, Proceedings of the 19th international conference on World wide web, WWW'10, DOI [10.1145/1772690.1772777, 10.1145/ 1772690.1772777]
  • [4] Bao Xuan., 2010, PROC MOBISYS 2010, P357, DOI DOI 10.1145/1814433.1814468
  • [5] Maximizing benefits from crowdsourced data
    Barbier, Geoffrey
    Zafarani, Reza
    Gao, Huiji
    Fung, Gabriel
    Liu, Huan
    [J]. COMPUTATIONAL AND MATHEMATICAL ORGANIZATION THEORY, 2012, 18 (03) : 257 - 279
  • [6] Machine learning of event segmentation for news on demand
    Boykin, S
    Merlino, A
    [J]. COMMUNICATIONS OF THE ACM, 2000, 43 (02) : 35 - 41
  • [7] Chen H., 2016, P IEEE INT C COMP CO, P1359
  • [8] Statistical Evaluation of Efficiency and Possibility of Earthquake Predictions with Gravity Field Variation and its Analytic Signal in Western China
    Chen, Shi
    Jiang, Changsheng
    Zhuang, Jiancang
    [J]. PURE AND APPLIED GEOPHYSICS, 2016, 173 (01) : 305 - 319
  • [9] Flock: Hybrid Crowd-Machine Learning Classifiers
    Cheng, Justin
    Bernstein, Michael S.
    [J]. PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON COMPUTER-SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING (CSCW'15), 2015, : 600 - 611
  • [10] Cooper Matthew., 2005, ACM T MULTIM COMPUT, V1, P269