Crowd R-CNN: An Object Detection Model Utilizing Crowdsourced Labels

被引:2
作者
Hu, Yucheng [1 ]
Song, Meina [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Beijing, Peoples R China
来源
ICVISP 2019: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON VISION, IMAGE AND SIGNAL PROCESSING | 2019年
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Aggregation; Crowdsourcing; Deep Learning; Ground Truth Inference; Object Detection;
D O I
10.1145/3387168.3387180
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accuracy of object detection has increased significantly in recent years because of the rapid development of deep learning techniques. Nevertheless, its applications in many fields are still limited, mainly due to the lack of large datasets, especially datasets with accurate annotations. Crowdsourcing provides a promising approach to tackle the problem mentioned above because of their "divide and conquer" nature. Nonetheless, existing crowdsourced techniques, e.g., Amazon Mechanical Turk (MTurk), often fail to guarantee the quality of the annotations. In this paper, we propose a novel probabilistic scheme based on crowdsourcing for ground truth inference. As a representative of object detection, we choose Faster R-CNN as the base framework. We name our scheme Crowd R-CNN. We propose an aggregation approach to aggregate annotations from multiple annotators, which allows to convert anchor labels and annotated labels with each other and train the network end-to-end using backpropagation. To improve accuracy, Crowd R-CNN takes into consideration the multi-dimensional measure of the annotators' ability and updates these parameters during training. Experimental results demonstrate that Crowd R-CNN can deal with noisy crowdsourced data effectively. Crowd R-CNN is able to achieve comparable results to the baseline with ground truth annotations and is the first algorithm to solve the problem of how to train deep object detection model utilizing crowdsourced labels.
引用
收藏
页数:7
相关论文
共 32 条
[1]   AggNet: Deep Learning From Crowds for Mitosis Detection in Breast Cancer Histology Images [J].
Albarqouni, Shadi ;
Baur, Christoph ;
Achilles, Felix ;
Belagiannis, Vasileios ;
Demirci, Stefanie ;
Navab, Nassir .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (05) :1313-1321
[2]  
[Anonymous], 2009, Advances in Neural Information Processing Systems
[3]  
[Anonymous], 2015, P 18 INT C EXT DAT T, DOI 10.5441/002/edbt.2015.18
[4]   Truth Is a Lie: Crowd Truth and the Seven Myths of Human Annotation [J].
Aroyo, Lora ;
Welty, Chris .
AI MAGAZINE, 2015, 36 (01) :15-24
[5]  
Dawid A. P., 1979, J ROYAL STAT SOC SER, V28, P20
[6]  
Demartini G., 2012, P 21 INT C WORLD WID, P469
[7]   How to Collect Segmentations for Biomedical Images? A Benchmark Evaluating the Performance of Experts, Crowdsourced Non-Experts, and Algorithms [J].
Gurari, Danna ;
Theriault, Diane ;
Sameki, Mehrnoosh ;
Isenberg, Brett ;
Pham, Tuan A. ;
Purwada, Alberto ;
Solski, Patricia ;
Walker, Matthew ;
Zhang, Chentian ;
Wong, Joyce Y. ;
Betke, Margrit .
2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, :1169-1176
[8]  
Hartigan J. A., 1979, Applied Statistics, V28, P100, DOI 10.2307/2346830
[9]  
He K, 2017, P IEEE INT C COMP VI, DOI DOI 10.1109/ICCV.2017.322
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778