Deep Metric Learning for Crowdedness Regression

被引:83
作者
Wang, Qi [1 ,2 ,3 ]
Wan, Jia [1 ,2 ]
Yuan, Yuan [1 ,2 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Ctr OpT IMagery Anal & Learning, Xian 710072, Peoples R China
[3] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; metric learning; regression; congestion detection; crowd counting;
D O I
10.1109/TCSVT.2017.2703920
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Cross-scene regression tasks, such as congestion level detection and crowd counting, are useful hut challenging. There are two main problems, which limit the performance of existing algorithms. The first one is that no appropriate congestion-related feature can reflect the real density in scenes. Though deep learning has been proved to be capable of extracting high level semantic representations, it is hard to converge on regression tasks, since the label is too weak to guide the learning of parameters in practice. Thus, many approaches utilize additional information, such as a density map, to guide the learning, which increases the effort of labeling. Another problem is that most existing methods are composed of several steps, for example, feature extraction and regression. Since the steps in the pipeline are separated, these methods face the problem of complex optimization. To remedy it, a deep metric learning-based regression method is proposed to extract density related features, and learn better distance measurement simultaneously. The proposed networks trained end-to-end for better optimization can be used for crowdedness regression tasks, including congestion level detection and crowd counting. Extensive experiments confirm the effectiveness of the proposed method.
引用
收藏
页码:2633 / 2643
页数:11
相关论文
共 51 条
[1]  
Abadi M., 2015, TensorFlow: Large-scale machine learning on heterogeneous systems.
[2]  
[Anonymous], P MACHINE LEARNING R
[3]  
[Anonymous], INT C MACH VIS
[4]  
[Anonymous], IMAGE TECHNOLOGY ADV
[5]  
[Anonymous], P 10 INT C SIGN PROC
[6]  
[Anonymous], 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06), DOI DOI 10.1109/CVPR.2006.167
[7]  
[Anonymous], 2013, ADV COMPUTATIONAL SC
[8]  
[Anonymous], TRAFFIC FLOW THEORY
[9]  
Chan AB, 2005, 2005 IEEE INTELLIGENT VEHICLES SYMPOSIUM PROCEEDINGS, P771
[10]   Kernel-based distance metric learning for content-based image retrieval [J].
Chang, Hong ;
Yeung, Dit-Yan .
IMAGE AND VISION COMPUTING, 2007, 25 (05) :695-703