CACrowdGAN: Cascaded Attentional Generative Adversarial Network for Crowd Counting

被引:21
作者
Zhu, Aichun [1 ,2 ]
Zheng, Zhe [1 ]
Huang, Yaoying [1 ]
Wang, Tian [3 ]
Jin, Jing [1 ]
Hu, Fangqiang [1 ]
Hua, Gang [2 ]
Snoussi, Hichem [4 ]
机构
[1] Nanjing Tech Univ, Sch Comp Sci & Technol, Nanjing 210000, Peoples R China
[2] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221008, Jiangsu, Peoples R China
[3] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100083, Peoples R China
[4] Univ Technol Troyes, Inst Charles Delaunay LM2S FRE CNRS 2019, F-10300 Troyes, France
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Crowd counting; generative adversarial network; attention mechanism;
D O I
10.1109/TITS.2021.3075859
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Crowd counting is a valuable technology for extremely dense scenes in the transportation. Existing methods generally have higher-order inconsistencies between ground truth density maps and generated density maps. To address this issue, we incorporate an attentional discriminator to take charge of checking the density map between the generator and the ground truth. Thus, a Cascaded Attentional Generative Adversarial Network (CACrowdGAN) is proposed that enables the attentional-driven discriminator to distinguish implausible density maps and simultaneously to guide the generator to deliver fine-grained high quality density maps. The proposed CACrowdGAN consists of two components: an attentional generator and a cascaded attentional discriminator. The attentional generator has an attention module and a density module. The attention module is developed for the generator to focus on the crowd regions of the input images, while the density module is used to provide the attentional input of the discriminator. In addition, a cascaded attentional discriminator is proposed to synthesize attentional-driven fine-grained details at different crowd regions of the input image and compute a per-pixel fine-grained loss for training generator. The proposed CACrowdGAN achieves the state-of-the-art performance on five popular crowd counting datasets (ShanghaiTech, WorldEXPO'10, UCSD, UCF_CC_50 and UCF_QNRF), which demonstrates the effectiveness and robustness of the proposed approach in the complex scenes.
引用
收藏
页码:8090 / 8102
页数:13
相关论文
共 63 条
[1]  
[Anonymous], 2019, ARXIV190106026
[2]  
Arjovsky Martin, 2017, ARXIV170107875
[3]  
Berthelot D., 2017, BEGAN: Boundary Equilibrium Generative Adversarial Networks
[4]   Look and Think Twice: Capturing Top-Down Visual Attention with Feedback Convolutional Neural Networks [J].
Cao, Chunshui ;
Liu, Xianming ;
Yang, Yi ;
Yu, Yinan ;
Wang, Jiang ;
Wang, Zilei ;
Huang, Yongzhen ;
Wang, Liang ;
Huang, Chang ;
Xu, Wei ;
Ramanan, Deva ;
Huang, Thomas S. .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2956-2964
[5]   Scale Aggregation Network for Accurate and Efficient Crowd Counting [J].
Cao, Xinkun ;
Wang, Zhipeng ;
Zhao, Yanyun ;
Su, Fei .
COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 :757-773
[6]   Privacy preserving crowd monitoring: Counting people without people models or tracking [J].
Chan, Antoni B. ;
Liang, Zhang-Sheng John ;
Vasconcelos, Nuno .
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, :1766-1772
[7]   Attention to Scale: Scale-aware Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Yang, Yi ;
Wang, Jiang ;
Xu, Wei ;
Yuille, Alan L. .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3640-3649
[8]  
Dai F, 2019, ARXIV190609707
[9]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[10]   Competition-Based Distributed Differential Evolution [J].
Ge, Yong-Feng ;
Yu, Wei-Jie ;
Zhan, Zhi-Hui ;
Zhang, Jun .
2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, :118-125