IAUnet: Global Context-Aware Feature Learning for Person Reidentification

被引:48
作者
Hou, Ruibing [1 ,2 ]
Ma, Bingpeng [2 ]
Chang, Hong [1 ,2 ]
Gu, Xinqian [1 ,2 ]
Shan, Shiguang [1 ,2 ,3 ]
Chen, Xilin [1 ,2 ]
机构
[1] Chinese Acad Sci, Key Lab Intelligent Informat Proc, Inst Comp Technol, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China
[3] CAS Ctr Excellence Brain Sci & Intelligence Techn, Shanghai 200031, Peoples R China
关键词
Context modeling; Feature extraction; Computational modeling; Semantics; Aggregates; Visualization; Task analysis; Feature enhancing; interaction-aggregation; person reidentification (reID); spatial-temporal context modeling; ATTENTION; NETWORK;
D O I
10.1109/TNNLS.2020.3017939
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person reidentification (reID) by convolutional neural network (CNN)-based networks has achieved favorable performance in recent years. However, most of existing CNN-based methods do not take full advantage of spatial-temporal context modeling. In fact, the global spatial-temporal context can greatly clarify local distractions to enhance the target feature representation. To comprehensively leverage the spatial-temporal context information, in this work, we present a novel block, interaction-aggregation-update (IAU), for high-performance person reID. First, the spatial-temporal IAU (STIAU) module is introduced. STIAU jointly incorporates two types of contextual interactions into a CNN framework for target feature learning. Here, the spatial interactions learn to compute the contextual dependencies between different body parts of a single frame, while the temporal interactions are used to capture the contextual dependencies between the same body parts across all frames. Furthermore, a channel IAU (CIAU) module is designed to model the semantic contextual interactions between channel features to enhance the feature representation, especially for small-scale visual cues and body parts. Therefore, the IAU block enables the feature to incorporate the globally spatial, temporal, and channel context. It is lightweight, end-to-end trainable, and can be easily plugged into existing CNNs to form IAUnet. The experiments show that IAUnet performs favorably against state of the art on both image and video reID tasks and achieves compelling results on a general object categorization task. The source code is available at https://github.com/blue-blue272/ImgReID-IAnet.
引用
收藏
页码:4460 / 4474
页数:15
相关论文
共 87 条
[41]   Recurrent Convolutional Network for Video-based Person Re-Identification [J].
McLaughlin, Niall ;
del Rincon, Jesus Martinez ;
Miller, Paul .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1325-1334
[42]  
Paisitkriangkrai S, 2015, PROC CVPR IEEE, P1846, DOI 10.1109/CVPR.2015.7298794
[43]   Object-Part Attention Model for Fine-Grained Image Classification [J].
Peng, Yuxin ;
He, Xiangteng ;
Zhao, Junjie .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (03) :1487-1500
[44]   Pose-Normalized Image Generation for Person Re-identification [J].
Qian, Xuelin ;
Fu, Yanwei ;
Xiang, Tao ;
Wang, Wenxuan ;
Qiu, Jie ;
Wu, Yang ;
Jiang, Yu-Gang ;
Xue, Xiangyang .
COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 :661-678
[45]   Performance Measures and a Data Set for Multi-target, Multi-camera Tracking [J].
Ristani, Ergys ;
Solera, Francesco ;
Zou, Roger ;
Cucchiara, Rita ;
Tomasi, Carlo .
COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 :17-35
[46]   A Pose-Sensitive Embedding for Person Re-Identification with Expanded Cross Neighborhood Re-Ranking [J].
Sarfraz, M. Saquib ;
Schumann, Arne ;
Eberle, Andreas ;
Stiefelhagen, Rainer .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :420-429
[47]   End-to-End Deep Kronecker-Product Matching for Person Re-identification [J].
Shen, Yantao ;
Xiao, Tong ;
Li, Hongsheng ;
Yi, Shuai ;
Wang, Xiaogang .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6886-6895
[48]   Embedding Deep Metric for Person Re-identification: A Study Against Large Variations [J].
Shi, Hailin ;
Yang, Yang ;
Zhu, Xiangyu ;
Liao, Shengcai ;
Lei, Zhen ;
Zheng, Weishi ;
Li, Stan Z. .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :732-748
[49]  
Shi XJ, 2015, ADV NEUR IN, V28
[50]   Mask-guided Contrastive Attention Model for Person Re-Identification [J].
Song, Chunfeng ;
Huang, Yan ;
Ouyang, Wanli ;
Wang, Liang .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1179-1188