A multitask joint framework for real-time person search

Cited by: 0
Authors
Ye Li
Kangning Yin
Jie Liang
Zhuofu Tan
Xinzhong Wang
Guangqiang Yin
Zhiguo Wang
Affiliations
[1] Shenzhen Institute of Information Technology
[2] University of Electronic Science and Technology of China
[3] Kash Institute of Electronics and Information Industry
Source
Multimedia Systems | 2023 / Vol. 29
Keywords
Person search; Multitask; Joint framework; Real time;
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Person search generally involves three important parts: person detection, feature extraction and identity comparison. However, a person search that integrates detection, extraction and comparison has the following two drawbacks. First, the accuracy of detection affects the accuracy of comparison. Second, it is difficult to achieve real-time results in real-world applications. To solve these problems, we propose a multitask joint framework for real-time person search (MJF) that optimizes person detection, feature extraction and identity comparison. For the person detection module, we propose the YOLOv5-GS model, which is trained with a person dataset. YOLOv5-GS combines the advantages of GhostNet and the squeeze-and-excitation block and improves the speed of person detection. For the feature extraction module, we design a model adaptation architecture, which can select different networks according to the number of people and thereby balance accuracy against speed. For identity comparison, we propose a 3D pooled table and a matching strategy to improve identification accuracy. With 1920 × 1080-resolution video and a 200-ID table, the IR and the FPS achieved by our method reach 82.69% and 25.14, respectively. Therefore, the MJF can achieve real-time person search.
Pages: 211-222
Number of pages: 11