Fast and Accurate Hand-Raising Gesture Detection in Classroom

被引：3

作者：

Liu, Tao ^{[1
]}

Jiang, Fei ^{[1
]}

Shen, Ruimin ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China

来源：

NEURAL INFORMATION PROCESSING, ICONIP 2020, PT IV | 2020年 / 1332卷

基金：

中国博士后科学基金;

关键词：

Hand-raising detection; CenterNet; Suppression loss;

D O I：

10.1007/978-3-030-63820-7_26

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes a fast and accurate method for hand-raising gesture detection in classrooms. Our method is based on a one-stage detector, CenterNet, which significantly reduces the inference time. Meanwhile, we design three mechanisms to improve the performance. Firstly, we propose a novel suppression loss to prevent easy and hard examples from overwhelming the training process. Secondly, we adopt a deep layer aggregation network to fuse semantic and spatial representation, which is effective for detecting tiny gestures. Thirdly, due to less variation in aspect ratios, we only regress single width property to predict whole bounding box. Thus achieving a more accurate result. Experiments show that our method achieves 91.4% mAP on our hand-raising dataset and runs at 26 FPS, 6.7x faster than the two-stage ones.

引用

页码：232 / 239

页数：8

共 14 条

[1] Beyond triplet loss: a deep quadruplet network for person re-identification [J].

Chen, Weihua ;

Chen, Xiaotang ;

Zhang, Jianguo ;

Huang, Kaiqi .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1320-1329

[2]

Dai JF, 2016, ADV NEUR IN, V29

[3]

Dollar P., 2009, Pedestrian detection: a benchmark

[4] CornerNet: Detecting Objects as Paired Keypoints [J].

Law, Hei ;

Deng, Jia .

COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :765-781

[5]

Li BY, 2019, AAAI CONF ARTIF INTE, P8577

[6] Focal Loss for Dense Object Detection [J].

Lin, Tsung-Yi ;

Goyal, Priya ;

Girshick, Ross ;

He, Kaiming ;

Dollar, Piotr .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2999-3007

[7] Feature Pyramid Networks for Object Detection [J].

Lin, Tsung-Yi ;

Dollar, Piotr ;

Girshick, Ross ;

He, Kaiming ;

Hariharan, Bharath ;

Belongie, Serge .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :936-944

[8] High-level Semantic Feature Detection: A New Perspective for Pedestrian Detection [J].

Liu, Wei ;

Liao, Shengcai ;

Ren, Weiqiang ;

Hu, Weidong ;

Yu, Yinan .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5182-5191

[9]

Oksuz K, 2020, Arxiv, DOI arXiv:1909.00169

[10] Libra R-CNN: Towards Balanced Learning for Object Detection [J].

Pang, Jiangmiao ;

Chen, Kai ;

Shi, Jianping ;

Feng, Huajun ;

Ouyang, Wanli ;

Lin, Dahua .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :821-830

← 1 2 →