Convolutional Neural Network Architecture for Semaphore Recognition

被引:0
作者
Li, Wanchong [1 ]
Yang, Yuliang [1 ]
Wang, Mengyuan [1 ]
Zhang, Linhao [1 ]
Zhu, Mengyu [2 ]
机构
[1] Univ Sci & Technol Beijing, Dept Commun Engn, Beijing, Peoples R China
[2] Beijing Inst Technol, Dept Biomed Engn, Beijing, Peoples R China
来源
PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS) | 2018年
关键词
semaphore; BN; SRNet; Tiny-SRNet;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper proposes convolutional neural networks SRNet and Tiny-SRNet for human semaphore action recognition. SRNet is composed of 5 layers of convolution and 3 layers of fully connected layers. In addition to the first convolution layer., a batch normalization layer is added before each convolution layer. In order to enable deep learning algorithms to be applied to both mobile and embedded platforms., Tiny-SRNet removes the full connected layers in SRNet and replaces them with a convolutional layer and a global average pooling layer. The experimental results show that compared with the mainstream classification models AlexNet., GoogleNet and VGGI6., SRNet achieves the highest recognition rate of 98.9% on the semaphore dataset, and Tiny-SRNet compresses its model size to 1/24 of SRNet with a reduction of 1.7% accuracy.
引用
收藏
页码:559 / 562
页数:4
相关论文
共 13 条
[1]  
[Anonymous], CORR
[2]  
[Anonymous], P 32 INT C MACH LEAR
[3]  
[Anonymous], 2014, PROC INT C LEARN REP
[4]   Actions and Attributes from Wholes and Parts [J].
Gkioxari, Georgia ;
Girshick, Ross ;
Malik, Jitendra .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2470-2478
[5]  
Han S., 2015, ABS150706149 CORR
[6]   Going deeper into action recognition: A survey [J].
Herath, Samitha ;
Harandi, Mehrtash ;
Porikli, Fatih .
IMAGE AND VISION COMPUTING, 2017, 60 :4-21
[7]   Caffe: Convolutional Architecture for Fast Feature Embedding [J].
Jia, Yangqing ;
Shelhamer, Evan ;
Donahue, Jeff ;
Karayev, Sergey ;
Long, Jonathan ;
Girshick, Ross ;
Guadarrama, Sergio ;
Darrell, Trevor .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :675-678
[8]   Semantic Pyramids for Gender and Action Recognition [J].
Khan, Fahad Shahbaz ;
van de Weijer, Joost ;
Anwer, Rao Muhammad ;
Felsberg, Michael ;
Gatta, Carlo .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (08) :3633-3645
[9]  
Krizhevsky A., 2017, COMMUN ACM, V60, P84, DOI [DOI 10.1145/3065386, 10.1145/3065386]
[10]   Deep learning [J].
LeCun, Yann ;
Bengio, Yoshua ;
Hinton, Geoffrey .
NATURE, 2015, 521 (7553) :436-444