Deep Learning Based Pedestrian Detection at Distance in Smart Cities

Cited by: 11

Authors:

Dinakaran, Ranjith K. [1]

Easom, Philip [1]

Bouridane, Ahmed [1]

Zhang, Li [1]

Jiang, Richard [3]

Mehboob, Fozia [2]

Rauf, Abdul [2]

Affiliations:

[1] Northumbria Univ, Comp & Informat Sci, Newcastle Upon Tyne, Tyne & Wear, England

[2] Imam Mohammed Ibn Saud Islamic Univ, Comp Sci, Riyadh, Saudi Arabia

[3] Sch Comp & Commun, Lancaster, England

Source:

INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2 | 2020 / Vol. 1038

Keywords:

Deep neural networks; Object detection; Smart homecare; Smart cities

DOI:

10.1007/978-3-030-29513-4_43

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Generative adversarial networks (GANs) are promising for many computer vision problems because they can enhance the data used for training and testing. In this paper, we leverage GANs and propose a new architecture with a cascaded Single Shot Detector (SSD) for pedestrian detection at distance, which remains a challenge because pedestrians appear at widely varying sizes in videos captured at distance. To overcome the low-resolution issues in pedestrian detection at distance, a DCGAN is first employed to improve the resolution and reconstruct more discriminative features, which an SSD then uses to detect objects in images or videos. A crucial advantage of our method is that it learns a multiscale metric to distinguish multiple objects at different distances within one image, while the DCGAN serves as an encoder-decoder platform that generates parts of an image containing better discriminative information. To measure the effectiveness of the proposed method, experiments were carried out on the Canadian Institute for Advanced Research (CIFAR) dataset. The results demonstrate that the proposed architecture achieves a much better detection rate, particularly on vehicles and pedestrians at distance, making it highly suitable for smart city applications that need to discover key objects or pedestrians at distance.
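
The enhance-then-detect ordering described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: `upscale_nearest` is a hypothetical placeholder for a trained DCGAN generator, `bright_region_detector` is a hypothetical stand-in for an SSD, and the names `cascade_detect`, `Image`, and `Box` are assumptions introduced only for this example. The sketch shows the data flow only: the image is enlarged first, detection runs on the enlarged image, and the resulting boxes are mapped back to the coordinates of the original image.

```python
from typing import Callable, List, Tuple

Image = List[List[float]]                # grayscale image as rows of pixel values
Box = Tuple[int, int, int, int, float]   # (x, y, width, height, score)


def upscale_nearest(image: Image, factor: int = 2) -> Image:
    """Placeholder enhancer: nearest-neighbour upscaling.

    Assumption: this stands in for a trained DCGAN generator,
    which is NOT implemented here.
    """
    return [
        [pixel for pixel in row for _ in range(factor)]
        for row in image
        for _ in range(factor)
    ]


def bright_region_detector(image: Image, threshold: float = 0.5) -> List[Box]:
    """Toy detector: one box around all pixels brighter than the threshold.

    Assumption: this stands in for an SSD, which is NOT implemented here.
    """
    coords = [
        (x, y)
        for y, row in enumerate(image)
        for x, value in enumerate(row)
        if value > threshold
    ]
    if not coords:
        return []
    xs = [x for x, _ in coords]
    ys = [y for _, y in coords]
    x0, y0 = min(xs), min(ys)
    return [(x0, y0, max(xs) - x0 + 1, max(ys) - y0 + 1, 1.0)]


def cascade_detect(
    image: Image,
    enhance: Callable[[Image, int], Image],
    detect: Callable[[Image], List[Box]],
    factor: int = 2,
) -> List[Box]:
    """Enhance first, detect second, then rescale boxes to input coordinates."""
    enhanced = enhance(image, factor)
    boxes = detect(enhanced)
    return [
        (x // factor, y // factor, max(1, w // factor), max(1, h // factor), score)
        for x, y, w, h, score in boxes
    ]


if __name__ == "__main__":
    frame = [[0.0] * 4 for _ in range(4)]
    frame[1][2] = 1.0  # a single bright pixel playing the role of a distant object
    print(cascade_detect(frame, upscale_nearest, bright_region_detector))
```

Passing the enhancer and detector as callables keeps the cascade independent of any particular model, so a real generator and detector could be substituted without changing `cascade_detect`.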


Pages: 588-593

Page count: 6

