Saliency guided faster-RCNN (SGFr-RCNN) model for object detection and recognition

Cited by: 34

Authors:

Sharma, Vipal Kumar [1]

Mir, Roohie Naaz [1]

Affiliations:

[1] Natl Inst Technol, Dept Comp Sci & Engn, Srinagar, Jammu & Kashmir, India

Source:

JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES | 2022 / Vol. 34 / No. 05

Keywords:

Faster RCNN; Object detection; Recognition; Fitness function; Saliency; Convolutions; Bounding boxes; ROI; Pooling;

DOI:

10.1016/j.jksuci.2019.09.012

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Object detection and recognition are now widely used in both real-time and offline applications. Computer-vision-based automatic learning schemes have attracted considerable attention from researchers because their learning capability can significantly improve detection performance. Advances in deep and convolutional neural networks have improved the efficiency of recognition and detection applications. However, improving precision, reducing detection error, and detecting camouflaged objects are still regarded as difficult problems. In this work, we focus on these problems and present a model based on Faster-RCNN that uses saliency detection, proposal generation, and bounding box regression, along with loss functions, for better detection. The proposed method is referred to as the saliency driven Faster RCNN model for object detection and recognition using a computer vision approach (SGFr-RCNN). Its performance is evaluated on the PASCAL VOC 2007, PASCAL VOC 2012, and CAMO_UOW data sets and compared with existing methods in terms of mean average precision. The comparative study shows a significant improvement in the results of the proposed approach relative to existing methods. (c) 2019 The Authors. Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).


Pages: 1687-1699

Page count: 13

References: 60 in total
