Granulated deep learning and Z-numbers in motion detection and object recognition

被引：24

作者：

Pal, Sankar K. ^{[1
]}

Bhoumik, Debasmita ^{[1
]}

Bhunia Chakraborty, Debarati ^{[1
]}

机构：

[1] Indian Stat Inst, Ctr Soft Comp Res, Kolkata 700108, India

来源：

NEURAL COMPUTING & APPLICATIONS | 2020年 / 32卷 / 21期

关键词：

Deep learning; Granular computing; Rough sets; Video tracking; Object recognition; Z-numbers; FUZZY; TRACKING;

D O I：

10.1007/s00521-019-04200-1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The article deals with the problems of motion detection, object recognition, and scene description using deep learning in the framework of granular computing and Z-numbers. Since deep learning is computationally intensive, whereas granular computing, on the other hand, leads to computation gain, a judicious integration of their merits is made so as to make the learning mechanism computationally efficient. Further, it is shown how the concept of z-numbers can be used to quantify the abstraction of semantic information in interpreting a scene, where subjectivity is of major concern, through recognition of its constituting objects. The system, thus developed, involves recognition of both static objects in the background and moving objects in foreground separately. Rough set theoretic granular computing is adopted where rough lower and upper approximations are used in defining object and background models. During deep learning, instead of scanning the entire image pixel by pixel in the convolution layer, we scan only the representative pixel of each granule. This results in a significant gain in computation time. Arbitrary-shaped and sized granules, as expected, perform better than regular-shaped rectangular granules or fixed-sized granules. The method of tracking is able to deal efficiently with various challenging cases, e.g., tracking partially overlapped objects and suddenly appeared objects. Overall, the granulated system shows a balanced trade-off between speed and accuracy as compared to pixel level learning in tracking and recognition. The concept of using Z-numbers, in providing a granulated linguistic description of a scene, is unique. This gives a more natural interpretation of object recognition in terms of certainty toward scene understanding.

引用

页码：16533 / 16548

页数：16

共 35 条

[1]

[Anonymous], 2010, Advances in Neural Information Processing Systems (NeurIPS)

[2]

Banerjee R, 2013, STUD FUZZ SOFT COMP, V291, P71, DOI 10.1007/978-3-642-34922-5_6

[3] Granulation, rough entropy and spatiotemporal moving object detection [J].

Chakraborty, Debarati ;

Shankar, B. Uma ;

Pal, Sankar K. .

APPLIED SOFT COMPUTING, 2013, 13 (09) :4001-4009

[4] Neighborhood granules and rough rule-base in tracking [J].

Chakraborty, Debarati Bhunia ;

Pal, Sankar K. .

NATURAL COMPUTING, 2016, 15 (03) :359-370

[5] Scalable Object Detection using Deep Neural Networks [J].

Erhan, Dumitru ;

Szegedy, Christian ;

Toshev, Alexander ;

Anguelov, Dragomir .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :2155-2162

[6]

Ferris J, 2009, PALG STUD THEAT PERF, P1

[7] Online object tracking via motion-guided convolutional neural network (MGNet) [J].

Gan, Weihao ;

Lee, Ming-Sui ;

Wu, Chi-hao ;

Kuo, C. -C. .

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 53 :180-191

[8] Good Features to Correlate for Visual Tracking [J].

Gundogdu, Erhan ;

Alatan, A. Aydin .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (05) :2526-2540

[9] Correlation Filters with Weighted Convolution Responses [J].

He, Zhiqun ;

Fan, Yingruo ;

Zhuang, Junfei ;

Dong, Yuan ;

Bai, HongLiang .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :1992-2000

[10] Learning to Track at 100 FPS with Deep Regression Networks [J].

Held, David ;

Thrun, Sebastian ;

Savarese, Silvio .

COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :749-765

← 1 2 3 4 →