GPU accelerated novel particle filtering method

被引：2

作者：

Das, Subhra Kanti ^{[1
]}

Mazumdar, Chandan ^{[2
]}

Banerjee, Kumardeb ^{[3
]}

机构：

[1] CSIR CMERI, Durgapur 713209, India

[2] Jadavpur Univ, Dept CSE, Kolkata 700032, India

[3] Jadavpur Univ, Dept EIE, Kolkata 700098, India

来源：

COMPUTING | 2014年 / 96卷 / 08期

关键词：

Particle filters; Resampling; Dual distribution; Parallel; GPU;

D O I：

10.1007/s00607-014-0400-2

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In this paper, a graphics processor unit (GPU) accelerated particle filtering algorithm is presented with an introduction to a novel resampling technique. The aim remains in the mitigation of particle impoverishment as well as computational burden, problems which are commonly associated with classical (systematic) resampled particle filtering. The proposed algorithm employs a priori-space dependent distribution in addition to the likelihood, and hence is christened as dual distribution dependent (D3) resampling method. Simulation results exhibit lesser values for root mean square error (RMSE) in comparison to that for systematic resampling. D3 resampling is shown to improve particle diversity after each iteration, thereby affecting the overall quality of estimation. However, computational burden is significantly increased owing to few excessive computations within the newly formulated resampling framework. With a view to obtaining parallel speedup we introduce a CUDA version of the proposed method for necessary acceleration by GPU. The GPU programming model is detailed in the context of this paper. Implementation issues are discussed along with illustration of empirical computational efficiency, as obtained by executing the CUDA code on Quadro 2000 GPU. The GPU enabled code has a speedup of 3 and 4 over the sequential executions of systematic and D3 resampling methods respectively. Performance both in terms of RMSE and running time have been elaborated with respect to different selections for threads per block towards effective implementations. It is in this context that, we further introduce a cost to performance metric (CPM) for assessing the algorithmic efficiency of the estimator, involving both quality of estimation and running time as comparative factors, transformed into a unified parameter for assessment. CPM values for estimators obtained from all such different choices for threads per block have been determined and a final value for the chosen parameter is resolved for generation of a holistic effective estimator.

引用

页码：749 / 773

页数：25

共 25 条

[1] Amdahl G. M., 1967, P APR 18 20 1967 SPR, P483, DOI [10.1145/1465482.1465560, DOI 10.1145/1465482.1465560]
[2] A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking
Arulampalam, MS
Maskell, S
Gordon, N
Clapp, T
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2002, 50 (02) : 174 - 188
[3] Bolic Miodrag, 2003, P IEEE INT C AC SPEE
[4] Parallel particle filtering
Brun, O
Teuliere, V
Garcia, JM
[J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2002, 62 (07) : 1186 - 1202
[5] Cuda C, 2013, CUD C PROGR GUID
[6] Dan Simon., 2006, Optimal state estimation: Kalman, H infinity, and nonlinear approaches
[7] Fearnhead P., 1998, THESIS U OXFORD OXFO
[8] Fossen T. I., 2002, MARINE CONTROL SYSTE, P19
[9] Freitas de N, 2001, SEQUENTIAL MONTE CAR
[10] NOVEL-APPROACH TO NONLINEAR NON-GAUSSIAN BAYESIAN STATE ESTIMATION
GORDON, NJ
SALMOND, DJ
SMITH, AFM
[J]. IEE PROCEEDINGS-F RADAR AND SIGNAL PROCESSING, 1993, 140 (02) : 107 - 113

← 1 2 3 →