Parallel Algorithm for Connected-Component Analysis Using CUDA

被引:5
作者
Windisch, Dominic [1 ]
Kaever, Christian [2 ]
Juckeland, Guido [2 ]
Bieberle, Andre [2 ]
机构
[1] Tech Univ Dresden, Inst Power Engn, D-01062 Dresden, Germany
[2] Helmholtz Zent Dresden Rossendorf, Bautzner Landstr 400, D-01328 Dresden, Germany
关键词
connected-component analysis; image stream processing; parallel computing; CUDA; RAY COMPUTED-TOMOGRAPHY;
D O I
10.3390/a16020080
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, we introduce a parallel algorithm for connected-component analysis (CCA) on GPUs which drastically reduces the volume of data to transfer from GPU to the host. CCA algorithms targeting GPUs typically store the extracted features in arrays large enough to potentially hold the maximum possible number of objects for the given image size. Transferring these large arrays to the host requires large portions of the overall execution time. Therefore, we propose an algorithm which uses a CUDA kernel to merge trees of connected component feature structs. During the tree merging, various connected-component properties, such as total area, centroid and bounding box, are extracted and accumulated. The tree structure then enables us to only transfer features of valid objects to the host for further processing or storing. Our benchmarks show that this implementation significantly reduces memory transfer volume for processing results on the host whilst maintaining similar performance to state-of-the-art CCA algorithms.
引用
收藏
页数:11
相关论文
共 20 条
[1]   SEQUENTIAL APPROACH TO EXTRACTION OF SHAPE FEATURES [J].
AGRAWALA, AK ;
KULKARNI, AV .
COMPUTER GRAPHICS AND IMAGE PROCESSING, 1977, 6 (06) :538-557
[2]   Optimized Block-Based Algorithms to Label Connected Components on GPUs [J].
Allegretti, Stefano ;
Bolelli, Federico ;
Grana, Costantino .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (02) :423-438
[3]   Parallel Light Speed Labeling: an efficient connected component algorithm for labeling and analysis on multi-core processors [J].
Cabaret, Laurent ;
Lacassagne, Lionel ;
Etiemble, Daniel .
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2018, 15 (01) :173-196
[4]   Real-time embedded system for traffic sign recognition based on ZedBoard [J].
Farhat, Wajdi ;
Faiedh, Hassene ;
Souani, Chokri ;
Besbes, Kamel .
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2019, 16 (05) :1813-1823
[5]   Ultra fast electron beam X-ray computed tomography for two-phase flow measurement [J].
Fischer, F. ;
Hampel, U. .
NUCLEAR ENGINEERING AND DESIGN, 2010, 240 (09) :2254-2259
[6]   Rapid data processing for ultrafast X-ray computed tomography using scalable and modular CUDA based pipelines [J].
Frust, Tobias ;
Wagner, Michael ;
Stephan, Jan ;
Juckeland, Guido ;
Bieberle, Andre .
COMPUTER PHYSICS COMMUNICATIONS, 2017, 219 :353-360
[7]  
Hennequin A, 2018, CONF DESIGN ARCHIT, P76, DOI 10.1109/DASIP.2018.8596835
[8]  
Kaever C., 2021, REAL TIME OBJECT REC
[9]   Automatic segmentation of liver & lesion detection using H-minima transform and connecting component labeling [J].
Khan, Nazish ;
Ahmed, Imran ;
Kiran, Mahreen ;
Rehman, Hamoodur ;
Din, Sadia ;
Paul, Anand ;
Reddy, Alavalapati Goutham .
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (13-14) :8459-8481
[10]   Light speed labeling: efficient connected component labeling on RISC architectures [J].
Lacassagne, Lionel ;
Zavidovique, Bertrand .
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2011, 6 (02) :117-135