Semi-automated computer vision-based tracking of multiple industrial entities: a framework and dataset creation approach

被引:0
作者
Jérôme Rutinowski
Hazem Youssef
Sven Franke
Irfan Fachrudin Priyanta
Frederik Polachowski
Moritz Roidl
Christopher Reining
机构
[1] TU Dortmund University,Chair of Material Handling and Warehousing
来源
EURASIP Journal on Image and Video Processing | / 2024卷
关键词
Warehousing; Computer vision; Object detection; Classification;
D O I
暂无
中图分类号
学科分类号
摘要
This contribution presents the TOMIE framework (Tracking Of Multiple Industrial Entities), a framework for the continuous tracking of industrial entities (e.g., pallets, crates, barrels) over a network of, in this example, six RGB cameras. This framework makes use of multiple sensors, data pipelines, and data annotation procedures, and is described in detail in this contribution. With the vision of a fully automated tracking system for industrial entities in mind, it enables researchers to efficiently capture high-quality data in an industrial setting. Using this framework, an image dataset, the TOMIE dataset, is created, which at the same time is used to gauge the framework’s validity. This dataset contains annotation files for 112,860 frames and 640,936 entity instances that are captured from a set of six cameras that perceive a large indoor space. This dataset out-scales comparable datasets by a factor of four and is made up of scenarios, drawn from industrial applications from the sector of warehousing. Three tracking algorithms, namely ByteTrack, Bot-Sort, and SiamMOT, are applied to this dataset, serving as a proof-of-concept and providing tracking results that are comparable to the state of the art.
引用
收藏
相关论文
共 54 条
[1]  
Frankó A(2020)Reliable identification schemes for asset and production tracking in industry 4.0 Sensors 20 3709-41
[2]  
Vida G(2023)Recent advances of monocular 2D and 3D human pose estimation: a deep learning perspective ACM Comput. Surv. 55 1-88
[3]  
Varga P(2021)Deep 3D human pose estimation: a review Comput. Vis. Image Underst. 381 61-881
[4]  
Liu W(2020)Deep learning in video multi-object tracking: A survey Neurocomputing 129 845-765
[5]  
Bao Q(2021)Motchalllenge: A benchmark for single-camera multiple target tracking Int. J. Comput. Vis. 13 745-19
[6]  
Sun Y(2018)Multi-object tracking with correlation filter for autonomous vehicle Sensors 34 3-1237
[7]  
Mei T(2017)A review of detection and tracking of object from image and video sequences Int. J. Comput. Intell. Res. 32 1231-348
[8]  
Wang J(2013)Intelligent multi-camera video surveillance: A review Pattern Recogn. Lett. 125 341-undefined
[9]  
Tan S(2013)Vision meets robotics: the kitti dataset Int. J. Robot. Res. undefined undefined-undefined
[10]  
Zhen X(2008)Evaluating multiple object tracking performance: the CLEAR MOT metrics EURASIP J. Image Video Process. undefined undefined-undefined