Multi-sensor data fusion across dimensions: A novel approach to synopsis generation using sensory data

被引：0

作者：

Ingle, Palash Yuvraj ^{[1
,2
]}

Kim, Young-Gab ^{[1
,2
]}

机构：

[1] Sejong Univ, Dept Comp & Informat Secur, Seoul 05006, South Korea

[2] Sejong Univ, Convergence Engn Intelligent Drone, Seoul 05006, South Korea

来源：

JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION | 2025年 / 46卷

基金：

新加坡国家研究基金会;

关键词：

Drone surveillance; Deep learning; Video fusion; Video synopsis; VIDEO SYNOPSIS; ATTENTION; DEVICES;

D O I：

10.1016/j.jii.2025.100876

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Unmanned aerial vehicles (UAVs) and autonomous ground vehicles are increasingly outfitted with advanced sensors such as LiDAR, cameras, and GPS, enabling real-time object detection, tracking, localization, and navigation. These platforms generate high-volume sensory data, such as video streams and point clouds, that require efficient processing to support timely and informed decision-making. Although video synopsis techniques are widely used for visual data summarization, they encounter significant challenges in multi-sensor environments due to disparities in sensor modalities. To address these limitations, we propose a novel sensory data synopsis framework designed for both UAV and autonomous vehicle applications. The proposed system integrates a dualtask learning model with a real-time sensor fusion module to jointly perform abnormal object segmentation and depth estimation by combining LiDAR and camera data. The framework comprises a sensory fusion algorithm, a 3D-to-2D projection mechanism, and a Metropolis-Hastings-based trajectory optimization strategy to refine object tubes and construct concise, temporally-shifted synopses. This design selectively preserves and repositions salient information across space and time, enhancing synopsis clarity while reducing computational overhead. Experimental evaluations conducted on standard datasets (i.e., KITTI, Cityscapes, and DVS) demonstrate that our framework achieves a favorable balance between segmentation accuracy and inference speed. In comparison with existing studies, it yields superior performance in terms of frame reduction, recall, and F1 score. The results highlight the robustness, real-time capability, and broad applicability of the proposed approach to intelligent surveillance, smart infrastructure, and autonomous mobility systems.

引用

页数：24

共 81 条

[11] Deep Multi-Modal Object Detection and Semantic Segmentation for Autonomous Driving: Datasets, Methods, and Challenges [J].

Feng, Di ;

Haase-Schutz, Christian ;

Rosenbaum, Lars ;

Hertlein, Heinz ;

Glaser, Claudius ;

Timm, Fabian ;

Wiesbeck, Werner ;

Dietmayer, Klaus .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (03) :1341-1360

[12]

Fernandez-Moral E, 2018, IEEE INT VEH SYM, P1051, DOI 10.1109/IVS.2018.8500497

[13] Visual Object Detection and Tracking for Internet of Things Devices Based on Spatial Attention Powered Multidomain Network [J].

Gao, Haining ;

Yu, Lei ;

Khan, Imran Ali ;

Wang, Yinling ;

Yang, Yong ;

Shen, Hongdan .

IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (04) :2811-2820

[14] Vision meets robotics: The KITTI dataset [J].

Geiger, A. ;

Lenz, P. ;

Stiller, C. ;

Urtasun, R. .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1231-1237

[15] QuadroNet: Multi-Task Learning for Real-Time Semantic Depth Aware Instance Segmentation [J].

Goel, Kratarth ;

Srinivasan, Praveen ;

Tariq, Sarah ;

Philbin, James .

2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :315-324

[16] Depth from Camera Motion and Object Detection [J].

Griffin, Brent A. ;

Corso, Jason J. .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :1397-1406

[17] A comprehensive analysis of multi-strategic RIME algorithm for UAV path planning in varied terrains [J].

Gu, Tao ;

Zhang, Yajuan ;

Wang, Limin ;

Zhang, Yufei ;

Deveci, Muhammet ;

Wen, Xin .

JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION, 2025, 43

[18]

Gu Y., 2022, arXiv, DOI DOI 10.48550/ARXIV.2203.04129

[19] Real-time dense traffic detection using lightweight backbone and improved path aggregation feature pyramid network [J].

Guo, Feng ;

Wang, Yi ;

Qian, Yu .

JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION, 2023, 31

[20] EC2Detect: Real-Time Online Video Object Detection in Edge-Cloud Collaborative IoT [J].

Guo, Siyan ;

Zhao, Cong ;

Wang, Guiqin ;

Yang, Jiaqing ;

Yang, Shusen .

IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (20) :20382-20392

← 1 2 3 4 5 6 7 8 9 →