Real-Time Context-Aware Early Filtering for High-Definition Video Analytics on Commodity Edge Devices Using GenAI for Data Augmentation

被引:0
作者
Pontes, Felipe Arruda [1 ,2 ]
Schukat, Michael [2 ]
Curry, Edward [1 ,2 ]
机构
[1] Univ Galway, Data Sci Inst, Insight Ctr Data Analyt, Galway H91 TK33, Ireland
[2] Univ Galway, Sch Comp Sci, Galway H91 TK33, Ireland
来源
IEEE ACCESS | 2024年 / 12卷
基金
爱尔兰科学基金会;
关键词
Image edge detection; Filtering; Real-time systems; Pipelines; Accuracy; Throughput; Streaming media; High definition video; Cloud computing; Cameras; Commodity edge; generative AI; deep neural networks; real-time; streaming;
D O I
10.1109/ACCESS.2024.3520807
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work proposes a fast and accurate early filtering pipeline for video analytics in commodity Edge devices for Smart-Cities applications. This pipeline can run in real-time even on a small and GPU-less device such as a Raspberry Pi, while maintaining a good accuracy for video analytics queries. In addition to a novel Edge optimized pre-processing method, the pipeline uses a context-aware binary model, which is fine-tuned using semi-automatic synthetic data augmentation, Generative AI, and Cut-and-Paste techniques to contextualize the model to the input camera background and the Objects of Interest (e.g., car or person) from a user's video analytics query, in a fast process that requires only 10 seconds of original footage for training. This makes it the first Edge filtering with specialized models with a viable online training solution. Compared to a baseline state-of-art Nano-YoloV5 model, the proposed early filtering pipeline in its high speed profile shows an 48.8x increase in speed and is the first of its kind that is able to run on physical hardware (i.e., non-simulated) commodity Edge devices at more than 80 FPS in HD ( 1920x1080 ) resolution, with a small accuracy loss of 5% compared to the baseline. On the high accuracy setting the pipeline still runs at more than 41 FPS (26.9x faster than Nano-Yolo) and shows an increase of 2.5% in accuracy.
引用
收藏
页码:194728 / 194749
页数:22
相关论文
共 57 条
  • [1] Seo J., Han S., Lee S., Kim H., Computer vision techniques for construction safety and health monitoring, Adv. Eng. Informat., 29, 2, pp. 239-251, (2015)
  • [2] Aslam A., Curry E., Towards a generalized approach for deep neural network based event processing for the Internet of Multimedia Things, IEEE Access, 6, pp. 25573-25587, (2018)
  • [3] Mazzia V., Khaliq A., Salvetti F., Chiaberge M., Real-time apple detection system using embedded systems with hardware accelerators: An edge AI application, IEEE Access, 8, pp. 9102-9114, (2020)
  • [4] Cheong K.H., Poeschmann S., Lai J.W., Koh J.M., Acharya U.R., Yu S.C.M., Tang K.J.W., Practical automated video analytics for crowd monitoring and counting, IEEE Access, 7, pp. 183252-183261, (2019)
  • [5] Aslam A., Curry E., A survey on object detection for the Internet of Multimedia Things (IoMT) using deep learning and event-based middleware: Approaches, challenges, and future directions, Image Vis. Comput., 106, (2021)
  • [6] Wang X., Han Y., Leung V.C.M., Niyato D., Yan X., Chen X., Convergence of edge computing and deep learning: A comprehensive survey, IEEE Commun. Surveys Tuts., 22, 2, pp. 869-904, (2020)
  • [7] Khan M.A., Baccour E., Chkirbene Z., Erbad A., Hamila R., Hamdi M., Gabbouj M., A survey on mobile edge computing for video streaming: Opportunities and challenges, IEEE Access, 10, pp. 120514-120550, (2022)
  • [8] Du K., Pervaiz A., Yuan X., Chowdhery A., Zhang Q., Hoffmann H., Jiang J., Server-driven video streaming for deep learning inference, Proc. Annu. Conf. ACM Special Interest Group Data Commun. Appl., Technol., Architectures, Protocols Comput. Commun., pp. 557-570, (2020)
  • [9] Teerapittayanon S., McDanel B., Kung H.T., Distributed deep neural networks over the cloud, the edge and end devices, Proc. IEEE 37th Int. Conf. Distrib. Comput. Syst. (ICDCS), pp. 328-339, (2017)
  • [10] Lee R., Venieris S.I., Lane N.D., Deep neural network-based enhancement for image and video streaming systems: A survey and future directions, ACM Comput. Surveys, 54, 8, pp. 1-30, (2022)