Deep learning based object detection method and its application for intelligent transport systems

被引:2
作者
Kim J.-Y. [1 ]
Kim S.-H. [1 ]
机构
[1] School of AI Convergence, College of Information Technology, Soongsil University
基金
新加坡国家研究基金会;
关键词
Convolutional Neural Network; Event detection; Intelligent Transport System; Object Awareness; Velocity Estimation;
D O I
10.5302/J.ICROS.2021.21.0145
中图分类号
学科分类号
摘要
Most machine vision solutions for intelligent transport systems begin with the extraction of hand-crafted visual features from traffic scenes. In this study, the category-specific information for detecting specific visual patterns was investigated by replacing machine vision solutions with a standard convolutional neural network (CNN)-based object detector, and three practical applications of the system were developed. First, our system learned important categories, such as pedestrian and vehicles, for traffic monitoring and management. In addition, while collecting related databases, we efficiently performed data augmentation and improved the recognition accuracy of the system for several user-defined events. Further, the displacement of the detected positions between consecutive frames was converted into the real-world distance to compute the physical velocity of a vehicle. Second, we developed a vision-based system for a real-time lane-level traffic congestion measurement. After tracking the detected vehicles, the estimated velocities of vehicles for each lane were averaged. Subsequently, traffic congestion was determined based on the number of detected vehicles and averaged velocity. Third, we presented a context-aware method for background maintenance. To handle dynamic background objects, we utilized a 2D object detector to identify the category-specific background patterns. The key observation was that the detected regions do not belong to a true background. Hence, we developed a new confidence map to update the static background model and exclude pre-learnt background objects for conventional background subtraction methods. In the study, more than eight user-defined events were suggested by the combination of traditional machine vision techniques and deep learning-based object detectors with a substantial number of training images. In addition, our key ideas were validated using various datasets, such as five different scenes for lane-level traffic congestion and two CCTV image sequences for object-aware background subtraction and unseen object detection in the challenging traffic congestion. Lastly, the suggested applications of this system for intelligent transport systems were successfully demonstrated. © ICROS 2021.
引用
收藏
页码:1016 / 1022
页数:6
相关论文
共 14 条
  • [1] Datondji S., Dupuis Y., Subirats P., Vasseur P., A survey of vision-based traffic monitoring of road intersections, IEEE Transactions on Intelligent Transportation Systems (T-ITS), 17, 10, (2016)
  • [2] Lee K.-M., Lin C.-H., A new boosted 3-D PCA detection system for pedestrian traffic safety, Journal of Institute of Control, Robotics and Systems (In Korean), 27, 4, pp. 292-297, (2021)
  • [3] Cho Y.J., Kim J.W., A study on the classification of Targetobjects with the deep-learning model in the vision-images, Journal of Institute of Control, Robotics and Systems (In Korean), 22, 2, pp. 20-25, (2021)
  • [4] Afrin T., Yodo N., A survey of road traffic congestion measures towards a sustainable and resilient transportation system, Sustainability, (2020)
  • [5] Mandal M., Vipparthi S.K., An empirical review of deep learning frameworks for change detection: Model design, experimental frameworks, challenges and research needs, IEEE Transactions on Intelligent Transportation Systems (T-ITS) 2021
  • [6] Bochkovskiy A., Wang C.-.Y., Liao H.-.Y.M., Yolov4: Optimal Speed and Accuracy of Object Detection, (2020)
  • [7] Everingham M., Gool L.V., Williams C.K.I., Winn J., Zisserman A., The pascal Visual Object Classes (VOC) challenge, International Journal of Computer Vision (IJCV), 88, 2, pp. 303-338, (2010)
  • [8] Lin T.-.Y., Maire M., Belongie S., Hays J., Perona P., Ramanan D., Dollar P., Zitnick C.L., MS COCO: Common objects in context, European Conference on Computer Vision (ECCV), (2014)
  • [9] Kuznetsova A., Rom H., Alldirn N., Uijlings J., Krasin I., Pont-Tuset J., Kamali S., Popov S., Malloci M., Kolesnikov A., Duerig T., Ferrari V., The open images dataset v4, International Journal of Computer Vision (IJCV), 128, 7, (2020)
  • [10] Kim S.-H., Choe G., Park M., Kweon I.S., Salient view selection for visual recognition of industrial components, IEEE Robotics and Automation Letters (RAL), 5, 2, pp. 2506-2513, (2020)