Train Distance Estimation in Turnout Area Based on Monocular Vision

被引:3
作者
Hao, Yang [1 ,2 ]
Tang, Tao [1 ]
Gao, Chunhai [2 ]
机构
[1] Beijing Jiaotong Univ, Sch Elect & Informat Engn, Beijing 100044, Peoples R China
[2] Traff Control Technol Co Ltd, Beijing 100070, Peoples R China
关键词
autonomous driving; urban railway transit; object detection; vision; instance segmentation;
D O I
10.3390/s23218778
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Train distance estimation in a turnout area is an important task for the autonomous driving of urban railway transit, since this function can assist trains in sensing the positions of other trains within the turnout area and prevent potential collision accidents. However, because of large incident angles on object surfaces and far distances, Lidar or stereo vision cannot provide satisfactory precision for such scenarios. In this paper, we propose a method for train distance estimation in a turnout area based on monocular vision: firstly, the side windows of trains in turnout areas are detected by instance segmentation based on YOLOv8; secondly, the vertical directions, the upper edges and lower edges of side windows of the train are extracted by feature extraction; finally, the distance to the target train is calculated with an appropriated pinhole camera model. The proposed method is validated by practical data captured from Hong Kong Metro Tsuen Wan Line. A dataset of 2477 images is built to train the instance segmentation neural network, and the network is able to attain an MIoU of 92.43% and a MPA of 97.47% for segmentation. The accuracy of train distance estimation is then evaluated in four typical turnout area scenarios with ground truth data from on-board Lidar. The experiment results indicate that the proposed method achieves a mean RMSE of 0.9523 m for train distance estimation in four typical turnout area scenarios, which is sufficient for determining the occupancy of crossover in turnout areas.
引用
收藏
页数:18
相关论文
共 27 条
  • [1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [2] Durmus M. S., 2010, IFAC P, P337
  • [3] Comparison of Major LiDAR Data-Driven Feature Extraction Methods for Autonomous Vehicles
    Fernandes, Duarte
    Nevoa, Rafael
    Silva, Antonio
    Simoes, Claudia
    Monteiro, Joao
    Novais, Paulo
    Melo, Pedro
    [J]. TRENDS AND INNOVATIONS IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 2, 2020, 1160 : 574 - 583
  • [4] Bounding Box Dataset Augmentation for Long-range Object Distance Estimation
    Franke, Marten
    Gopinath, Vaishnavi
    Reddy, Chaitra
    Ristic-Durrant, Danijela
    Michels, Kai
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 1669 - 1677
  • [5] Gebauer O., 2012, Autonomously Driving Trains on Open TracksConcepts, System Architecture and Implementation Aspects
  • [6] Haseeb M.A., 2018, P 10 PLANN PERC NAV
  • [7] Hongfei Gao, 2021, Proceedings of the 2021 IEEE 10th Data Driven Control and Learning Systems Conference (DDCLS), P1171, DOI 10.1109/DDCLS52934.2021.9455542
  • [8] Jocher G., 2023, License: AGPL-3.0, Version: 8.0.0
  • [9] Kudinov IA, 2020, MEDD C EMBED COMPUT, P640
  • [10] Long-Range Pose Estimation for Aerial Refueling Approaches Using Deep Neural Networks
    Lee, Andrew
    Dallmann, Will
    Nykl, Scott
    Taylor, Clark
    Borghetti, Brett
    [J]. JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2020, 17 (11): : 634 - 646