Trajectory Prediction Using Video Generation in Autonomous Driving

被引:2
作者
Iancu, David-Traian [1 ]
Nan, Mihai [1 ]
Ghita, Stefania-Alexandra [1 ]
Florea, Adina-Magda [1 ]
机构
[1] Univ Politehn Bucuresti, 313 Splaiul Independentei, Bucharest 060042, Romania
来源
STUDIES IN INFORMATICS AND CONTROL | 2022年 / 31卷 / 01期
关键词
Trajectory prediction; Video generation; Object detection; Semantic segmentation; Depth prediction; Autonomous driving;
D O I
10.24846/v31i1y202204
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Trajectory prediction for the surrounding cars is a useful task in autonomous driving for obvious reasons. The traditional methods for predicting the future trajectories of surrounding cars involved complex motion models and patterns, complex maneuvers or physical models of the car trajectories. More recent works aim to predict the future car positions by using deep learning and neural networks. In this paper, video generation models were employed, which provide an estimation of the future frames related to the car positions based on an existing video and can obtain the position of the selected cars by employing an object detection algorithm along with additional information obtained by a segmentation module that uses a semantic segmentation network. The results were validated by employing the Root Mean Square Error (RMSE) metric in order to predict the locations of the surrounding cars and estimate their depth. Apparently, this approach has never been implemented in order to obtain the trajectory and the future position of the surrounding cars in autonomous driving.
引用
收藏
页码:37 / 48
页数:12
相关论文
共 62 条
[1]   Social LSTM: Human Trajectory Prediction in Crowded Spaces [J].
Alahi, Alexandre ;
Goel, Kratarth ;
Ramanathan, Vignesh ;
Robicquet, Alexandre ;
Li Fei-Fei ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :961-971
[2]  
Altché F, 2017, IEEE INT C INTELL TR
[3]  
[Anonymous], 2020, AAAI CONF ARTIF INTE, DOI 10.1609/aaai.v34i01.5477
[4]  
[Anonymous], 2017, ARXIV170704993
[5]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[6]   Recognition of dangerous situations within a cooperative group of vehicles [J].
Batz, Thomas ;
Watson, Kym ;
Beyerer, Juergen .
2009 IEEE INTELLIGENT VEHICLES SYMPOSIUM, VOLS 1 AND 2, 2009, :907-912
[7]  
Bochkovskiy A., 2020, ARXIV 200410934
[8]   TraPHic: Trajectory Prediction in Dense and Heterogeneous Traffic Using Weighted Interactions [J].
Chandra, Rohan ;
Bhattacharya, Uttaran ;
Bera, Aniket ;
Manocha, Dinesh .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :8475-8484
[9]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[10]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223