共 61 条
- [1] Bahdanau D, Cho K, Bengio Y., Neural machine translation by jointly learning to align and translate, (2014)
- [2] Ballas N, Yao L, Pal C, Courville A., Delving deeper into convolutional networks for learning video representations, (2015)
- [3] Cao Y, Wu Z, Shen C., Estimating depth from monocular images as classification using deep fully convolutional residual networks, IEEE Transactions on Circuits and Systems for Video Technology, 28, 11, pp. 3174-3182, (2018)
- [4] Casser V, Pirk S, Mahjourian R, Angelova A., Depth Prediction without the sensors: leveraging structure for unsupervised learning from monocular videos, (2018)
- [5] Chen Y, Zhao H, Hu Z., Attention-based context aggregation network for monocular depth estimation, (2019)
- [6] Chung J, Gulcehre C, Cho K, Bengio Y., Empirical evaluation of gated recurrent neural networks on sequence modeling, (2014)
- [7] Diba A, Sharma V, Gool LV, Stiefelhagen R., DynamoNet: dynamic action and motion network, The IEEE international conference on computer vision (ICCV), (2019)
- [8] Eigen D, Fergus R., Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture, 2015 IEEE international conference on computer vision (ICCV), pp. 2650-2658, (2015)
- [9] Eigen D, Puhrsch C, Fergus R., Depth map prediction from a single image using a multi-scale deep network, (2014)
- [10] Fu H, Gong M, Wang C, Batmanghelich K, Tao D., Deep ordinal regression network for monocular depth estimation, IEEE conference on computer vision and pattern recognition (CVPR), (2018)