共 107 条
- [1] Lecun Y, Bottou L, Bengio Y, Et al., Gradient-based learning applied to document recognition, Proceedings of the IEEE, 86, 11, pp. 2278-2324, (1998)
- [2] Simonyan K, Zisserman A., Very deep convolutional networks for large-scale image recognition, (2014)
- [3] Szegedy C, Liu Wei, Jia Yangqing, Et al., Going deeper with convolutions, Proc of 2015 IEEE Conf on Computer Vision and Pattern Recognition (CVPR), pp. 1-9, (2015)
- [4] Ioffe S, Szegedy C., Batch normalization: Accelerating deep network training by reducing internal covariate shift, Proc of the 32nd Int Conf on Machine Learning(ICML'15), pp. 448-456, (2015)
- [5] He Kaiming, Zhang Xiangyu, Ren Shaoqing, Et al., Deep residual learning for image recognition, Proc of 2016 IEEE Conf on Computer Vision and Pattern Recognition (CVPR), pp. 770-778, (2016)
- [6] Socher R, Pennington J, Huang E H, Et al., Semi-supervised recursive autoencoders for predicting sentiment distributions, Proc of the 2011 Conf on Empirical Methods in Natural Language Processing, pp. 151-161, (2011)
- [7] Mueller J, Thyagarajan A., Siamese recurrent architectures for learning sentence similarity, Proc of the 30th AAAI Conf on Artificial Intelligence(AAAI'16), pp. 2786-2792, (2016)
- [8] Peng Hao, Li Jianxin, He Yu, Et al., Large-scale hierarchical text classification with recursively regularized deep graph-CNN, Proc of the 2018 World Wide Web Conf (WWW'18), pp. 1063-1072, (2018)
- [9] Turc I, Chang M W, Lee K, Et al., Well-read students learn better: On the importance of pre-training compact models, (2019)
- [10] Ma Xuezhe, Hovy E., End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF, Proc of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 1064-1074, (2016)