共 38 条
- [1] Szegedy C., Liu W., Jia Y., Sermanet P., Reed S., Anguelov D., Erhan D., Vanhoucke V., Rabinovich A., Going deeper with convolutions, In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)., pp. 1-9, (2015)
- [2] Simonyan K., Zisserman A., Very deep convolutional networks for large-scale image recognition, . In: Bengio Y, Lecun Y. (Eds.) 3Rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, 2015, Conference Track Proceedings, (2015)
- [3] Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A.N., Kaiser L., Polosukhin I., Attention is all you need, In: Proceedings of the 31St International Conference on Neural Information Processing Systems. NIPS’17., pp. 6000-6010, (2017)
- [4] Brown T.B., Mann B., Ryder N., Subbiah M., Kaplan J., Dhariwal P., Neelakantan A., Shyam P., Sastry G., Askell A., Agarwal S., Herbert-Voss A., Krueger G., Henighan T., Child R., Ramesh A., Ziegler D.M., Wu J., Winter C., Hesse C., Chen M., Sigler E., Litwin M., Gray S., Chess B., Clark J., Berner C., McCandlish S., Radford A., Sutskever I., Amodei D., Language Models are Few-Shot Learners, (2020)
- [5] Brown T.B., Mann B., Ryder N., Subbiah M., Kaplan J., Dhariwal P., Neelakantan A., Shyam P., Sastry G., Askell A., Agarwal S., Herbert-Voss A., Krueger G., Henighan T., Child R., Ramesh A., Ziegler D.M., Wu J., Winter C., Hesse C., Chen M., Sigler E., Litwin M., Gray S., Chess B., Clark J., Berner C., McCandlish S., Radford A., Sutskever I., Amodei D., Language models are few-shot learners, . In: Proceedings of the 34Th International Conference on Neural Information Processing Systems. NIPS’20.
- [6] He K., Zhang X., Ren S., Sun J., Deep residual learning for image recognition, 2016 IEEE conference on computer vision and pattern recognition (CVPR), pp. 770-778, (2016)
- [7] He K., Zhang X., Ren S., Sun J., Identity mappings in deep residual networks, Computer vision–ECCV 2016, pp. 630-645, (2016)
- [8] Ramesh A., Pavlov M., Goh G., Gray S., Voss C., Radford A., Chen M., Sutskever I., Zero-shot text-to-image generation, Proceedings of the 38Th International Conference on Machine Learning. Proceedings of Machine Learning Research, PMLR, Virtual Only, 139, pp. 8821-8831, (2021)
- [9] Radford A., Wu J., Child R., Luan D., Amodei D., Sutskever I., Language Models are Unsupervised Multitask Learners, (2019)
- [10] Li H., Xu Z., Taylor G., Studer C., Goldstein T., Visualizing the loss landscape of neural nets, Proceedings of the 32St International Conference on Neural Information Processing Systems. NIPS’18, 31, (2018)