48 entries in total
- [1] [Anonymous], 2019, PHYSIOTHER THEOR PR, DOI 10.1080/09593985.2019.1709234
- [2] [Anonymous], 2013, CoRR abs/1308.3432
- [3] Ba Jimmy Lei, 2016, LAYER NORMALIZATION, DOI 10.48550/arXiv.1607.06450
- [4] Caron Mathilde, 2020, CoRR abs/2001.03554
- [5] The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, P16301-16311
- [6] Chen Tianlong, 2020, Advances in neural information processing systems, V33, P15834
- [7] Chen XH, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), P2195
- [8] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
- [9] Elsen E, 2020, PROC CVPR IEEE, P14617, DOI 10.1109/CVPR42600.2020.01464
- [10] Frankle Jonathan, 2020, INT C MACHINE LEARNI, P3259