共 69 条
- [31] Ascend: a Scalable and Unified Architecture for Ubiquitous Deep Neural Network Computing [J]. 2021 27TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2021), 2021, : 789 - 801
- [32] TENET: A Framework for Modeling Tensor Dataflow Based on Relation-centric Notation [J]. 2021 ACM/IEEE 48TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2021), 2021, : 720 - 733
- [33] Ma LX, 2020, PROCEEDINGS OF THE 14TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDI '20), P881
- [34] Ma NN, 2020, Arxiv, DOI arXiv:2007.11823
- [36] Markidis S, 2018, Arxiv, DOI arXiv:1803.04014
- [37] Automatically Scheduling Halide Image Processing Pipelines [J]. ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (04):
- [38] A Scheduling Framework for Spatial Architectures Across Multiple Constraint-Solving Theories [J]. ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2015, 37 (01):
- [39] Nvidia, 2022, CUTLASS
- [40] Nvidia, 2022, VOLT ARCH WHIT PAP