66 entries in total
[61]
Yuki Tomofumi, 2013, Languages and Compilers for Parallel Computing. 25th International Workshop (LCPC 2012). Revised Selected Papers, P17, DOI 10.1007/978-3-642-37658-0_2
[62]
AKG: Automatic Kernel Generation for Neural Processing Units using Polyhedral Transformations, 2021, PROCEEDINGS OF THE 42ND ACM SIGPLAN INTERNATIONAL CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION (PLDI '21), P1233-1248
[63]
Optimizing the Memory Hierarchy by Compositing Automatic Transformations on Computations and Data, 2020, 2020 53RD ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO 2020), P427-441
[64]
Zhao Jie, 2022, P MACHINE LEARNING S, V4, P1
[65]
Zheng LM, 2020, PROCEEDINGS OF THE 14TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDI '20), P863
[66]
Zou Yun, 2012, Proceedings of the Tenth International Symposium on Code Generation and Optimization, CGO'12, P74, DOI 10.1145/2259016.2259027