共 47 条
[1]
A 7nm 4-Core AI Chip with 25.6TFLOPS Hybrid FP8 Training, 102.4TOPS INT4 Inference and Workload-Aware Throttling
[J].
2021 IEEE INTERNATIONAL SOLID-STATE CIRCUITS CONFERENCE (ISSCC),
2021, 64
:144-+
[2]
[Anonymous], DEEP COMPRESSION: COMPRESSING DEEP NEURAL NETWORKS WITH PRUNING, TRAINED QUANTIZATION AND HUFFMAN CODING
[3]
Cambier L., 2020, PROC INT C LEARN REP
[6]
TrainWare: A Memory Optimized Weight Update Architecture for On-Device Convolutional Neural Network Training
[J].
PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED '18),
2018,
:104-109
[8]
Choquette W., 2020, 2020 IEEE HOT CHIPS, P1
[10]
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848