MulTCIM: Digital Computing-in-Memory-Based Multimodal Transformer Accelerator With Attention-Token-Bit Hybrid Sparsity
Cited by: 7
Authors:
Tu, Fengbin [1,2]
Wu, Zihan [1]
Wang, Yiqi [1]
Wu, Weiwei [1]
Liu, Leibo [1]
Hu, Yang [1]
Wei, Shaojun [1]
Yin, Shouyi [1]
Affiliations:
[1] Tsinghua Univ, Sch Integrated Circuits, Beijing 100084, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
Multimodal Transformers are emerging artificial intelligence (AI) models that comprehend a mixture of signals from different modalities such as vision, natural language, and speech. Their attention mechanism and massive matrix multiplications (MMs) incur high latency and energy consumption. Prior work has shown that a digital computing-in-memory (CIM) network can be an efficient architecture for processing Transformers while maintaining high accuracy. To further improve energy efficiency, the attention-token-bit hybrid sparsity in multimodal Transformers can be exploited. This hybrid sparsity significantly reduces computation, but its irregularity also harms CIM utilization. To fully utilize the attention-token-bit hybrid sparsity of multimodal Transformers, we design a digital CIM-based accelerator called MulTCIM with three corresponding features. The long reuse elimination dynamically reshapes the attention pattern to improve CIM utilization. The runtime token pruner (RTP) removes insignificant tokens, and the modal-adaptive CIM network (MACN) exploits symmetric modal overlapping to reduce CIM idleness. The effective bitwidth-balanced CIM (EBB-CIM) macro balances input bits across in-memory multiply-accumulations (MACs) to reduce computation time. The fabricated MulTCIM consumes only 2.24 μJ/Token for the ViLBERT-base model, achieving 2.50×-5.91× lower energy than previous Transformer accelerators and digital CIM accelerators.
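The token-pruning idea mentioned in the abstract can be illustrated with a small software sketch. The abstract does not say how the RTP decides which tokens are insignificant, so the scoring rule used here (the mean attention probability each token receives) and the names `token_scores`, `prune_tokens`, and `keep_ratio` are assumptions made for illustration only, not the authors' hardware method.

```python
import math
from typing import List


def token_scores(attn: List[List[float]]) -> List[float]:
    """Mean attention each token (column) receives across all queries (rows).

    `attn` is assumed to be a row-stochastic attention probability matrix.
    """
    n_rows = len(attn)
    n_cols = len(attn[0])
    return [sum(attn[r][c] for r in range(n_rows)) / n_rows for c in range(n_cols)]


def prune_tokens(attn: List[List[float]], keep_ratio: float) -> List[int]:
    """Return the indices of the tokens to keep, in their original order.

    Keeps the ceil(keep_ratio * n) highest-scoring tokens, and always at
    least one token so later layers still have an input.
    """
    scores = token_scores(attn)
    k = max(1, math.ceil(keep_ratio * len(scores)))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


# Hypothetical 4-token attention matrix; each row sums to 1.
attn = [
    [0.4, 0.1, 0.4, 0.1],
    [0.5, 0.1, 0.3, 0.1],
    [0.3, 0.2, 0.4, 0.1],
    [0.4, 0.1, 0.3, 0.2],
]
kept = prune_tokens(attn, keep_ratio=0.5)
print(kept)
```

In this toy input, tokens 0 and 2 receive most of the attention, so they are the ones kept at `keep_ratio=0.5`. Dropping the other two tokens shrinks every later matrix multiplication, which is the source of the computation savings the abstract attributes to token sparsity.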
Wu, Hao
Affiliations: Chinese Acad Sci, Inst Microelect, Lab Microelect Device & Integrated Technol, Beijing 100029, Peoples R China; Univ Chinese Acad Sci, Sch Integrated Circuit, Beijing 100049, Peoples R China
Chen, Yong
Affiliations: Univ Macau, State Key Lab Analog & Mixed Signal VLSI & IME ECE, Macau, Peoples R China
Yuan, Yiyang
Affiliations: Chinese Acad Sci, Inst Microelect, Lab Microelect Device & Integrated Technol, Beijing 100029, Peoples R China; Univ Chinese Acad Sci, Sch Integrated Circuit, Beijing 100049, Peoples R China
Yue, Jinshan
Affiliations: Chinese Acad Sci, Inst Microelect, Lab Microelect Device & Integrated Technol, Beijing 100029, Peoples R China; Univ Chinese Acad Sci, Sch Integrated Circuit, Beijing 100049, Peoples R China
Fu, Xiangqu
Affiliations: Chinese Acad Sci, Inst Microelect, Lab Microelect Device & Integrated Technol, Beijing 100029, Peoples R China; Univ Chinese Acad Sci, Sch Integrated Circuit, Beijing 100049, Peoples R China
Ren, Qirui
Affiliations: Chinese Acad Sci, Inst Microelect, Lab Microelect Device & Integrated Technol, Beijing 100029, Peoples R China; Univ Chinese Acad Sci, Sch Integrated Circuit, Beijing 100049, Peoples R China
Luo, Qing
Affiliations: Chinese Acad Sci, Inst Microelect, Lab Microelect Device & Integrated Technol, Beijing 100029, Peoples R China; Univ Chinese Acad Sci, Sch Integrated Circuit, Beijing 100049, Peoples R China
Mak, Pui-In
Wang, Xinghua
Affiliations: Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
Zhang, Feng
Affiliations: Chinese Acad Sci, Inst Microelect, Lab Microelect Device & Integrated Technol, Beijing 100029, Peoples R China; Univ Chinese Acad Sci, Sch Integrated Circuit, Beijing 100049, Peoples R China
Liu, Shiwei
Affiliations: Fudan Univ, State Key Lab Integrated Chips & Syst, Shanghai 200433, Peoples R China
Mu, Chen
Affiliations: Fudan Univ, State Key Lab Integrated Chips & Syst, Shanghai 200433, Peoples R China
Jiang, Hao
Affiliations: Fudan Univ, State Key Lab Integrated Chips & Syst, Shanghai 200433, Peoples R China
Wang, Yunzhengmao
Affiliations: Fudan Univ, State Key Lab Integrated Chips & Syst, Shanghai 200433, Peoples R China
Zhang, Jinshan
Affiliations: Fudan Univ, State Key Lab Integrated Chips & Syst, Shanghai 200433, Peoples R China
Lin, Feng
Affiliations: Fudan Univ, State Key Lab Integrated Chips & Syst, Shanghai 200433, Peoples R China
Zhou, Keji
Affiliations: Fudan Univ, State Key Lab Integrated Chips & Syst, Shanghai 200433, Peoples R China; Zhangjiang Lab, Shanghai 201210, Peoples R China
Liu, Qi
Affiliations: Fudan Univ, State Key Lab Integrated Chips & Syst, Shanghai 200433, Peoples R China; Zhangjiang Lab, Shanghai 201210, Peoples R China
Chen, Chixiao
Affiliations: Fudan Univ, State Key Lab Integrated Chips & Syst, Shanghai 200433, Peoples R China; Zhangjiang Lab, Shanghai 201210, Peoples R China