Standard self-attention suffers from quadratic complexity in the number of tokens. In this paper, we propose a novel tokenization paradigm that decouples the token scope from the spatial dimension. This approach introduces dynamic tokens, which reduce computational complexity to linear while capturing multi-scale features. The paradigm is implemented in the proposed Dynamic Channel Token Vision Transformer (DCT-ViT), which combines Window Self-Attention (WSA) and Dynamic Channel Self-Attention (DCSA) to capture both fine-grained and coarse-grained features. Our hierarchical window settings in DCSA prioritize small tokens. DCT-ViT-S/B achieves 82.9%/84.3% Top-1 accuracy on ImageNet-1k (Deng et al., 2009), and 47.9/49.8 box mAP (mAP^b) and 43.4/44.6 mask mAP (mAP^m) on COCO 2017 (Lin et al., 2014) with Mask R-CNN (He et al., 2017) under the 3x schedule. Visualization of the features in DCSA shows that dynamic channel tokens recognize objects at very early stages.
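To make the complexity claim concrete, the sketch below contrasts attention over spatial tokens with attention over channel tokens. It is a generic illustration of channel-wise attention under simplifying assumptions (no learned query/key/value projections, no windows, a single head), not the DCSA module itself; the function names `spatial_attention` and `channel_attention` are hypothetical. With N spatial positions and C channels, the spatial attention map has N x N entries, whereas the channel attention map has C x C entries and is built with a cost that grows linearly in N.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def spatial_attention(x):
    """Treat each of the N spatial positions as a token (attention map: N x N)."""
    n, c = x.shape
    attn = softmax(x @ x.T / np.sqrt(c))  # (N, N): quadratic in N
    return attn @ x, attn.shape


def channel_attention(x):
    """Treat each of the C channels as a token spanning all positions (map: C x C)."""
    n, c = x.shape
    attn = softmax(x.T @ x / np.sqrt(n))  # (C, C): building it costs N * C^2
    return x @ attn.T, attn.shape         # output keeps the (N, C) layout


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    channels = 64  # assumed channel width, chosen only for illustration
    for side in (14, 28):  # doubling the feature-map side quadruples N
        x = rng.standard_normal((side * side, channels))
        _, s_shape = spatial_attention(x)
        _, c_shape = channel_attention(x)
        print(f"N={side * side}: spatial map {s_shape}, channel map {c_shape}")
```

When the feature-map side doubles, the spatial attention map grows by a factor of 16, while the channel attention map keeps the same size and only the matrix product that builds it grows with N.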