ABC-Trans: a novel adaptive border-augmented cross-attention transformer for object detection

被引:0
作者
Qianjun Zhang [1 ]
Pan Wang [2 ]
Zihao Wu [2 ]
Binhong Yang [2 ]
Jin Yuan [2 ]
机构
[1] The Tenth Research Institute of China Electronics Technology Group Corporation,
[2] Hunan University,undefined
关键词
Object detection; Transformer; Cross-attention; Point sampling;
D O I
10.1007/s11042-024-19405-3
中图分类号
学科分类号
摘要
Transformer-based vision object detection has demonstrated superior performance due to its effective removal of the need for many hand-designed components like anchor generation or a non-maximum suppression procedure. This paper presents a novel Adaptive Border-augmented Cross-attention Transformer (ABC-Trans) for vision-based object detection. By integrating the classic DETR and Deformable DETR, we design an adaptive cross-attention module to simultaneously perform global and local point sampling strategies to generate fusion features according to an estimated weight, well-capturing representative features for both small and large objects. On this basis, we further introduce a border-augmented cross-attention module to incorporate notable border features for object detection. Border features could well represent objects as well as distinguish from them backgrounds, thereby helping our model to accurately predict objects. Extensive experiments are conducted on MSCOCO and Pascal VOC datasets, and the results demonstrate the effectiveness of the proposed components, achieving promising performance as compared to the classic transformer-based detection approaches.
引用
收藏
页码:15671 / 15688
页数:17
相关论文
empty
未找到相关数据