ABC-Trans: a novel adaptive border-augmented cross-attention transformer for object detection

被引：0

作者：

Qianjun Zhang ^{[1
]}

Pan Wang ^{[2
]}

Zihao Wu ^{[2
]}

Binhong Yang ^{[2
]}

Jin Yuan ^{[2
]}

机构：

[1] The Tenth Research Institute of China Electronics Technology Group Corporation,

[2] Hunan University,undefined

来源：

Multimedia Tools and Applications | 2025年 / 84卷 / 16期

关键词：

Object detection; Transformer; Cross-attention; Point sampling;

D O I：

10.1007/s11042-024-19405-3

中图分类号：

学科分类号：

摘要：

Transformer-based vision object detection has demonstrated superior performance due to its effective removal of the need for many hand-designed components like anchor generation or a non-maximum suppression procedure. This paper presents a novel Adaptive Border-augmented Cross-attention Transformer (ABC-Trans) for vision-based object detection. By integrating the classic DETR and Deformable DETR, we design an adaptive cross-attention module to simultaneously perform global and local point sampling strategies to generate fusion features according to an estimated weight, well-capturing representative features for both small and large objects. On this basis, we further introduce a border-augmented cross-attention module to incorporate notable border features for object detection. Border features could well represent objects as well as distinguish from them backgrounds, thereby helping our model to accurately predict objects. Extensive experiments are conducted on MSCOCO and Pascal VOC datasets, and the results demonstrate the effectiveness of the proposed components, achieving promising performance as compared to the classic transformer-based detection approaches.

引用

页码：15671 / 15688

页数：17