Transformer-based vision object detection has demonstrated superior performance due to its effective removal of the need for many hand-designed components like anchor generation or a non-maximum suppression procedure. This paper presents a novel Adaptive Border-augmented Cross-attention Transformer (ABC-Trans) for vision-based object detection. By integrating the classic DETR and Deformable DETR, we design an adaptive cross-attention module to simultaneously perform global and local point sampling strategies to generate fusion features according to an estimated weight, well-capturing representative features for both small and large objects. On this basis, we further introduce a border-augmented cross-attention module to incorporate notable border features for object detection. Border features could well represent objects as well as distinguish from them backgrounds, thereby helping our model to accurately predict objects. Extensive experiments are conducted on MSCOCO and Pascal VOC datasets, and the results demonstrate the effectiveness of the proposed components, achieving promising performance as compared to the classic transformer-based detection approaches.