57 entries in total
- [1] Bertinetto L., Valmadre J., Henriques J.F., Vedaldi A., Torr P.H.S., Fully-convolutional Siamese networks for object tracking, Proc. ECCV Workshops, pp. 850-865, (2016)
- [2] Borsuk V., Vei R., Kupyn O., Martyniuk T., Krashenyi I., Matas J., FEAR: Fast, efficient, accurate and robust visual tracker, Proc. Eur. Conf. Comput. Vis. (ECCV), pp. 644-663, (2022)
- [3] Cui Y., Jiang C., Wang L., Wu G., MixFormer: End-to-end tracking with iterative mixed attention, Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 13608-13618, (2022)
- [4] Dosovitskiy A., et al., An image is worth 16x16 words: Transformers for image recognition at scale, (2020)
- [5] Feng M., Song K., Wang Y., Liu J., Yan Y., Learning discriminative update adaptive spatial-temporal regularized correlation filter for RGBT tracking, J. Vis. Commun. Image Represent., 72, (2020)
- [6] Feng M., Su J., Learning reliable modal weight with transformer for robust RGBT tracking, Knowl.-Based Syst., 249, (2022)
- [7] He K., Gkioxari G., Dollár P., Girshick R., Mask R-CNN, Proc. IEEE Int. Conf. Comput. Vis. (ICCV), pp. 2961-2969, (2017)
- [8] Hou R., Ren T., Wu G., MIRNet: A robust RGBT tracking jointly with multi-modal interaction and refinement, Proc. IEEE Int. Conf. Multimedia Expo (ICME), pp. 1-6, (2022)
- [9] Devlin J., Chang M.-W., Lee K., Toutanova K., BERT: Pre-training of deep bidirectional transformers for language understanding, Proc. NAACL-HLT, 1, (2019)
- [10] Kristan M., et al., The seventh visual object tracking VOT2019 challenge results, Proc. IEEE/CVF Int. Conf. Comput. Vis. Workshop (ICCVW), pp. 2206-2241, (2019)