49 entries in total
[31]
VideoBERT: A Joint Model for Video and Language Representation Learning. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 7463-7472
[32]
Beyond Part Models: Person Retrieval with Refined Part Pooling (and A Strong Convolutional Baseline). COMPUTER VISION - ECCV 2018, PT IV, 2018, 11208: 501-518
[33]
Tan H, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P5100
[34]
Touvron H, 2021, PR MACH LEARN RES, V139, P7358
[35]
Vaswani A, 2017, ADV NEUR IN, V30
[36]
Wang Chengji, 2021, IJCAI
[37]
Learning Discriminative Features with Multiple Granularities for Person Re-Identification. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018: 274-282
[39]
Wang P, 2022, IEEE Transactions on Multimedia
[40]
Wang Pengfei, 2022, IEEE T MULTIMEDIA