共 54 条
[1]
[Anonymous], PROC CVPR IEEE
[2]
Chen Kai, 2019, arXiv:1906.07155
[3]
Query-guided Regression Network with Context Policy for Phrase Grounding
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:824-832
[4]
Dai JF, 2016, ADV NEUR IN, V29
[5]
Deformable Convolutional Networks
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:764-773
[6]
Visual Grounding via Accumulated Attention
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:7746-7755
[7]
CenterNet: Keypoint Triplets for Object Detection
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:6568-6577
[8]
Aligning Linguistic Words and Visual Semantic Units for Image Captioning
[J].
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19),
2019,
:765-773
[9]
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/ICCV.2017.322, 10.1109/TPAMI.2018.2844175]
[10]
Deep Residual Learning for Image Recognition
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:770-778