[1]
Zhou B (2018) Semantic understanding of scenes through the ADE20K dataset. Int J Comput Vis 127:302-321
[2]
Lin L (2022) Structured attention network for referring image segmentation. IEEE Trans Multimed 24:1922-1932
[3]
Hochreiter S (1997) Long short-term memory. Neural Comput 9:1735-1780
[4]
Hong R (2022) Learning to compose and reason with language tree structures for visual grounding. IEEE Trans Pattern Anal Mach Intell 44:684-696
[5]
Cao Q (2021) Interpretable visual question answering by reasoning on dependency trees. IEEE Trans Pattern Anal Mach Intell 43:887-901
[6]
Escalante HJ (2010) The segmented and annotated IAPR TC-12 benchmark. Comput Vis Image Underst 114:419-428
[7]
Everingham M (2009) The PASCAL visual object classes (VOC) challenge. Int J Comput Vis 88:303-338
[8]
Schuster M (1997) Bidirectional recurrent neural networks. IEEE Trans Signal Process 45:2673-2681