Pyramid regional graph representation learning for content-based video retrieval

被引:14
作者
Zhao, Guoping [1 ]
Zhang, Mingyu [1 ]
Li, Yaxian [1 ]
Liu, Jiajun [1 ,2 ,3 ]
Zhang, Bingqing [1 ]
Wen, Ji-Rong [1 ,2 ,4 ]
机构
[1] Renmin Univ China, Sch Informat, Beijing, Peoples R China
[2] Beijing Key Lab Big Data Management & Anal Method, Beijing, Peoples R China
[3] CSIRO, Data 61, Pullenvale, Australia
[4] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph embedding; Video retrieval; Regional graph; Pyramid feature map; FEATURES;
D O I
10.1016/j.ipm.2020.102488
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conventionally, it is common that video retrieval methods aggregate the visual feature representations from every frame as the feature of the video, where each frame is treated as an isolated, static image. Such methods lack the power of modeling the intra-frame and interframe relationships for the local regions, and are often vulnerable to the visual redundancy and noise caused by various types of video transformation and editing, such as adding image patches, adding banner, etc. From the perspective of video retrieval, a video's key information is more often than not convoyed by geometrically centered, dynamic visual content, and static areas often reside in regions that are farther from the center and often exhibit heavy visual redundancies temporally. This phenomenon is hardly investigated by conventional retrieval methods. In this article, we propose an unsupervised video retrieval method that simultaneously models intra-frame and inter-frame contextual information for video representation with a graph topology that is constructed on top of pyramid regional feature maps. By decomposing a frame into a pyramid regional sub-graph, and transforming a video into a regional graph, we use graph convolutional networks to extract features that incorporate information from multiple types of context. Our method is unsupervised and only uses the frame features extracted by pre-trained network. We have conducted extensive experiments and have demonstrated that the proposed method outperforms state-of-the-art video retrieval methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] GCViR: grid content-based video retrieval with work allocation brokering
    Toharia, Pablo
    Sanchez, Alberto
    Bosque, Jose Luis
    Robles, Oscar D.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2010, 22 (11) : 1450 - 1475
  • [22] Toward a higher-level visual representation for content-based image retrieval
    El Sayad, Ismail
    Martinet, Jean
    Urruty, Thierry
    Djeraba, Chabane
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 60 (02) : 455 - 482
  • [23] An Efficient Deep Learning-based Content-based Image Retrieval Framework
    Sivakumar, M.
    Kumar, N. M. Saravana
    Karthikeyan, N.
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 43 (02): : 683 - 700
  • [24] Enhancing Content-Based Image Retrieval Using Machine Learning Techniques
    Hu, Qinmin Vivian
    Ye, Zheng
    Huang, Xiangji Jimmy
    ACTIVE MEDIA TECHNOLOGY, 2010, 6335 : 383 - 394
  • [25] Content-based retinal image retrieval
    Sukhia, Komal Nain
    Riaz, Muhammad Mohsin
    Ghafoor, Abdul
    IET IMAGE PROCESSING, 2019, 13 (09) : 1525 - 1534
  • [26] Graph-based reasoning attention pooling with curriculum design for content-based image retrieval
    Zhu, Xiaoguang
    Wang, Haoyu
    Liu, Peilin
    Yang, Zhantao
    Qian, Jiuchao
    IMAGE AND VISION COMPUTING, 2021, 115
  • [27] WHEN CONTENT-BASED VIDEO RETRIEVAL AND HUMAN COMPUTATION UNITE: TOWARDS EFFECTIVE COLLABORATIVE VIDEO SEARCH
    Muenzer, B.
    Primus, M. J.
    Hudelist, M.
    Beecks, C.
    Huerst, W.
    Schoeffmann, K.
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
  • [28] CONTENT-BASED IMAGE RETRIEVAL USING A SIGNATURE GRAPH AND A SELF-ORGANIZING MAP
    Thanh The Van
    Thanh Manh Le
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2016, 26 (02) : 423 - 438
  • [29] Effective Image Representation using Double Colour Histogram for Content-Based Image Retrieval
    Martey, Ezekiel Mensah
    Lei, Hang
    Li, Xiaoyu
    Appiah, Obed
    INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2021, 45 (07): : 97 - 105
  • [30] A New Tool for Collaborative Video Search via Content-based Retrieval and Visual Inspection
    Hurst, Wolfgang
    Ching, Algernon Ip Vai
    Hudelist, Marco A.
    Primus, Manfred J.
    Schoeffmann, Klaus
    Beecks, Christian
    MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, : 731 - 732