Pyramid regional graph representation learning for content-based video retrieval

被引:14
|
作者
Zhao, Guoping [1 ]
Zhang, Mingyu [1 ]
Li, Yaxian [1 ]
Liu, Jiajun [1 ,2 ,3 ]
Zhang, Bingqing [1 ]
Wen, Ji-Rong [1 ,2 ,4 ]
机构
[1] Renmin Univ China, Sch Informat, Beijing, Peoples R China
[2] Beijing Key Lab Big Data Management & Anal Method, Beijing, Peoples R China
[3] CSIRO, Data 61, Pullenvale, Australia
[4] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph embedding; Video retrieval; Regional graph; Pyramid feature map; FEATURES;
D O I
10.1016/j.ipm.2020.102488
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conventionally, it is common that video retrieval methods aggregate the visual feature representations from every frame as the feature of the video, where each frame is treated as an isolated, static image. Such methods lack the power of modeling the intra-frame and interframe relationships for the local regions, and are often vulnerable to the visual redundancy and noise caused by various types of video transformation and editing, such as adding image patches, adding banner, etc. From the perspective of video retrieval, a video's key information is more often than not convoyed by geometrically centered, dynamic visual content, and static areas often reside in regions that are farther from the center and often exhibit heavy visual redundancies temporally. This phenomenon is hardly investigated by conventional retrieval methods. In this article, we propose an unsupervised video retrieval method that simultaneously models intra-frame and inter-frame contextual information for video representation with a graph topology that is constructed on top of pyramid regional feature maps. By decomposing a frame into a pyramid regional sub-graph, and transforming a video into a regional graph, we use graph convolutional networks to extract features that incorporate information from multiple types of context. Our method is unsupervised and only uses the frame features extracted by pre-trained network. We have conducted extensive experiments and have demonstrated that the proposed method outperforms state-of-the-art video retrieval methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Content-based video retrieval by example video clip
    Dimitrova, N
    AbdelMottaleb, M
    STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES V, 1997, 3022 : 59 - 70
  • [2] A systematic review on content-based video retrieval
    Spolaor, Newton
    Lee, Huei Diana
    Resende Takaki, Weber Shoity
    Ensina, Leandro Augusto
    Rodrigues Coy, Claudio Saddy
    Wu, Feng Chung
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 90 (90)
  • [3] Content-based Video Retrieval System Research
    Kong Juan
    Han Cuiying
    ICCSIT 2010 - 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 4, 2010, : 701 - 704
  • [4] Content-Based Video Big Data Retrieval with Extensive Features and Deep Learning
    Thuong-Cang Phan
    Anh-Cang Phan
    Hung-Phi Cao
    Thanh-Ngoan Trieu
    APPLIED SCIENCES-BASEL, 2022, 12 (13):
  • [5] Content-based video retrieval via motion trajectories
    Shan, MK
    Lee, SY
    ELECTRONIC IMAGING AND MULTIMEDIA SYSTEMS II, 1998, 3561 : 52 - 61
  • [6] An integrated system for content-based video retrieval and browsing
    Zhang, HJ
    Wu, JH
    Zhong, D
    Smoliar, SW
    PATTERN RECOGNITION, 1997, 30 (04) : 643 - 658
  • [7] A Survey on Visual Content-Based Video Indexing and Retrieval
    Hu, Weiming
    Xie, Nianhua
    Li, Li
    Zeng, Xianglin
    Maybank, Stephen
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2011, 41 (06): : 797 - 819
  • [8] Content-Based Video Retrieval With Prototypes of Deep Features
    Yoon, Hyeok
    Han, Ji-Hyeong
    IEEE ACCESS, 2022, 10 : 30730 - 30742
  • [9] Content-based video retrieval based on object motion trajectory
    Lie, WN
    Hsiao, WC
    PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2002, : 237 - 240
  • [10] The automatic video shot detection and characterization for content-based video retrieval
    Sun, JF
    Cui, SY
    Xu, X
    Luo, Y
    VISUALIZATION AND OPTIMIZATION TECHNIQUES, 2001, 4553 : 313 - 320