Pyramid regional graph representation learning for content-based video retrieval

被引：14

作者：

Zhao, Guoping ^{[1
]}

Zhang, Mingyu ^{[1
]}

Li, Yaxian ^{[1
]}

Liu, Jiajun ^{[1
,2
,3
]}

Zhang, Bingqing ^{[1
]}

Wen, Ji-Rong ^{[1
,2
,4
]}

机构：

[1] Renmin Univ China, Sch Informat, Beijing, Peoples R China

[2] Beijing Key Lab Big Data Management & Anal Method, Beijing, Peoples R China

[3] CSIRO, Data 61, Pullenvale, Australia

[4] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China

来源：

INFORMATION PROCESSING & MANAGEMENT | 2021年 / 58卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Graph embedding; Video retrieval; Regional graph; Pyramid feature map; FEATURES;

D O I：

10.1016/j.ipm.2020.102488

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Conventionally, it is common that video retrieval methods aggregate the visual feature representations from every frame as the feature of the video, where each frame is treated as an isolated, static image. Such methods lack the power of modeling the intra-frame and interframe relationships for the local regions, and are often vulnerable to the visual redundancy and noise caused by various types of video transformation and editing, such as adding image patches, adding banner, etc. From the perspective of video retrieval, a video's key information is more often than not convoyed by geometrically centered, dynamic visual content, and static areas often reside in regions that are farther from the center and often exhibit heavy visual redundancies temporally. This phenomenon is hardly investigated by conventional retrieval methods. In this article, we propose an unsupervised video retrieval method that simultaneously models intra-frame and inter-frame contextual information for video representation with a graph topology that is constructed on top of pyramid regional feature maps. By decomposing a frame into a pyramid regional sub-graph, and transforming a video into a regional graph, we use graph convolutional networks to extract features that incorporate information from multiple types of context. Our method is unsupervised and only uses the frame features extracted by pre-trained network. We have conducted extensive experiments and have demonstrated that the proposed method outperforms state-of-the-art video retrieval methods.

引用

页数：12

共 50 条

[1] Content-based video retrieval by example video clip
Dimitrova, N
AbdelMottaleb, M
STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES V, 1997, 3022 : 59 - 70
[2] A systematic review on content-based video retrieval
Spolaor, Newton
Lee, Huei Diana
Resende Takaki, Weber Shoity
Ensina, Leandro Augusto
Rodrigues Coy, Claudio Saddy
Wu, Feng Chung
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 90 (90)
[3] Content-based Video Retrieval System Research
Kong Juan
Han Cuiying
ICCSIT 2010 - 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 4, 2010, : 701 - 704
[4] Content-Based Video Big Data Retrieval with Extensive Features and Deep Learning
Thuong-Cang Phan
Anh-Cang Phan
Hung-Phi Cao
Thanh-Ngoan Trieu
APPLIED SCIENCES-BASEL, 2022, 12 (13):
[5] Content-based video retrieval via motion trajectories
Shan, MK
Lee, SY
ELECTRONIC IMAGING AND MULTIMEDIA SYSTEMS II, 1998, 3561 : 52 - 61
[6] An integrated system for content-based video retrieval and browsing
Zhang, HJ
Wu, JH
Zhong, D
Smoliar, SW
PATTERN RECOGNITION, 1997, 30 (04) : 643 - 658
[7] A Survey on Visual Content-Based Video Indexing and Retrieval
Hu, Weiming
Xie, Nianhua
Li, Li
Zeng, Xianglin
Maybank, Stephen
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2011, 41 (06): : 797 - 819
[8] Content-Based Video Retrieval With Prototypes of Deep Features
Yoon, Hyeok
Han, Ji-Hyeong
IEEE ACCESS, 2022, 10 : 30730 - 30742
[9] Content-based video retrieval based on object motion trajectory
Lie, WN
Hsiao, WC
PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2002, : 237 - 240
[10] The automatic video shot detection and characterization for content-based video retrieval
Sun, JF
Cui, SY
Xu, X
Luo, Y
VISUALIZATION AND OPTIMIZATION TECHNIQUES, 2001, 4553 : 313 - 320

← 1 2 3 4 5 →