Pyramid regional graph representation learning for content-based video retrieval

被引:14
作者
Zhao, Guoping [1 ]
Zhang, Mingyu [1 ]
Li, Yaxian [1 ]
Liu, Jiajun [1 ,2 ,3 ]
Zhang, Bingqing [1 ]
Wen, Ji-Rong [1 ,2 ,4 ]
机构
[1] Renmin Univ China, Sch Informat, Beijing, Peoples R China
[2] Beijing Key Lab Big Data Management & Anal Method, Beijing, Peoples R China
[3] CSIRO, Data 61, Pullenvale, Australia
[4] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph embedding; Video retrieval; Regional graph; Pyramid feature map; FEATURES;
D O I
10.1016/j.ipm.2020.102488
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conventionally, it is common that video retrieval methods aggregate the visual feature representations from every frame as the feature of the video, where each frame is treated as an isolated, static image. Such methods lack the power of modeling the intra-frame and interframe relationships for the local regions, and are often vulnerable to the visual redundancy and noise caused by various types of video transformation and editing, such as adding image patches, adding banner, etc. From the perspective of video retrieval, a video's key information is more often than not convoyed by geometrically centered, dynamic visual content, and static areas often reside in regions that are farther from the center and often exhibit heavy visual redundancies temporally. This phenomenon is hardly investigated by conventional retrieval methods. In this article, we propose an unsupervised video retrieval method that simultaneously models intra-frame and inter-frame contextual information for video representation with a graph topology that is constructed on top of pyramid regional feature maps. By decomposing a frame into a pyramid regional sub-graph, and transforming a video into a regional graph, we use graph convolutional networks to extract features that incorporate information from multiple types of context. Our method is unsupervised and only uses the frame features extracted by pre-trained network. We have conducted extensive experiments and have demonstrated that the proposed method outperforms state-of-the-art video retrieval methods.
引用
收藏
页数:12
相关论文
共 50 条
[41]   Multilayer Architecture for Content-based Image Retrieval Systems [J].
Grycuk, Rafal ;
Najgebauer, Patryk ;
Nowicki, Robert ;
Scherer, Rafal .
2019 IEEE 12TH CONFERENCE ON SERVICE-ORIENTED COMPUTING AND APPLICATIONS (SOCA 2019), 2019, :119-126
[42]   An intelligent content-based image retrieval methodology using transfer learning for digital IP protection [J].
Trappey, Amy J. C. ;
Trappey, Charles, V ;
Shih, Samuel .
ADVANCED ENGINEERING INFORMATICS, 2021, 48
[43]   A Visual Saliency-Based Approach for Content-Based Image Retrieval [J].
Khan, Aamir ;
Jalal, Anand Singh .
INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2021, 15 (01) :1-15
[44]   Lightweight hashing with contrastive self-cross XLNet and combinational similarity matching for content-based video retrieval [J].
Rashmi, R. ;
Chethan, H.K. .
Australian Journal of Electrical and Electronics Engineering, 2025, 22 (02) :254-270
[45]   The Research and Application of Content-Based Multi-Feature Comprehensively Weighting Video-Image Retrieval Algorithm [J].
Zong Fang-Yi ;
Li Zheng-Bao ;
Wu Cheng-Xuan ;
Gao Hao .
PROCEEDINGS 2016 EIGHTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION ICMTMA 2016, 2016, :817-821
[46]   Automated and effective content-based image retrieval for digital mammography [J].
Singh, Vibhav Prakash ;
Srivastava, Subodh ;
Srivastava, Rajeev .
JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY, 2018, 26 (01) :29-49
[47]   Content-Based Image Retrieval Using Texture Structure Histogram [J].
Hou, Gang ;
Feng, Qinghe ;
Zhang, Xiaoxue ;
Kong, Jun ;
Zhang, Ming .
PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON MULTIMEDIA TECHNOLOGY (ICMT-13), 2013, 84 :1356-1363
[48]   Content-based multimedia information retrieval: State of the art and challenges [J].
Lew, Michael S. ;
Sebe, Nicu ;
Djeraba, Chabane ;
Jain, Ramesh .
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2006, 2 (01) :1-19
[49]   A Comparison of Histogram Distance Metrics for Content-Based Image Retrieval [J].
Zhang, Qianwen ;
Canosa, Roxanne. L. .
IMAGING AND MULTIMEDIA ANALYTICS IN A WEB AND MOBILE WORLD 2014, 2014, 9027
[50]   Content-Based Image Retrieval Using Color Volume Histograms [J].
Hua, Ji-Zhao ;
Liu, Guang-Hai ;
Song, Shu-Xiang .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (11)