Pyramid regional graph representation learning for content-based video retrieval

被引:14
作者
Zhao, Guoping [1 ]
Zhang, Mingyu [1 ]
Li, Yaxian [1 ]
Liu, Jiajun [1 ,2 ,3 ]
Zhang, Bingqing [1 ]
Wen, Ji-Rong [1 ,2 ,4 ]
机构
[1] Renmin Univ China, Sch Informat, Beijing, Peoples R China
[2] Beijing Key Lab Big Data Management & Anal Method, Beijing, Peoples R China
[3] CSIRO, Data 61, Pullenvale, Australia
[4] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph embedding; Video retrieval; Regional graph; Pyramid feature map; FEATURES;
D O I
10.1016/j.ipm.2020.102488
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conventionally, it is common that video retrieval methods aggregate the visual feature representations from every frame as the feature of the video, where each frame is treated as an isolated, static image. Such methods lack the power of modeling the intra-frame and interframe relationships for the local regions, and are often vulnerable to the visual redundancy and noise caused by various types of video transformation and editing, such as adding image patches, adding banner, etc. From the perspective of video retrieval, a video's key information is more often than not convoyed by geometrically centered, dynamic visual content, and static areas often reside in regions that are farther from the center and often exhibit heavy visual redundancies temporally. This phenomenon is hardly investigated by conventional retrieval methods. In this article, we propose an unsupervised video retrieval method that simultaneously models intra-frame and inter-frame contextual information for video representation with a graph topology that is constructed on top of pyramid regional feature maps. By decomposing a frame into a pyramid regional sub-graph, and transforming a video into a regional graph, we use graph convolutional networks to extract features that incorporate information from multiple types of context. Our method is unsupervised and only uses the frame features extracted by pre-trained network. We have conducted extensive experiments and have demonstrated that the proposed method outperforms state-of-the-art video retrieval methods.
引用
收藏
页数:12
相关论文
共 50 条
[21]   Efficient algorithms for content-based video retrieval using motion information [J].
Jeong, JM ;
Moon, YS .
IEICE TRANSACTIONS ON COMMUNICATIONS, 2003, E86B (02) :876-879
[22]   Toward a higher-level visual representation for content-based image retrieval [J].
El Sayad, Ismail ;
Martinet, Jean ;
Urruty, Thierry ;
Djeraba, Chabane .
MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 60 (02) :455-482
[23]   An Efficient Deep Learning-based Content-based Image Retrieval Framework [J].
Sivakumar, M. ;
Kumar, N. M. Saravana ;
Karthikeyan, N. .
COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 43 (02) :683-700
[24]   Enhancing Content-Based Image Retrieval Using Machine Learning Techniques [J].
Hu, Qinmin Vivian ;
Ye, Zheng ;
Huang, Xiangji Jimmy .
ACTIVE MEDIA TECHNOLOGY, 2010, 6335 :383-394
[25]   Content-based retinal image retrieval [J].
Sukhia, Komal Nain ;
Riaz, Muhammad Mohsin ;
Ghafoor, Abdul .
IET IMAGE PROCESSING, 2019, 13 (09) :1525-1534
[26]   Graph-based reasoning attention pooling with curriculum design for content-based image retrieval [J].
Zhu, Xiaoguang ;
Wang, Haoyu ;
Liu, Peilin ;
Yang, Zhantao ;
Qian, Jiuchao .
IMAGE AND VISION COMPUTING, 2021, 115
[27]   WHEN CONTENT-BASED VIDEO RETRIEVAL AND HUMAN COMPUTATION UNITE: TOWARDS EFFECTIVE COLLABORATIVE VIDEO SEARCH [J].
Muenzer, B. ;
Primus, M. J. ;
Hudelist, M. ;
Beecks, C. ;
Huerst, W. ;
Schoeffmann, K. .
2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
[28]   CONTENT-BASED IMAGE RETRIEVAL USING A SIGNATURE GRAPH AND A SELF-ORGANIZING MAP [J].
Thanh The Van ;
Thanh Manh Le .
INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2016, 26 (02) :423-438
[29]   Effective Image Representation using Double Colour Histogram for Content-Based Image Retrieval [J].
Martey, Ezekiel Mensah ;
Lei, Hang ;
Li, Xiaoyu ;
Appiah, Obed .
INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2021, 45 (07) :97-105
[30]   A New Tool for Collaborative Video Search via Content-based Retrieval and Visual Inspection [J].
Hurst, Wolfgang ;
Ching, Algernon Ip Vai ;
Hudelist, Marco A. ;
Primus, Manfred J. ;
Schoeffmann, Klaus ;
Beecks, Christian .
MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, :731-732