ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning

被引:39
|
作者
Kordopatis-Zilos, Giorgos [1 ,2 ]
Papadopoulos, Symeon [1 ]
Patras, Ioannis [2 ]
Kompatsiaris, Ioannis [1 ]
机构
[1] CERTH, Informat Technol Inst, Thessaloniki, Greece
[2] Queen Mary Univ London, Mile End Rd, London E1 4NS, England
基金
英国工程与自然科学研究理事会; 欧盟地平线“2020”;
关键词
COPY DETECTION; LOCALIZATION; RETRIEVAL; FEATURES;
D O I
10.1109/ICCV.2019.00645
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we introduce ViSiL, a Video Similarity Learning architecture that considers fine-grained SpatioTemporal relations between pairs of videos - such relations are typically lost in previous video retrieval approaches that embed the whole frame or even the whole video into a vector descriptor before the similarity estimation. By contrast, our Convolutional Neural Network (CNN)-based approach is trained to calculate video-to-video similarity from refined frame-to-frame similarity matrices, so as to consider both intra- and inter-frame relations. In the proposed method, pairwise frame similarity is estimated by applying Tensor Dot (TD) followed by Chamfer Similarity (CS) on regional CNN frame features - this avoids feature aggregation before the similarity calculation between frames. Subsequently, the similarity matrix between all video frames is fed to a four-layer CNN, and then summarized using Chamfer Similarity (CS) into a video-to-video similarity score - this avoids feature aggregation before the similarity calculation between videos and captures the temporal similarity patterns between matching frame sequences. We train the proposed network using a triplet loss scheme and evaluate it on five public benchmark datasets on four different video retrieval problems where we demonstrate large improvements in comparison to the state of the art. The implementation of ViSiL is publicly available(1).
引用
收藏
页码:6360 / 6369
页数:10
相关论文
共 50 条
  • [1] Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition
    Li, Tianjiao
    Foo, Lin Geng
    Ke, Qiuhong
    Rahmani, Hossein
    Wang, Anran
    Wang, Jinghua
    Liu, Jun
    COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 : 386 - 403
  • [2] ENFIRE: A Spatio-Temporal Fine-Grained Reconfigurable Hardware
    Qian, Wenchao
    Babecki, Christopher
    Karam, Robert
    Paul, Somnath
    Bhunia, Swarup
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2017, 25 (01) : 177 - 188
  • [3] FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing
    Zhang, Mingyuan
    Li, Huirong
    Cai, Zhongang
    Ren, Jiawei
    Yang, Lei
    Liu, Ziwei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [4] Spatio-Temporal Detection of Fine-Grained Dyadic Human Interactions
    van Gemeren, Coert
    Poppe, Ronald
    Veltkamp, Remco C.
    HUMAN BEHAVIOR UNDERSTANDING, 2016, 9997 : 116 - 133
  • [5] RADIAL LOSS FOR LEARNING FINE-GRAINED VIDEO SIMILARITY METRIC
    Jain, Abhinav
    Agarwal, Prerna
    Mujumdar, Shashank
    Gupta, Nitin
    Mehta, Sameep
    Chattopadhyay, Chiranjoy
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1652 - 1656
  • [6] Fine-Grained Spatio-Temporal Parsing Network for Action Quality Assessment
    Gedamu, Kumie
    Ji, Yanli
    Yang, Yang
    Shao, Jie
    Shen, Heng Tao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 6386 - 6400
  • [7] Predicting Fine-Grained Traffic Conditions via Spatio-Temporal LSTM
    Wei, Xiaojuan
    Li, Jinglin
    Yuan, Quan
    Chen, Kaihui
    Zhou, Ao
    Yang, Fangchun
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2019, 2019
  • [8] A novel framework for fine-grained spatio-temporal change detection in satellite images
    Agarwal, Riya
    Jindal, Shaifali
    Narain, Shradha
    Kaushal, Rishabh
    Yadav, Kalpana
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (1) : 1241 - 1260
  • [9] Secure fine-grained spatio-temporal Top-k queries in TMWSNs
    Ma, Xingpo
    Liang, Junbin
    Wang, Jianxin
    Wen, Sheng
    Wang, Tian
    Li, Yin
    Ma, Wenpeng
    Qi, Chuanda
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 86 : 174 - 184
  • [10] A novel framework for fine-grained spatio-temporal change detection in satellite images
    Riya Agarwal
    Shaifali Jindal
    Shradha Narain
    Rishabh Kaushal
    Kalpana Yadav
    Multimedia Tools and Applications, 2024, 83 : 1241 - 1260