Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments

被引:18
|
作者
Nguyen, Khoi D. [1 ]
Quoc-Huy Tran [2 ]
Khoi Nguyen [1 ]
Binh-Son Hua [1 ]
Rang Nguyen [1 ]
机构
[1] VinAI Res, Hanoi, Vietnam
[2] Retrocausal Inc, Redmond, WA USA
来源
COMPUTER VISION, ECCV 2022, PT XX | 2022年 / 13680卷
关键词
Few-shot learning; Video classification; Appearance alignment; Temporal alignment; Inductive inference; Transductive inference;
D O I
10.1007/978-3-031-20044-1_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel method for few-shot video classification, which performs appearance and temporal alignments. In particular, given a pair of query and support videos, we conduct appearance alignment via frame-level feature matching to achieve the appearance similarity score between the videos, while utilizing temporal order-preserving priors for obtaining the temporal similarity score between the videos. Moreover, we introduce a few-shot video classification framework that leverages the above appearance and temporal similarity scores across multiple steps, namely prototype-based training and testing as well as inductive and transductive prototype refinement. To the best of our knowledge, our work is the first to explore transductive few-shot video classification. Extensive experiments on both Kinetics and Something-Something V2 datasets show that both appearance and temporal alignments are crucial for datasets with temporal order sensitivity such as SomethingSomething V2. Our approach achieves similar or better results than previous methods on both datasets. Our code is available at https://github.com/VinAIResearch/fsvc-ata.
引用
收藏
页码:471 / 487
页数:17
相关论文
共 50 条
  • [1] Temporal Transductive Inference for Few-Shot Video Object Segmentation
    Siam, Mennatullah
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025,
  • [2] Transductive Few-Shot Classification on the Oblique Manifold
    Qi, Guodong
    Yu, Huimin
    Lu, Zhaohui
    Li, Shuzhao
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8392 - 8402
  • [3] TRANSDUCTIVE PROTOTYPICAL NETWORK FOR FEW-SHOT CLASSIFICATION
    Liu, Xinyue
    Liu, Pengxin
    Zong, Linlin
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1671 - 1675
  • [4] Learning Implicit Temporal Alignment for Few-shot Video Classification
    Zhang, Songyang
    Zhou, Jiale
    He, Xuming
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1309 - 1315
  • [5] STTMC: A Few-Shot Spatial Temporal Transductive Modulation Classifier
    Shi, Yunhao
    Xu, Hua
    Qi, Zisen
    Zhang, Yue
    Wang, Dan
    Jiang, Lei
    IEEE Transactions on Machine Learning in Communications and Networking, 2024, 2 : 546 - 559
  • [6] Inductive Unsupervised Domain Adaptation for Few-Shot Classification via Clustering
    Cong, Xin
    Yu, Bowen
    Liu, Tingwen
    Cui, Shiyao
    Tang, Hengzhu
    Wang, Bin
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2020, PT II, 2021, 12458 : 624 - 639
  • [7] Transductive Graph-Attention Network for Few-shot Classification
    Pan, Lili
    Liu, Weifeng
    2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 190 - 195
  • [8] Feature Transductive Distribution Optimization for Few-Shot Image Classification
    Liu, Qing
    Tang, Xianlun
    Wang, Ying
    Li, Xingchen
    Jiang, Xinyan
    Li, Weisheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2230 - 2243
  • [9] Transductive clustering optimization learning for few-shot image classification
    Wang, Yi
    Bian, Xiong
    Zhu, Songhao
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (04)
  • [10] Few-Shot Malware Classification via Attention-Based Transductive Learning Network
    Deng, Liting
    Yu, Chengli
    Wen, Hui
    Xin, Mingfeng
    Sun, Yue
    Sun, Limin
    Zhu, Hongsong
    MOBILE NETWORKS & APPLICATIONS, 2024, : 1690 - 1704