Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments

被引：18

作者：

Nguyen, Khoi D. ^{[1
]}

Quoc-Huy Tran ^{[2
]}

Khoi Nguyen ^{[1
]}

Binh-Son Hua ^{[1
]}

Rang Nguyen ^{[1
]}

机构：

[1] VinAI Res, Hanoi, Vietnam

[2] Retrocausal Inc, Redmond, WA USA

来源：

COMPUTER VISION, ECCV 2022, PT XX | 2022年 / 13680卷

关键词：

Few-shot learning; Video classification; Appearance alignment; Temporal alignment; Inductive inference; Transductive inference;

D O I：

10.1007/978-3-031-20044-1_27

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a novel method for few-shot video classification, which performs appearance and temporal alignments. In particular, given a pair of query and support videos, we conduct appearance alignment via frame-level feature matching to achieve the appearance similarity score between the videos, while utilizing temporal order-preserving priors for obtaining the temporal similarity score between the videos. Moreover, we introduce a few-shot video classification framework that leverages the above appearance and temporal similarity scores across multiple steps, namely prototype-based training and testing as well as inductive and transductive prototype refinement. To the best of our knowledge, our work is the first to explore transductive few-shot video classification. Extensive experiments on both Kinetics and Something-Something V2 datasets show that both appearance and temporal alignments are crucial for datasets with temporal order sensitivity such as SomethingSomething V2. Our approach achieves similar or better results than previous methods on both datasets. Our code is available at https://github.com/VinAIResearch/fsvc-ata.

引用

页码：471 / 487

页数：17

共 50 条

[1] Temporal Transductive Inference for Few-Shot Video Object Segmentation
Siam, Mennatullah
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025,
[2] Transductive Few-Shot Classification on the Oblique Manifold
Qi, Guodong
Yu, Huimin
Lu, Zhaohui
Li, Shuzhao
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8392 - 8402
[3] TRANSDUCTIVE PROTOTYPICAL NETWORK FOR FEW-SHOT CLASSIFICATION
Liu, Xinyue
Liu, Pengxin
Zong, Linlin
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1671 - 1675
[4] Learning Implicit Temporal Alignment for Few-shot Video Classification
Zhang, Songyang
Zhou, Jiale
He, Xuming
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1309 - 1315
[5] STTMC: A Few-Shot Spatial Temporal Transductive Modulation Classifier
Shi, Yunhao
Xu, Hua
Qi, Zisen
Zhang, Yue
Wang, Dan
Jiang, Lei
IEEE Transactions on Machine Learning in Communications and Networking, 2024, 2 : 546 - 559
[6] Inductive Unsupervised Domain Adaptation for Few-Shot Classification via Clustering
Cong, Xin
Yu, Bowen
Liu, Tingwen
Cui, Shiyao
Tang, Hengzhu
Wang, Bin
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2020, PT II, 2021, 12458 : 624 - 639
[7] Transductive Graph-Attention Network for Few-shot Classification
Pan, Lili
Liu, Weifeng
2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 190 - 195
[8] Feature Transductive Distribution Optimization for Few-Shot Image Classification
Liu, Qing
Tang, Xianlun
Wang, Ying
Li, Xingchen
Jiang, Xinyan
Li, Weisheng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2230 - 2243
[9] Transductive clustering optimization learning for few-shot image classification
Wang, Yi
Bian, Xiong
Zhu, Songhao
JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (04)
[10] Few-Shot Malware Classification via Attention-Based Transductive Learning Network
Deng, Liting
Yu, Chengli
Wen, Hui
Xin, Mingfeng
Sun, Yue
Sun, Limin
Zhu, Hongsong
MOBILE NETWORKS & APPLICATIONS, 2024, : 1690 - 1704

← 1 2 3 4 5 →