Multi-Level Second-Order Few-Shot Learning

被引：21

作者：

Zhang, Hongguang ^{[1
]}

Li, Hongdong ^{[2
]}

Koniusz, Piotr ^{[2
,3
]}

机构：

[1] AMS, Syst Engn Inst, Shanghai 100141, Peoples R China

[2] Australian Natl Univ, Coll Engn & Comp Sci, Canberra, ACT 0200, Australia

[3] CSIRO, Data61, Acton, ACT 2601, Australia

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2023年 / 25卷

基金：

中国国家自然科学基金;

关键词：

Task analysis; Pipelines; Image recognition; Visualization; Feature extraction; Training; Streaming media; Few-shot learning; second-order statistics; image classification; action recognition; FINE-GRAINED IMAGE; COVARIANCE; RETRIEVAL;

D O I：

10.1109/TMM.2022.3142955

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We propose a Multi-level Second-order (MlSo) few-shot learning network for supervised or unsupervised few-shot image classification and few-shot action recognition. We leverage so-called power-normalized second-order base learner streams combined with features that express multiple levels of visual abstraction, and we use self-supervised discriminating mechanisms. As Second-order Pooling (SoP) is popular in image recognition, we employ its basic element-wise variant in our pipeline. The goal of multi-level feature design is to extract feature representations at different layer-wise levels of CNN, realizing several levels of visual abstraction to achieve robust few-shot learning. As SoP can handle convolutional feature maps of varying spatial sizes, we also introduce image inputs at multiple spatial scales into MlSo. To exploit the discriminative information from multi-level and multi-scale features, we develop a Feature Matching (FM) module that reweights their respective branches. We also introduce a self-supervised step, which is a discriminator of the spatial level and the scale of abstraction. Our pipeline is trained in an end-to-end manner. With a simple architecture, we demonstrate respectable results on standard datasets such as Omniglot, mini-ImageNet, tiered-ImageNet, Open MIC, fine-grained datasets such as CUB Birds, Stanford Dogs and Cars, and action recognition datasets such as HMDB51, UCF101, and mini-MIT.

引用

页码：2111 / 2126

页数：16

共 50 条

[31] A few-shot link prediction framework to drug repurposing using multi-level attention network
Yang, Chenglin
Chen, Xianlai
Huang, Jincai
An, Ying
Huang, Zhenyu
Sun, Yu
COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 170
[32] Multi-Scale Metric Learning for Few-Shot Learning
Jiang, Wen
Huang, Kai
Geng, Jie
Deng, Xinyang
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (03) : 1091 - 1102
[33] Improving Domain-Generalized Few-Shot Text Classification with Multi-Level Distributional Signatures
Wang, Xuyang
Du, Yajun
Chen, Danroujing
Li, Xianyong
Chen, Xiaoliang
Fan, Yongquan
Xie, Chunzhi
Li, Yanli
Liu, Jia
APPLIED SCIENCES-BASEL, 2023, 13 (02):
[34] MuL-GRN: Multi-Level Graph Relation Network for Few-Shot Node Classification
Zhang, Lingling
Wang, Shaowei
Liu, Jun
Chang, Xiaojun
Lin, Qika
Wu, Yaqiang
Zheng, Qinghua
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (06) : 6085 - 6098
[35] Multi-level Semantic Fusion Network For Few-shot Multimedia Image Recognition In Education Management
Yuan, Chunlin
JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2025, 28 (02): : 227 - 235
[36] Multi-level similarity transfer and adaptive fusion data augmentation for few-shot object detection
Zhu, Songhao
Wang, Yi
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 105
[37] LLM-based Multi-Level Knowledge Generation for Few-shot Knowledge Graph Completion
Li, Qian
Chen, Zhuo
Ji, Cheng
Jiang, Shiqi
Li, Jianxin
PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 2135 - 2143
[38] Multi-Branch Network for Few-shot Learning
Ren, Kai
Guo, Zijie
Zhang, Zhimin
Zhu, Rui
Li, Xiaoxu
PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 520 - 525
[39] Few-shot Learning for Multi-Modality Tasks
Chen, Jie
Ye, Qixiang
Yang, Xiaoshan
Zhou, S. Kevin
Hong, Xiaopeng
Zhang, Li
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5673 - 5674
[40] Multi-Prototype Few-shot Learning in Histopathology
Deuschel, Jessica
Firmbach, Daniel
Geppert, Carol, I
Eckstein, Markus
Hartmann, Arndt
Bruns, Volker
Kuritcyn, Petr
Dexl, Jakob
Hartmann, David
Perrin, Dominik
Wittenberg, Thomas
Benz, Michaela
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 620 - 628

← 1 2 3 4 5 →