A Forward and Backward Compatible Framework for Few-Shot Class-Incremental Pill Recognition

Cited by: 0

Authors:

Zhang, Jinghua [1]

Liu, Li [2]

Gao, Kai [1]

Hu, Dewen [1]

Affiliations:

[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China

[2] Natl Univ Def Technol, Coll Elect Sci, Changsha 410073, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024

Funding:

Academy of Finland; National Natural Science Foundation of China;

Keywords:

Power capacitors; Medical diagnostic imaging; Training; Computer vision; Hospitals; Computed tomography; Computational modeling; Benchmark testing; Visualization; Uncertainty; Automatic pill recognition (APR); class-incremental learning (CIL); computer vision; few-shot learning (FSL); pill dataset; SYSTEM;

DOI:

10.1109/TNNLS.2024.3497956

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Automatic pill recognition (APR) systems are crucial for enhancing hospital efficiency, assisting visually impaired individuals, and preventing cross-infection. However, most existing deep learning-based pill recognition systems can only perform classification on classes with sufficient training data. In practice, the high cost of data annotation and the continuous increase in new pill classes necessitate the development of a few-shot class-incremental pill recognition (FSCIPR) system. This article introduces the first FSCIPR framework, discriminative and bidirectional compatible few-shot class-incremental learning (DBC-FSCIL). It encompasses forward-compatible and backward-compatible learning components. In forward-compatible learning, we propose an innovative virtual class generation strategy and a center-triplet (CT) loss to enhance discriminative feature learning. These virtual classes serve as placeholders in the feature space for future class updates, providing diverse semantic knowledge for model training. For backward-compatible learning, we develop a strategy to synthesize reliable pseudo-features of old classes using uncertainty quantification, facilitating data replay (DR) and knowledge distillation (KD). This approach allows for the flexible synthesis of features and effectively reduces additional storage requirements for samples and models. Additionally, we construct a new pill image dataset for FSCIL and assess various mainstream FSCIL methods, establishing new benchmarks. Our experimental results demonstrate that our framework surpasses existing state-of-the-art (SOTA) methods.

Pages: 15

References: 57 in total
