A Forward and Backward Compatible Framework for Few-Shot Class-Incremental Pill Recognition

被引:0
作者
Zhang, Jinghua [1 ]
Liu, Li [2 ]
Gao, Kai [1 ]
Hu, Dewen [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
[2] Natl Univ Def Technol, Coll Elect Sci, Changsha 410073, Peoples R China
基金
芬兰科学院; 中国国家自然科学基金;
关键词
Power capacitors; Medical diagnostic imaging; Training; Computer vision; Hospitals; Computed tomography; Computational modeling; Benchmark testing; Visualization; Uncertainty; Automatic pill recognition (APR); class-incremental learning (CIL); computer vision; few-shot learning (FSL); pill dataset; SYSTEM;
D O I
10.1109/TNNLS.2024.3497956
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic pill recognition (APR) systems are crucial for enhancing hospital efficiency, assisting visually impaired individuals, and preventing cross-infection. However, most existing deep learning-based pill recognition systems can only perform classification on classes with sufficient training data. In practice, the high cost of data annotation and the continuous increase in new pill classes necessitate the development of a few-shot class-incremental pill recognition (FSCIPR) system. This article introduces the first FSCIPR framework, discriminative and bidirectional compatible few-shot class-incremental learning (DBC-FSCIL). It encompasses forward-compatible and backward-compatible learning components. In forward-compatible learning, we propose an innovative virtual class generation strategy and a center-triplet (CT) loss to enhance discriminative feature learning. These virtual classes serve as placeholders in the feature space for future class updates, providing diverse semantic knowledge for model training. For backward-compatible learning, we develop a strategy to synthesize reliable pseudo-features of old classes using uncertainty quantification, facilitating data replay (DR) and knowledge distillation (KD). This approach allows for the flexible synthesis of features and effectively reduces additional storage requirements for samples and models. Additionally, we construct a new pill image dataset for FSCIL and assess various mainstream FSCIL methods, establishing new benchmarks. Our experimental results demonstrate that our framework surpasses existing state-of-the-art (SOTA) methods.
引用
收藏
页数:15
相关论文
共 57 条
  • [1] A Deep Learning-Based Intelligent Medicine Recognition System for Chronic Patients
    Chang, Wan-Jung
    Chen, Liang-Bi
    Hsu, Chia-Hao
    Lin, Cheng-Pei
    Yang, Tzu-Chin
    [J]. IEEE ACCESS, 2019, 7 : 44441 - 44458
  • [2] Holistic Prototype Activation for Few-Shot Segmentation
    Cheng, Gong
    Lang, Chunbo
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4650 - 4666
  • [3] Dong SL, 2021, AAAI CONF ARTIF INTE, V35, P1255
  • [4] DyTox: Transformers for Continual Learning with DYnamic TOken eXpansion
    Douillard, Arthur
    Rame, Alexandre
    Couairon, Guillaume
    Cord, Matthieu
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9275 - 9285
  • [5] Elsayed GF, 2018, ADV NEUR IN, V31
  • [6] Prototype Bayesian Meta-Learning for Few-Shot Image Classification
    Fu, Meijun
    Wang, Xiaomin
    Wang, Jun
    Yi, Zhang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [7] A Mutually Supervised Graph Attention Network for Few-Shot Segmentation: The Perspective of Fully Utilizing Limited Samples
    Gao, Honghao
    Xiao, Junsheng
    Yin, Yuyu
    Liu, Tong
    Shi, Jiangang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 4826 - 4838
  • [8] Triplet-Center Loss for Multi-View 3D Object Retrieval
    He, Xinwei
    Zhou, Yang
    Zhou, Zhichao
    Bai, Song
    Bai, Xiang
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1945 - 1954
  • [9] Dense Network Expansion for Class Incremental Learning
    Hu, Zhiyuan
    Li, Yunsheng
    Lyu, Jiancheng
    Gao, Dashan
    Vasconcelos, Nuno
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11858 - 11867
  • [10] Huang LB, 2024, AAAI CONF ARTIF INTE, P12591