In deep learning, traditional image classification tasks typically require extensive annotated datasets and complex model training processes, which pose significant challenges for deployment on resource-constrained edge devices. To address these challenges, this study introduces a few-shot learning method based on OpenAI's CLIP model that significantly reduces computational demands by eliminating the need to run a text encoder at the inference stage. By pre-computing the embedding centers of the classification text from a small set of image-text data, our approach enables efficient image classification using only CLIP's image encoder and the pre-calculated text embeddings. This adaptation not only allows high-precision classification on edge devices with limited computing capability but also achieves accuracy and recall rates closely approximating those of a pre-trained ResNet approach while using far less data. Furthermore, by avoiding the text encoder during inference, our method halves the memory usage compared with other large-scale vision models of similar capacity, making it particularly suitable for low-resource environments. This comparative advantage underscores the efficiency of our approach in handling few-shot image classification tasks, demonstrating both competitive accuracy and practical viability in resource-limited settings. The outcomes of this research not only highlight the potential of the CLIP model in few-shot learning scenarios but also pave a new path for efficient, low-resource deep learning applications in edge computing environments.
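The sketch below illustrates the general idea of this pipeline in a minimal form, assuming the openai/CLIP Python package and a generic "a photo of a {class}" prompt template; the class names, file paths, and the exact way the embedding centers are derived from the few-shot image-text data are illustrative assumptions, not the paper's exact procedure. The key point it shows is that the text encoder runs only once offline, while the edge device needs just the image encoder and a small tensor of cached class centers.

```python
import torch
import clip  # assumes the openai/CLIP package (pip install git+https://github.com/openai/CLIP.git)
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

class_names = ["cat", "dog", "bird"]  # hypothetical label set

# --- Offline step: pre-compute one normalized embedding center per class from text prompts ---
with torch.no_grad():
    prompts = clip.tokenize([f"a photo of a {c}" for c in class_names]).to(device)
    centers = model.encode_text(prompts)
    centers = centers / centers.norm(dim=-1, keepdim=True)
# Only this small tensor plus the image encoder need to be shipped to the edge device.
torch.save(centers.cpu(), "class_centers.pt")

# --- Inference step: only the image encoder runs; the text encoder is never invoked ---
centers = torch.load("class_centers.pt").to(device)
image = preprocess(Image.open("example.jpg")).unsqueeze(0).to(device)
with torch.no_grad():
    feat = model.encode_image(image)
    feat = feat / feat.norm(dim=-1, keepdim=True)
    scores = feat @ centers.T  # cosine similarity to each pre-computed class center
print(class_names[scores.argmax(dim=-1).item()])
```

In a few-shot setting, the cached centers could also be refined with the small labeled image set (for example, by averaging text and image embeddings per class) before being frozen for deployment.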