Few-shot Food Recognition via Multi-view Representation Learning

Cited by: 28
Authors
Jiang, Shuqiang [1]
Min, Weiqing [1]
Lyu, Yongqiang [2, 3]
Liu, Linhu [1]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, 6 Kexueyuan South Rd, Beijing 100190, Peoples R China
[2] Qingdao KingAgroot Precis Agr Technol Co Ltd, Qingdao, Peoples R China
[3] Shandong Reebow Automat Equipment Co LTD, Qingdao Branch, Room 1901,Bldg 5, Qingdao, Shandong, Peoples R China
Funding
National Natural Science Foundation of China; Beijing Natural Science Foundation
Keywords
Food recognition; few-shot learning; visual recognition; deep learning
DOI
10.1145/3391624
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This article considers the problem of few-shot learning for food recognition. Automatic food recognition can support various applications, e.g., dietary assessment and food journaling. Most existing work focuses on food recognition with large numbers of labelled samples and fails to recognize food categories with few samples. To address this problem, we propose a Multi-View Few-Shot Learning (MVFSL) framework to explore additional ingredient information for few-shot food recognition. Besides category-oriented deep visual features, we introduce an ingredient-supervised deep network to extract ingredient-oriented features. As general and intermediate attributes of food, ingredient-oriented features are informative and complementary to category-oriented features, and thus they play an important role in improving food recognition. Particularly in few-shot food recognition, ingredient information can bridge the gap between disjoint training categories and test categories. To take advantage of ingredient information, we fuse these two kinds of features by first combining the feature maps from their respective deep networks and then convolving the combined feature maps. This convolution is further incorporated into a multi-view relation network, which is capable of comparing pairwise images to enable fine-grained feature learning. MVFSL is trained in an end-to-end fashion for joint optimization of the two types of feature learning subnetworks and the relation subnetworks. Extensive experiments on different food datasets have consistently demonstrated the advantage of MVFSL in multi-view feature fusion. Furthermore, we extend two other types of networks, namely, Siamese Network and Matching Network, by introducing ingredient information for few-shot food recognition. Experimental results have also shown that introducing ingredient information into these two networks can improve the performance of few-shot food recognition.
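The fusion step described in the abstract (combine the two feature maps, convolve the result, then compare image pairs) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The abstract does not specify kernel sizes, layer counts, or pooling, so the channel-wise concatenation, the 1x1 convolution, the global average pooling, and the two-layer scoring function below are all assumptions, and the names `fuse_views` and `relation_score` are hypothetical. Random arrays stand in for the outputs of the category-oriented and ingredient-oriented networks.

```python
import numpy as np


def fuse_views(category_map, ingredient_map, weight):
    """Fuse two (C, H, W) feature maps into one (C_out, H, W) map.

    Assumption: "combining" is channel-wise concatenation and "convolving"
    is a 1x1 convolution, i.e. a per-pixel linear map over channels.
    """
    combined = np.concatenate([category_map, ingredient_map], axis=0)  # (2C, H, W)
    fused = np.einsum("oc,chw->ohw", weight, combined)                 # (C_out, H, W)
    return np.maximum(fused, 0.0)                                      # ReLU


def relation_score(support_fused, query_fused, w_hidden, w_out):
    """Score how related a support image and a query image are, in (0, 1).

    Assumption: the pair is concatenated along channels, average-pooled
    over space, and passed through a small two-layer scorer with a sigmoid.
    """
    pair = np.concatenate([support_fused, query_fused], axis=0)  # (2*C_out, H, W)
    pooled = pair.mean(axis=(1, 2))                               # (2*C_out,)
    hidden = np.maximum(w_hidden @ pooled, 0.0)                   # (hidden_dim,)
    logit = float(w_out @ hidden)
    return 1.0 / (1.0 + np.exp(-logit))


# Toy sizes and random stand-ins for real network outputs and learned weights.
rng = np.random.default_rng(0)
C, H, W, C_out, hidden_dim = 8, 5, 5, 4, 6

fusion_weight = 0.1 * rng.standard_normal((C_out, 2 * C))
w_hidden = 0.1 * rng.standard_normal((hidden_dim, 2 * C_out))
w_out = 0.1 * rng.standard_normal(hidden_dim)

support_category, support_ingredient = rng.standard_normal((2, C, H, W))
query_category, query_ingredient = rng.standard_normal((2, C, H, W))

fused_support = fuse_views(support_category, support_ingredient, fusion_weight)
fused_query = fuse_views(query_category, query_ingredient, fusion_weight)
score = relation_score(fused_support, fused_query, w_hidden, w_out)

print(fused_support.shape, score)
```

In a few-shot episode, a score of this kind would be computed between the query image and each support class, and the query assigned to the class with the highest score. In the end-to-end setting the abstract describes, the fusion and scoring weights would be learned jointly with the two feature networks rather than sampled at random as they are here.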
Pages: 20