Zero-shot counting with a dual-stream neural network model

被引:1
|
作者
Thompson, Jessica A. F. [1 ]
Sheahan, Hannah [1 ]
Dumbalska, Tsvetomira [1 ]
Sandbrink, Julian D. [1 ]
Piazza, Manuela [2 ]
Summerfield, Christopher [1 ]
机构
[1] Univ Oxford, Dept Expt Psychol, Oxford OX2 6GG, England
[2] Univ Trento, Dept Psychol & Cognit Sci, I-38068 Trento, Italy
基金
欧洲研究理事会;
关键词
SUPERIOR COLLICULUS; PRIMATE PARIETAL; NUMBER; CORTEX; INFORMATION; NUMEROSITY; DORSAL; EMERGENCE; ACALCULIA; DECISION;
D O I
10.1016/j.neuron.2024.10.008
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
To understand a visual scene, observers need to both recognize objects and encode relational structure. For example, a scene comprising three apples requires the observer to encode concepts of "apple"and "three."In the primate brain, these functions rely on dual (ventral and dorsal) processing streams. Object recognition in primates has been successfully modeled with deep neural networks, but how scene structure (including numerosity) is encoded remains poorly understood. Here, we built a deep learning model, based on the dual-stream architecture of the primate brain, which is able to count items "zero-shot"-even if the objects themselves are unfamiliar. Our dual-stream network forms spatial response fields and lognormal number codes that resemble those observed in the macaque posterior parietal cortex. The dual-stream network also makes successful predictions about human counting behavior. Our results provide evidence for an enactive theory of the role of the posterior parietal cortex in visual scene understanding.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Dual-Stream Contrastive Learning for Compositional Zero-Shot Recognition
    Yang, Yanhua
    Pan, Rui
    Li, Xiangyu
    Yang, Xu
    Deng, Cheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1909 - 1919
  • [2] Dual-stream generative adversarial networks for distributionally robust zero-shot learning
    Liu, Huan
    Yao, Lina
    Zheng, Qinghua
    Luo, Minnan
    Zhao, Hongke
    Lyu, Yanzhang
    INFORMATION SCIENCES, 2020, 519 : 407 - 422
  • [3] Zero-Shot Object Counting
    Xu, Jingyi
    Le, Hieu
    Nguyen, Vu
    Ranjan, Viresh
    Samaras, Dimitris
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15548 - 15557
  • [4] Dual-stream VO: Visual Odometry Based on LSTM Dual-Stream Convolutional Neural Network
    Luo, Yuan
    Zeng, YongChao
    Lv, RunZhe
    Wang, WenHao
    ENGINEERING LETTERS, 2022, 30 (03) : 926 - 934
  • [5] Dual Part Discovery Network for Zero-Shot Learning
    Ge, Jiannan
    Xie, Hongtao
    Min, Shaobo
    Li, Pandeng
    Zhang, Yongdong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3244 - 3252
  • [6] Dual triplet network for image zero-shot learning
    Ji, Zhong
    Wang, Hai
    Pang, Yanwei
    Shao, Ling
    NEUROCOMPUTING, 2020, 373 : 90 - 97
  • [7] Dual-verification network for zero-shot learning
    Zhang, Haofeng
    Long, Yang
    Yang, Wankou
    Shao, Ling
    INFORMATION SCIENCES, 2019, 470 : 43 - 57
  • [8] Dual-Stream Recurrent Neural Network for Video Captioning
    Xu, Ning
    Liu, An-An
    Wong, Yongkang
    Zhang, Yongdong
    Nie, Weizhi
    Su, Yuting
    Kankanhalli, Mohan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (08) : 2482 - 2493
  • [9] Dual-stream reinforcement network for few-shot image segmentation
    Tang, Mingwei
    Zhu, Lin
    Xu, Yangsheng
    Zhao, Mingfeng
    DIGITAL SIGNAL PROCESSING, 2023, 134
  • [10] Dual Expert Distillation Network for Generalized Zero-Shot Learning
    Rao, Zhijie
    Guo, Jingcai
    Lu, Xiaocheng
    Liang, Jingming
    Zhang, Jie
    Wang, Haozhao
    Wei, Kang
    Cao, Xiaofeng
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 4833 - 4841