Top-down modulated model for object recognition in different categorisation levels

被引:2
作者
Sharifizadeh, Fatemeh [1 ]
Ganjtabesh, Mohammad [1 ]
Nowzari-Dalini, Abbas [1 ]
机构
[1] Univ Tehran, Sch Math Stat & Comp Sci, Dept Comp Sci, Coll Sci, Tehran, Iran
基金
美国国家科学基金会;
关键词
object recognition; categorisation levels; computational models; bottom-up processing; top-down signals; BOTTOM-UP; INFORMATION; PLASTICITY; VISION; CORTEX;
D O I
10.1504/IJBIC.2021.117428
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The human visual system contains a hierarchical sequence of modules that take part in visual perception at superordinate, basic, and subordinate categorisation levels. The top-down signals facilitate the bottom-up processing of visual information in the cortical analysis of object recognition. We propose a novel computational model for object recognition in different categorisation levels, which mimics the effects of top-down signals in the hierarchical processing of the visual system. The top-down signal is incorporated in bottom-up processing of input image to increase the biological plausibility of our model as well as its efficiency for the object recognition in different categorisation levels. The top-down signals provide a pre-knowledge about the input space, which can help to solve the complex object recognition tasks. The performance of our model is evaluated by various appraisal criteria with three benchmark datasets and significant improvement in recognition accuracy of our proposed model is achieved in all experiments.
引用
收藏
页码:13 / 26
页数:14
相关论文
共 34 条
  • [1] Ashtiani MN, 2017, FRONT PSYCHOL, V8, DOI [10.3389/Ipsyg.2017.01261, 10.3389/fpsyg.2017.01261]
  • [2] Top-down facilitation of visual recognition
    Bar, M
    Kassam, KS
    Ghuman, AS
    Boshyan, J
    Schmidt, AM
    Dale, AM
    Hämäläinen, MS
    Marinkovic, K
    Schacter, DL
    Rosen, BR
    Halgren, E
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (02) : 449 - 454
  • [3] Integrated model of visual processing
    Bullier, J
    [J]. BRAIN RESEARCH REVIEWS, 2001, 36 (2-3) : 96 - 107
  • [4] Bottom-up and top-down modulation of multisensory integration
    Choi, Ilsong
    Lee, Jae-Yun
    Lee, Seung-Hee
    [J]. CURRENT OPINION IN NEUROBIOLOGY, 2018, 52 : 115 - 122
  • [5] Early and late effects of objecthood and spatial frequency on event-related potentials and gamma band activity
    Craddock, Matt
    Martinovic, Jasna
    Mueller, Matthias M.
    [J]. BMC NEUROSCIENCE, 2015, 16
  • [6] Neocognitron for handwritten digit recognition
    Fukushima, K
    [J]. NEUROCOMPUTING, 2003, 51 : 161 - 180
  • [7] Towards a unified theory of neocortex: laminar cortical circuits for vision and cognition
    Grossberg, Stephen
    [J]. COMPUTATIONAL NEUROSCIENCE: THEORETICAL INSIGHTS INTO BRAIN FUNCTION, 2007, 165 : 79 - 104
  • [8] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [9] Interaction of bottom-up and top-down processes in the perception of ambiguous figures
    Intaite, Monika
    Noreika, Valdas
    Soliunas, Alvydas
    Falter, Christine M.
    [J]. VISION RESEARCH, 2013, 89 : 24 - 31
  • [10] Caffe: Convolutional Architecture for Fast Feature Embedding
    Jia, Yangqing
    Shelhamer, Evan
    Donahue, Jeff
    Karayev, Sergey
    Long, Jonathan
    Girshick, Ross
    Guadarrama, Sergio
    Darrell, Trevor
    [J]. PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 675 - 678