Top-down modulated model for object recognition in different categorisation levels

被引：2

作者：

Sharifizadeh, Fatemeh ^{[1
]}

Ganjtabesh, Mohammad ^{[1
]}

Nowzari-Dalini, Abbas ^{[1
]}

机构：

[1] Univ Tehran, Sch Math Stat & Comp Sci, Dept Comp Sci, Coll Sci, Tehran, Iran

来源：

INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION | 2021年 / 18卷 / 01期

基金：

美国国家科学基金会;

关键词：

object recognition; categorisation levels; computational models; bottom-up processing; top-down signals; BOTTOM-UP; INFORMATION; PLASTICITY; VISION; CORTEX;

D O I：

10.1504/IJBIC.2021.117428

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The human visual system contains a hierarchical sequence of modules that take part in visual perception at superordinate, basic, and subordinate categorisation levels. The top-down signals facilitate the bottom-up processing of visual information in the cortical analysis of object recognition. We propose a novel computational model for object recognition in different categorisation levels, which mimics the effects of top-down signals in the hierarchical processing of the visual system. The top-down signal is incorporated in bottom-up processing of input image to increase the biological plausibility of our model as well as its efficiency for the object recognition in different categorisation levels. The top-down signals provide a pre-knowledge about the input space, which can help to solve the complex object recognition tasks. The performance of our model is evaluated by various appraisal criteria with three benchmark datasets and significant improvement in recognition accuracy of our proposed model is achieved in all experiments.

引用

页码：13 / 26

页数：14

共 34 条

[1] Ashtiani MN, 2017, FRONT PSYCHOL, V8, DOI [10.3389/Ipsyg.2017.01261, 10.3389/fpsyg.2017.01261]
[2] Top-down facilitation of visual recognition
Bar, M
Kassam, KS
Ghuman, AS
Boshyan, J
Schmidt, AM
Dale, AM
Hämäläinen, MS
Marinkovic, K
Schacter, DL
Rosen, BR
Halgren, E
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (02) : 449 - 454
[3] Integrated model of visual processing
Bullier, J
[J]. BRAIN RESEARCH REVIEWS, 2001, 36 (2-3) : 96 - 107
[4] Bottom-up and top-down modulation of multisensory integration
Choi, Ilsong
Lee, Jae-Yun
Lee, Seung-Hee
[J]. CURRENT OPINION IN NEUROBIOLOGY, 2018, 52 : 115 - 122
[5] Early and late effects of objecthood and spatial frequency on event-related potentials and gamma band activity
Craddock, Matt
Martinovic, Jasna
Mueller, Matthias M.
[J]. BMC NEUROSCIENCE, 2015, 16
[6] Neocognitron for handwritten digit recognition
Fukushima, K
[J]. NEUROCOMPUTING, 2003, 51 : 161 - 180
[7] Towards a unified theory of neocortex: laminar cortical circuits for vision and cognition
Grossberg, Stephen
[J]. COMPUTATIONAL NEUROSCIENCE: THEORETICAL INSIGHTS INTO BRAIN FUNCTION, 2007, 165 : 79 - 104
[8] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[9] Interaction of bottom-up and top-down processes in the perception of ambiguous figures
Intaite, Monika
Noreika, Valdas
Soliunas, Alvydas
Falter, Christine M.
[J]. VISION RESEARCH, 2013, 89 : 24 - 31
[10] Caffe: Convolutional Architecture for Fast Feature Embedding
Jia, Yangqing
Shelhamer, Evan
Donahue, Jeff
Karayev, Sergey
Long, Jonathan
Girshick, Ross
Guadarrama, Sergio
Darrell, Trevor
[J]. PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 675 - 678

← 1 2 3 4 →