Computational Model Based on Neural Network of Visual Cortex for Human Action Recognition

被引:29
作者
Liu, Haihua [1 ,2 ,3 ]
Shu, Na [1 ]
Tang, Qiling [1 ]
Zhang, Wensheng [4 ]
机构
[1] South Cent Univ Nationalities, Sch Biomed Engn, Wuhan 430074, Hubei, Peoples R China
[2] Key Lab Cognit Sci State Ethn Affairs Commiss, Wuhan 430074, Hubei, Peoples R China
[3] Hubei Key Lab Med Informat Anal & Tumor Diag & Tr, Wuhan 430074, Hubei, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; classical receptive field (RF); spiking neural networks (SNNs); surround suppression; visual cortex; CELL RECEPTIVE-FIELDS; SPATIOTEMPORAL ORGANIZATION; MOTION; FEATURES; ARCHITECTURE; ENHANCEMENT; SUPPRESSION; SELECTIVITY; DYNAMICS;
D O I
10.1109/TNNLS.2017.2669522
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a bioinspired model for human action recognition through modeling neural mechanisms of information processing in two visual cortical areas: the primary visual cortex (V1) and the middle temporal cortex (MT) dedicated to motion. This model, named V1-MT, is composed of V1 and MT models (layers) corresponding to their cortical areas, which are built with layered spiking neural networks (SNNs). Some neuron properties in V1 and MT, such as direction and speed selectivity, spatiotemporal inseparability, and center surround suppression, are integrated into SNNs. Based on speed and direction selectivity, V1 and MT models contain multiple SNN channels, each of which processes motion information in sequences with spatiotemporal tunings of neurons at a certain speed and different directions. Therefore, we propose two operations, input signal perceiving with 3-D Gabor filters and surround inhibition processing with 3-D differences of Gaussian functions, to perform this task according to the spatiotemporal inseparability and center surround suppression of neurons. Then, neurons are modeled with our simplified integrate-and-fire model and motion information is transformed into spike trains. Afterward, we define a new feature vector: a mean motion map computed from spike trains in all channels to represent human actions. Finally, a support vector machine is trained to classify actions represented by the feature vectors. We conducted extensive experiments on public action databases, and the results show that our model outperforms other bioinspired models and rivals the state-of-the-art approaches.
引用
收藏
页码:1427 / 1440
页数:14
相关论文
共 55 条
[1]  
Al Ghamdi M, 2012, LECT NOTES COMPUT SC, V7583, P301, DOI 10.1007/978-3-642-33863-2_30
[2]  
[Anonymous], 2008, 2008 IEEE C COMP VIS, DOI DOI 10.1109/CVPR.2008.4587730
[3]   A CORF computational model of a simple cell that relies on LGN input outperforms the Gabor function model [J].
Azzopardi, George ;
Petkov, Nicolai .
BIOLOGICAL CYBERNETICS, 2012, 106 (03) :177-189
[4]   A fast biologically inspired algorithm for recurrent motion estimation [J].
Bayerl, Pierre ;
Neumann, Heiko .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (02) :246-260
[5]   Brightness induction: Rate enhancement and neuronal synchronization as complementary codes [J].
Biederlack, Julia ;
Castelo-Branco, Miguel ;
Neuenschwander, Sergio ;
Wheeler, Diek W. ;
Singer, Wolf ;
Nikolic, Danko .
NEURON, 2006, 52 (06) :1073-1083
[6]   Perception of human motion [J].
Blake, Randolph ;
Shiffrar, Maggie .
ANNUAL REVIEW OF PSYCHOLOGY, 2007, 58 :47-73
[7]   Critical features for the recognition of biological motion [J].
Casile, A ;
Giese, MA .
JOURNAL OF VISION, 2005, 5 (04) :348-360
[8]   UNCERTAINTY RELATION FOR RESOLUTION IN SPACE, SPATIAL-FREQUENCY, AND ORIENTATION OPTIMIZED BY TWO-DIMENSIONAL VISUAL CORTICAL FILTERS [J].
DAUGMAN, JG .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1985, 2 (07) :1160-1169
[9]   SPATIOTEMPORAL ORGANIZATION OF SIMPLE-CELL RECEPTIVE-FIELDS IN THE CATS STRIATE CORTEX .1. GENERAL-CHARACTERISTICS AND POSTNATAL-DEVELOPMENT [J].
DEANGELIS, GC ;
OHZAWA, I ;
FREEMAN, RD .
JOURNAL OF NEUROPHYSIOLOGY, 1993, 69 (04) :1091-1117
[10]   The high-conductance state of neocortical neurons in vivo [J].
Destexhe, A ;
Rudolph, M ;
Paré, D .
NATURE REVIEWS NEUROSCIENCE, 2003, 4 (09) :739-751