A fast, invariant representation for human action in the visual system

Cited by: 24
Authors
Isik, Leyla [1]
Tacchetti, Andrea [1]
Poggio, Tomaso [1]
Affiliations
[1] MIT, Ctr Brains Minds & Machines, 77 Massachusetts Ave, Cambridge, MA 02139 USA
Funding
National Science Foundation (United States);
Keywords
action recognition; magnetoencephalography; neural decoding; vision; LATERAL OCCIPITOTEMPORAL CORTEX; ABSTRACT ACTION REPRESENTATIONS; BIOLOGICAL MOTION PERCEPTION; EVENT-RELATED POTENTIALS; OBJECT RECOGNITION; TEMPORAL CORTEX; MACAQUE MONKEY; HUMAN-BODY; AREA; MEG;
DOI
10.1152/jn.00642.2017
Chinese Library Classification number
Q189 [Neuroscience];
Discipline classification code
071006;
Abstract
Humans can effortlessly recognize others' actions in the presence of complex transformations, such as changes in viewpoint. Several studies have located the regions in the brain involved in invariant action recognition; however, the underlying neural computations remain poorly understood. We use magnetoencephalography decoding and a data set of well-controlled, naturalistic videos of five actions (run, walk, jump, eat, drink) performed by different actors at different viewpoints to study the computational steps used to recognize actions across complex transformations. In particular, we ask when the brain discriminates between different actions, and when it does so in a manner that is invariant to changes in 3D viewpoint. We measure the latency difference between invariant and noninvariant action decoding when subjects view full videos as well as form-depleted and motion-depleted stimuli. We were unable to detect a difference in decoding latency or temporal profile between invariant and noninvariant action recognition in full videos. However, when either form or motion information is removed from the stimulus set, we observe a decrease and delay in invariant action decoding. Our results suggest that the brain recognizes actions and builds invariance to complex transformations at the same time and that both form and motion information are crucial for fast, invariant action recognition.

NEW & NOTEWORTHY The human brain can quickly recognize actions despite transformations that change their visual appearance. We use neural timing data to uncover the computations underlying this ability. We find that within 200 ms action can be read out of magnetoencephalography data and that this representation is invariant to changes in viewpoint. We find form and motion are needed for this fast action decoding, suggesting that the brain quickly integrates complex spatiotemporal features to form invariant action representations.
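The comparison between noninvariant and invariant decoding described in the abstract can be illustrated with a toy example. The sketch below is not the authors' pipeline: this record does not specify their classifier, preprocessing, or statistics. It assumes synthetic "sensor" data, a nearest-centroid classifier, a simulated signal onset at time index 5, a second viewpoint modeled as a simple gain change, and an arbitrary accuracy threshold of 0.6 for defining onset latency. All function names are hypothetical. A classifier is trained at each time point on one viewpoint and tested either on held-out trials from the same viewpoint (noninvariant) or on trials from the other viewpoint (invariant).

```python
import random

ACTIONS = ["run", "walk", "jump", "eat", "drink"]


def simulate_trials(patterns, view_gain, n_trials, n_times, onset, rng):
    """Synthetic data: data[trial][time] is a list of sensor values.

    Assumption: the action pattern appears at `onset`, scaled by a
    viewpoint-specific gain, on top of unit-variance Gaussian noise.
    """
    data, labels = [], []
    for label, pattern in enumerate(patterns):
        for _ in range(n_trials):
            trial = []
            for t in range(n_times):
                gain = view_gain if t >= onset else 0.0
                trial.append([gain * p + rng.gauss(0.0, 1.0) for p in pattern])
            data.append(trial)
            labels.append(label)
    return data, labels


def centroid(rows):
    """Mean sensor vector over a list of sensor vectors."""
    return [sum(col) / len(rows) for col in zip(*rows)]


def decode_over_time(train, y_train, test, y_test, n_times):
    """Fit and score a nearest-centroid classifier at each time point."""
    classes = sorted(set(y_train))
    accuracies = []
    for t in range(n_times):
        cents = {
            c: centroid([trial[t] for trial, y in zip(train, y_train) if y == c])
            for c in classes
        }
        correct = 0
        for trial, y in zip(test, y_test):
            x = trial[t]
            pred = min(
                classes,
                key=lambda c: sum((a - b) ** 2 for a, b in zip(x, cents[c])),
            )
            correct += pred == y
        accuracies.append(correct / len(test))
    return accuracies


def onset_latency(accuracies, threshold):
    """First time index whose accuracy exceeds `threshold`, else None."""
    for t, acc in enumerate(accuracies):
        if acc > threshold:
            return t
    return None


rng = random.Random(0)
n_sensors, n_times, onset = 20, 12, 5
patterns = [[rng.gauss(0.0, 2.0) for _ in range(n_sensors)] for _ in ACTIONS]

train_a, y_train_a = simulate_trials(patterns, 1.0, 20, n_times, onset, rng)
test_a, y_test_a = simulate_trials(patterns, 1.0, 10, n_times, onset, rng)
test_b, y_test_b = simulate_trials(patterns, 0.8, 10, n_times, onset, rng)

# Noninvariant: train and test on the same viewpoint (held-out trials).
within_view = decode_over_time(train_a, y_train_a, test_a, y_test_a, n_times)
# Invariant: train on viewpoint A, test on viewpoint B.
across_view = decode_over_time(train_a, y_train_a, test_b, y_test_b, n_times)

latency_within = onset_latency(within_view, 0.6)
latency_across = onset_latency(across_view, 0.6)
print("within-view latency:", latency_within)
print("across-view latency:", latency_across)
```

In this toy setup the two latencies coincide by construction, because the simulated pattern is shared across viewpoints. With real recordings, the difference between the two latencies is the quantity of interest, and establishing it would require proper cross-validation and significance testing rather than a fixed threshold.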
Pages: 631-640
Page count: 10