Exploring biological motion perception in two-stream convolutional neural networks

Cited by: 6
Authors
Peng, Yujia [1]
Lee, Hannah [1]
Shu, Tianmin [2, 3]
Lu, Hongjing [1, 2]
Affiliations
[1] Univ Calif Los Angeles, Dept Psychol, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA USA
[3] MIT, Dept Brain & Cognit Sci, E25-618, Cambridge, MA 02139 USA
Keywords
Biological motion; Action recognition; Two-stream convolutional neural network; Local image motion; Inversion effect; Motion congruency; Causal perception; POINT-LIGHT DISPLAYS; VISUAL-PERCEPTION; GENDER RECOGNITION; MECHANISMS; FORM; REPRESENTATION; SENSITIVITY; SELECTIVITY; EMOTION; NEURONS
DOI
10.1016/j.visres.2020.09.005
Chinese Library Classification number
Q189 [Neuroscience]
Subject classification code
071006
Abstract
Visual recognition of biological motion recruits form and motion processes supported by both dorsal and ventral pathways. This neural architecture inspired the two-stream convolutional neural network (CNN) model, which includes a spatial CNN to process appearance information in a sequence of image frames, a temporal CNN to process optical flow information, and a fusion network to integrate the features extracted by the two CNNs and make final decisions about action recognition. In five simulations, we compared the CNN model's performance with classical findings in biological motion perception. The CNNs trained with raw RGB action videos showed weak performance in recognizing point-light actions. Additional transfer training with actions shown in other display formats (e.g., skeletal) was necessary for CNNs to recognize point-light actions. The CNN models exhibited largely viewpoint-dependent recognition of actions, with a limited ability to generalize to viewpoints close to the training views. The CNNs predicted the inversion effect in the presence of global body configuration, but failed to predict the inversion effect driven solely by local motion signals. The CNNs provided a qualitative account of some behavioral results observed in human biological motion perception for fine discrimination tasks with noisy inputs, such as point-light actions with disrupted local motion signals, and walking actions with temporally misaligned motion cues. However, these successes are limited by the CNNs' lack of adaptive integration for form and motion processes, and failure to incorporate specialized mechanisms (e.g., a life detector) as well as top-down influences on biological motion perception.
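The two-stream arrangement described in the abstract (an appearance stream, a motion stream, and a fusion stage that makes the decision) can be illustrated with a toy sketch. This is not the authors' model: the hand-written feature functions below stand in for the spatial and temporal CNNs, frame differencing is only a crude proxy for optical flow, and the weights, frames, and function names are hypothetical values chosen for illustration.

```python
import math

def spatial_features(frames):
    """Appearance stream stand-in: mean pixel intensity of each frame."""
    return [sum(f) / len(f) for f in frames]

def temporal_features(frames):
    """Motion stream stand-in: mean absolute frame-to-frame difference.
    This is a crude proxy for optical flow, not real flow estimation."""
    return [
        sum(abs(b - a) for a, b in zip(prev, cur)) / len(cur)
        for prev, cur in zip(frames, frames[1:])
    ]

def softmax(scores):
    """Convert raw class scores into probabilities that sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def fuse_and_classify(frames, weights, biases):
    """Late fusion: concatenate both streams' features, apply one
    linear layer, then softmax over the action classes."""
    fused = spatial_features(frames) + temporal_features(frames)
    scores = [
        sum(w * x for w, x in zip(row, fused)) + b
        for row, b in zip(weights, biases)
    ]
    return softmax(scores)

# Three flattened 4-pixel grayscale "frames" (illustrative only).
frames = [[0.0, 0.0, 1.0, 1.0], [0.0, 1.0, 1.0, 0.0], [1.0, 1.0, 0.0, 0.0]]
# Hypothetical, untrained weights: 2 classes over 5 fused features
# (3 spatial values followed by 2 temporal values).
weights = [[0.1, 0.1, 0.1, 1.0, 1.0], [0.1, 0.1, 0.1, -1.0, -1.0]]
biases = [0.0, 0.0]
probs = fuse_and_classify(frames, weights, biases)
```

In this sketch the motion features carry the larger weights, so the first class wins whenever there is frame-to-frame change. A fixed weighting of this kind is the sort of non-adaptive integration of form and motion that the abstract names as a limitation.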
Pages: 35-47
Page count: 13