On the Continuity of Rotation Representations in Neural Networks

被引:615
作者
Zhou, Yi [1 ]
Barnes, Connelly [2 ]
Lu, Jingwan [2 ]
Yang, Jimei [2 ]
Li, Hao [3 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] Adobe Res, San Jose, CA USA
[3] Univ Southern Calif, Pinscreen USC Inst Creat Technol, Los Angeles, CA 90007 USA
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
APPROXIMATION; ORDER;
D O I
10.1109/CVPR.2019.00589
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In neural networks, it is often desirable to work with various representations of the same space. For example, 3D rotations can be represented with quaternions or Euler angles. In this paper,we advance a definition of a continuous representation,which can be helpful for training deep neural networks. We relate this to topological concepts such as homeomorphism and embedding. We then investigate what are continuous and discontinuous representations for 2D, 3D, and n-dimensional rotations. We demonstrate that for 3D rotations, all representations are discontinuous in the real Euclidean spaces of four or fewer dimensions. Thus, widely used representations such as quaternions and Euler angles are discontinuous and difficult for neural networks to learn. We show that the 3D rotations have continuous representations in 5D and 6D, which are more suitable for learning. We also present continuous representations for the general case of the n-dimensional rotation group SO(n). While our main focus is on rotations,we also show that our constructions apply to other groups such as the orthogonal group and similarity transforms. We finally present empirical results, which show that our continuous rotation representations outperform discontinuous ones for several practical problems in graphics and vision, including a simple autoencoder sanity test, a rotation estimator for 3D point clouds, and an inverse kinematics solver for 3D human poses.
引用
收藏
页码:5738 / 5746
页数:9
相关论文
共 31 条
[1]  
Allen-Blanchette C., 2014, MOTION INTERPOLATION
[2]  
[Anonymous], 2017, IEEE P COMPUT VIS PA, DOI DOI 10.1109/CVPR.2017.16
[3]  
Baker M. J., MATHS CONVERSION MAT
[4]   UNIVERSAL APPROXIMATION BOUNDS FOR SUPERPOSITIONS OF A SIGMOIDAL FUNCTION [J].
BARRON, AR .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1993, 39 (03) :930-945
[5]  
BELONGIE S, RODRIGUES ROTATION F
[6]  
Bloom D. M., 1979, LINEAR ALGEBRAAND GE
[7]  
Chang Angel X., 2015, arXiv
[8]  
Chen ZX, 2013, MATH COMMUN, V18, P185
[9]  
Csiszar Akos., 2017, 24 INT C MECHATRONIC, P1, DOI DOI 10.1109/M2VIP.2017.8211457
[10]  
Davis DM, 1998, BOL SOC MAT MEX, V4, P115