DCPB: Deformable Convolution based on the Poincaré Ball for Top-view Fisheye Cameras

Cited by: 1
Authors
Wei, Xuan [1 ]
Ran, Zhidan [1 ]
Lu, Xiaobo [1 ]
Affiliations
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
Source
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China;
Keywords
NETWORK;
DOI
10.1109/ICCV51070.2023.01224
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The accuracy of the visual tasks for top-view fisheye cameras is limited by the Euclidean geometry for pose-distorted objects in images. In this paper, we demonstrate the analogy between the fisheye model and the Poincaré ball and that learning the shape of convolution kernels in the Poincaré ball can alleviate the spatial distortion problem. In particular, we propose the Deformable Convolution based on the Poincaré Ball, named DCPB, which conducts the Graph Convolutional Network (GCN) in the Poincaré ball and calculates the geodesic distances to Poincaré hyperplanes as the offsets and modulation scalars of the modulated deformable convolution. Besides, we explore an appropriate network structure in the baseline with the DCPB. The DCPB markedly improves the neural network's performance. Experimental results on the public dataset THEODORE show that DCPB obtains a higher accuracy, and its efficiency demonstrates the potential for using temporal information in fisheye videos.
Pages: 13262-13271
Page count: 10
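
The abstract refers to geodesic distances to Poincaré hyperplanes. As a minimal illustration only, and not the authors' implementation, the sketch below evaluates the commonly used closed forms for the unit Poincaré ball with curvature -1: Möbius addition, the geodesic distance between two points, and the distance from a point to a hyperplane through a point p with normal direction a. The function names and sample points are hypothetical, and how DCPB turns such distances into offsets and modulation scalars is not specified in this record.

```python
import math


def _dot(u, v):
    """Euclidean inner product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))


def mobius_add(x, y):
    """Mobius addition x (+) y in the unit Poincare ball (curvature -1)."""
    xy, x2, y2 = _dot(x, y), _dot(x, x), _dot(y, y)
    denom = 1 + 2 * xy + x2 * y2
    return [((1 + 2 * xy + y2) * a + (1 - x2) * b) / denom
            for a, b in zip(x, y)]


def poincare_distance(x, y):
    """Geodesic distance between two points strictly inside the unit ball."""
    diff2 = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.acosh(1 + 2 * diff2 / ((1 - _dot(x, x)) * (1 - _dot(y, y))))


def distance_to_hyperplane(x, p, a):
    """Geodesic distance from x to the hyperplane through p with normal a.

    Assumes x and p lie strictly inside the unit ball and a is non-zero.
    """
    z = mobius_add([-c for c in p], x)  # translate so that p maps to the origin
    a_norm = math.sqrt(_dot(a, a))
    return math.asinh(2 * abs(_dot(z, a)) / ((1 - _dot(z, z)) * a_norm))


if __name__ == "__main__":
    x = [0.5, 0.0]        # hypothetical sample point
    origin = [0.0, 0.0]
    normal = [1.0, 0.0]   # hyperplane through the origin, orthogonal to the first axis
    print(poincare_distance(origin, x))
    print(distance_to_hyperplane(x, origin, normal))
```

For a hyperplane through the origin whose normal points along x, the two printed values coincide, because the closest point on that hyperplane is the origin itself.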