Contrastive 3D Shape Completion and Reconstruction for Agricultural Robots Using RGB-D Frames

被引:20
|
作者
Magistri, Federico [1 ]
Marks, Elias [1 ]
Nagulavancha, Sumanth [1 ]
Vizzo, Ignacio [1 ]
Laeebe, Thomas [1 ]
Behley, Jens [1 ]
Halstead, Michael [1 ]
McCool, Chris [1 ]
Stachniss, Cyrill [1 ,2 ,3 ]
机构
[1] Univ Bonn, D-53111 Bonn, Germany
[2] Univ Oxford, Dept Engn Sci, Oxford OX1 2JD, England
[3] Lamas Inst Machine Learning & Artificial Intellig, D-53757 St Augustin, Germany
关键词
Deep learning for visual perception; robotics and automation in agriculture and forestry; RGB-D perception;
D O I
10.1109/LRA.2022.3193239
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Monitoring plants and fruits is important in modern agriculture, with applications ranging from high-throughput phenotyping to autonomous harvesting. Obtaining highly accurate 3D measurements under real agricultural conditions is a challenging task. In this letter, we address the problem of estimating the 3D shape of fruits when only a partial view is available. We propose a pipeline that exploits high-resolution 3D data in the learning phase but only requires a single RGB-D frame to predict the 3D shape of a complete fruit during operation. To achieve this, we first learn a latent space of potential fruit appearances that we can decode into an SDF volume. With the pretrained, frozen decoder, we subsequently learn an encoder that can produce meaningful latent vectors from a single RGB-D frame. The experiments presented in this letter suggest that our approach can predict the 3D shape of whole fruits online, needing only 4 ms for inference. We evaluate our approach in controlled environments and illustrate its deployment in greenhouses without modifications.
引用
收藏
页码:10120 / 10127
页数:8
相关论文
共 50 条
  • [21] Easy and Fast Reconstruction of a 3D Avatar with an RGB-D Sensor
    Mao, Aihua
    Zhang, Hong
    Liu, Yuxin
    Zheng, Yinglong
    Li, Guiqing
    Han, Guoqiang
    SENSORS, 2017, 17 (05)
  • [22] A Benchmark for RGB-D Visual Odometry, 3D Reconstruction and SLAM
    Handa, Ankur
    Whelan, Thomas
    McDonald, John
    Davison, Andrew J.
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 1524 - 1531
  • [23] When 3D Reconstruction Meets Ubiquitous RGB-D Images
    Zhang, Quanshi
    Song, Xuan
    Shao, Xiaowei
    Zhao, Huijing
    Shibasaki, Ryosuke
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 700 - 707
  • [24] Applications of RGB-D data for 3D reconstruction in the indoor environment
    Zou, Runyang
    Ge, Xueshi
    Wang, Geng
    2016 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2016, : 375 - 378
  • [25] A Survey of Indoor 3D Reconstruction Based on RGB-D Cameras
    Zhu, Jinlong
    Gao, Changbo
    Sun, Qiucheng
    Wang, Mingze
    Deng, Zhengkai
    IEEE ACCESS, 2024, 12 : 112742 - 112766
  • [26] STAR3D: Simultaneous Tracking And Reconstruction of 3D Objects Using RGB-D Data
    Ren, Carl Yuheng
    Prisacariu, Victor
    Murray, David
    Reid, Ian
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1561 - 1568
  • [27] Interactive Mapping in 3D Using RGB-D Data
    Vieira, Pedro
    Ventura, Rodrigo
    2012 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR), 2012,
  • [28] Achieving Flexible 3D Reconstruction Volumes for RGB-D and RGB Camera Based Approaches
    Mock, Sebastian
    Lensing, Philipp
    Broll, Wolfgang
    COMPUTER VISION AND GRAPHICS, ICCVG 2016, 2016, 9972 : 221 - 232
  • [29] 3D Camouflaging Object using RGB-D Sensors
    Siddek, Ahmed M.
    Rashwan, Mohsen A.
    Eshrah, Islam A.
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 1232 - 1237
  • [30] 3D Visual SLAM using RGB-D Camera
    Krerngkamjornkit, Rapee
    Simic, Milan
    SMART DIGITAL FUTURES 2014, 2014, 262 : 533 - 544