Semi-automatic image annotation using 3D LiDAR projections and depth camera data

被引:0
|
作者
Li, Pei Yao [1 ]
Parrilla, Nicholas A. [1 ]
Salathe, Marco [1 ]
Joshi, Tenzing H. [1 ]
Cooper, Reynold J. [1 ]
Park, Ki [2 ]
Sudderth, Asa, V [2 ]
机构
[1] Lawrence Berkeley Natl Lab, 1 Cyclotron Rd, Berkeley, CA 94720 USA
[2] Nevada Natl Secur Sites NLV Facil, 232 Energy Way, North Las Vegas, NV 89030 USA
关键词
Computer vision; Object recognition neural networks; LiDAR-assisted image annotation; Nuclear safeguards;
D O I
10.1016/j.anucene.2024.111080
中图分类号
TL [原子能技术]; O571 [原子核物理学];
学科分类号
0827 ; 082701 ;
摘要
Efficient image annotation is necessary to utilize deep learning object recognition neural networks in nuclear safeguards, such as for the detection and localization of target objects like nuclear material containers (NMCs). This capability can help automate the inventory accounting of different types of NMCs within nuclear storage facilities. The conventional manual annotation process is labor-intensive and time-consuming, hindering the rapid deployment of deep learning models for NMC identifications. This paper introduces a novel semiautomatic method for annotating 2D images of nuclear material containers (NMCs) by combining 3D light detection and ranging (LiDAR) data with color and depth camera images collected from a handheld scan system. The annotation pipeline involves an operator manually marking new target objects on a LiDARgenerated map, and projecting these 3D locations to images, thereby automatically creating annotations from the projections. The semi-automatic approach significantly reduces manual efforts and the expertise in image annotation that is required to perform the task, allowing deep learning models to be trained on-site within a few hours. The paper compares the performance of models trained on datasets annotated through various methods, including semi-automatic, manual, and commercial annotation services. The evaluation demonstrates that the semi-automatic annotation method achieves comparable or superior results, with a mean average precision (mAP) above 0.9, showcasing its efficiency in training object recognition models. Additionally, the paper explores the application of the proposed method to instance segmentation, achieving promising results in detecting multiple types of NMCs in various formations.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] SEMI-AUTOMATIC 2D TO 3D IMAGE CONVERSION USING SCALE-SPACE RANDOM WALKS AND A GRAPH CUTS BASED DEPTH PRIOR
    Phan, Raymond
    Rzeszutek, Richard
    Androutsos, Dimitrios
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 865 - 868
  • [2] SEMI-AUTOMATIC 2D TO 3D IMAGE CONVERSION USING A HYBRID RANDOMWALKS AND GRAPH CUTS BASED APPROACH
    Phan, Raymond
    Rzeszutek, Richard
    Androutsos, Dimitrios
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 897 - 900
  • [3] A Semi-Automatic 2D to Stereoscopic 3D Image and Video Conversion System in a Semi-Automated Segmentation Perspective
    Phan, Raymond
    Androutsos, Dimitrios
    STEREOSCOPIC DISPLAYS AND APPLICATIONS XXIV, 2013, 8648
  • [4] Robust Semi-Automatic Depth Map Generation in Unconstrained Images and Video Sequences for 2D to Stereoscopic 3D Conversion
    Phan, Raymond
    Androutsos, Dimitrios
    IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (01) : 122 - 136
  • [5] Automatic Dense Annotation for Monocular 3D Scene Understanding
    Reza, Md Alimoor
    Chen, Kai
    Naik, Akshay
    Crandall, David J.
    Jung, Soon-Heung
    IEEE ACCESS, 2020, 8 : 68852 - 68865
  • [6] Automatic Identification of Drinking Activities at Home using Depth Data From RGB-D Camera
    Tham, Jie Sheng
    Chang, Yoong Choon
    Fauzi, Mohammad Faizal Ahmad
    2014 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS 2014), 2014, : 153 - 158
  • [7] Improvements to Target-Based 3D LiDAR to Camera Calibration
    Huang, Jiunn-Kai
    Grizzle, Jessy W.
    IEEE ACCESS, 2020, 8 : 134101 - 134110
  • [8] PASSIVE DEPTH ACQUISITION FOR 3D IMAGE DISPLAYS
    SATOH, K
    OHTA, Y
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1994, E77D (09) : 949 - 957
  • [9] An automatic pothole detection algorithm using pavement 3D data
    Bosurgi, G.
    Modica, M.
    Pellegrino, O.
    Sollazzo, G.
    INTERNATIONAL JOURNAL OF PAVEMENT ENGINEERING, 2023, 24 (02)
  • [10] 3D Head Trajectory using a Single Camera
    Rougier, Caroline
    Meunier, Jean
    INTERNATIONAL JOURNAL OF FUTURE GENERATION COMMUNICATION AND NETWORKING, 2010, 3 (04): : 43 - 54