A New Method of 3D Scene Recognition from Still Images

被引:0
|
作者
Zheng Li-ming [1 ]
Wang Xing-song [1 ]
机构
[1] Southeast Univ, Sch Mech Engn, Nanjing 210096, Jiangsu, Peoples R China
关键词
Unsupervised learning; monocular visual; 3D scene recognition; superpixels; spectral clustering; CAMERA CALIBRATION; SPECTRAL METHODS; GROUND SURFACE; DISTANCE; KERNEL; RECONSTRUCTION; REPRESENTATION; SHIFT;
D O I
10.1117/12.2064179
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Most methods of monocular visual three dimensional (3D) scene recognition involve supervised machine learning. However, these methods often rely on prior knowledge. Specifically, they learn the image scene as part of a training dataset. For this reason, when the sampling equipment or scene is changed, monocular visual 3D scene recognition may fail. To cope with this problem, a new method of unsupervised learning for monocular visual 3D scene recognition is here proposed. First, the image is made using superpixel segmentation based on the CIELAB color space values L, a, and b and on the coordinate values x and y of pixels, forming a superpixel image with a specific density. Second, a spectral clustering algorithm based on the superpixels' color characteristics and neighboring relationships was used to reduce the dimensions of the superpixel image. Third, the fuzzy distribution density functions representing sky, ground, and facade are multiplied with the segment pixels, where the expectations of these segments are obtained. A preliminary classification of sky, ground, and facade is generated in this way. Fourth, the most accurate classification images of sky, ground, and facade were extracted through the tier-1 wavelet sampling and Manhattan direction feature. Finally, a depth perception map is generated based on the pinhole imaging model and the linear perspective information of ground surface. Here, 400 images of Make3D Image data from the Cornell University website were used to test the algorithm. The experimental results showed that this unsupervised learning method provides a more effective monocular visual 3D scene recognition model than other methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] A Method for 3D Scene Recognition Using Shadow Information and a Single Fixed Viewpoint
    Bamber, David. C.
    Rogers, Jeremy. D.
    Page, Scott F.
    VISUAL INFORMATION PROCESSING XXI, 2012, 8399
  • [42] Combinations of range data and panoramic images - New opportunities in 3D scene modeling
    Klette, R
    Scheibe, K
    COMPUTER GRAPHICS, IMAGING AND VISION: NEW TRENDS, 2005, : 3 - 10
  • [43] Face recognition from 2D and 3D images using 3D Gabor filters
    Wang, YJ
    Chua, CS
    IMAGE AND VISION COMPUTING, 2005, 23 (11) : 1018 - 1028
  • [44] Indoor Scene Recognition from RGB-D Images by Learning Scene Bases
    Wan, Shaohua
    Hu, Changbo
    Aggarwal, J. K.
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3416 - 3421
  • [45] Studies of Vision Recognition of 3D Images
    Huang, Je-Yi
    Fang, Yi-Chin
    Tsai, Chen-Mu
    Chen, Ling-Fei
    IDW'11: PROCEEDINGS OF THE 18TH INTERNATIONAL DISPLAY WORKSHOPS, VOLS 1-3, 2011, : 961 - 962
  • [47] An Effective 3D Instance Map Reconstruction Method Based on RGBD Images for Indoor Scene
    Wu, Heng
    Liu, Yanjie
    Wang, Chao
    Wei, Yanlong
    REMOTE SENSING, 2025, 17 (01)
  • [48] A new method to estimate 3D cell parameters from 2D microscopy images
    Urbaniak, P.
    Wronski, S.
    Tarasiuk, J.
    Lipinski, P.
    Kotwicka, M.
    BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR CELL RESEARCH, 2022, 1869 (09):
  • [49] A new method to estimate 3D cell parameters from 2D microscopy images
    Urbaniak, P.
    Wronski, S.
    Tarasiuk, J.
    Lipinski, P.
    Kotwicka, M.
    BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR AND CELL BIOLOGY OF LIPIDS, 2022, 1869 (09):
  • [50] 3D box method for 3D reconstruction of an object from multi-images
    Alam, J
    Hama, H
    VISION GEOMETRY IX, 2000, 4117 : 81 - 90