A New Method of 3D Scene Recognition from Still Images

被引:0
|
作者
Zheng Li-ming [1 ]
Wang Xing-song [1 ]
机构
[1] Southeast Univ, Sch Mech Engn, Nanjing 210096, Jiangsu, Peoples R China
关键词
Unsupervised learning; monocular visual; 3D scene recognition; superpixels; spectral clustering; CAMERA CALIBRATION; SPECTRAL METHODS; GROUND SURFACE; DISTANCE; KERNEL; RECONSTRUCTION; REPRESENTATION; SHIFT;
D O I
10.1117/12.2064179
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Most methods of monocular visual three dimensional (3D) scene recognition involve supervised machine learning. However, these methods often rely on prior knowledge. Specifically, they learn the image scene as part of a training dataset. For this reason, when the sampling equipment or scene is changed, monocular visual 3D scene recognition may fail. To cope with this problem, a new method of unsupervised learning for monocular visual 3D scene recognition is here proposed. First, the image is made using superpixel segmentation based on the CIELAB color space values L, a, and b and on the coordinate values x and y of pixels, forming a superpixel image with a specific density. Second, a spectral clustering algorithm based on the superpixels' color characteristics and neighboring relationships was used to reduce the dimensions of the superpixel image. Third, the fuzzy distribution density functions representing sky, ground, and facade are multiplied with the segment pixels, where the expectations of these segments are obtained. A preliminary classification of sky, ground, and facade is generated in this way. Fourth, the most accurate classification images of sky, ground, and facade were extracted through the tier-1 wavelet sampling and Manhattan direction feature. Finally, a depth perception map is generated based on the pinhole imaging model and the linear perspective information of ground surface. Here, 400 images of Make3D Image data from the Cornell University website were used to test the algorithm. The experimental results showed that this unsupervised learning method provides a more effective monocular visual 3D scene recognition model than other methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] A DISTRIBUTED APPROACH TO 3D ROAD SCENE RECOGNITION
    FORESTI, G
    MURINO, V
    REGAZZONI, CS
    VERNAZZA, G
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 1994, 43 (02) : 389 - 406
  • [32] Scene recognition for 3D point clouds:a review
    Hao W.
    Zhang W.
    Liang W.
    Xiao Z.
    Jin H.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2022, 30 (16): : 1988 - 2005
  • [33] Deep Learning for 3D Scene Reconstruction and Segmentation from Stereo Images
    Kniaz, Vladimir V.
    Knyaz, Vladimir A.
    Ippolitov, Evgeny, V
    Novikov, Mikhail M.
    Grodzistky, Lev
    Moshkantsev, Petr
    MULTIMODAL SENSING AND ARTIFICIAL INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS II, 2021, 11785
  • [34] 3D SCENE RECONSTRUCTION FROM STEREO IMAGES WITH UNKNOWN EXTRINSIC PARAMETERS
    Goshin, Ye. V.
    Fursov, V. A.
    COMPUTER OPTICS, 2015, 39 (05) : 770 - 776
  • [35] Interactive 3D modeling from multiple images using scene regularities
    Shum, HY
    Szeliski, R
    Baker, S
    Han, M
    Anandan, P
    FOURTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION - WACV'98, PROCEEDINGS, 1998, : 234 - 235
  • [36] Biologically inspired 3D scene depth recovery from stereo images
    Maingreaud, F
    Pissaloux, E
    Leroux, C
    Micaelli, A
    PROCEEDINGS OF THE IEEE-ISIE 2004, VOLS 1 AND 2, 2004, : 721 - 725
  • [37] Optimized 3D Street Scene Reconstruction from Driving Recorder Images
    Zhang, Yongjun
    Li, Qian
    Lu, Hongshu
    Liu, Xinyi
    Huang, Xu
    Song, Chao
    Huang, Shan
    Huang, Jingyi
    REMOTE SENSING, 2015, 7 (07) : 9091 - 9121
  • [38] SynthText3D:synthesizing scene text images from 3D virtual worlds
    Minghui LIAO
    Boyu SONG
    Shangbang LONG
    Minghang HE
    Cong YAO
    Xiang BAI
    ScienceChina(InformationSciences), 2020, 63 (02) : 65 - 78
  • [39] SynthText3D: synthesizing scene text images from 3D virtual worlds
    Minghui Liao
    Boyu Song
    Shangbang Long
    Minghang He
    Cong Yao
    Xiang Bai
    Science China Information Sciences, 2020, 63
  • [40] SynthText3D: synthesizing scene text images from 3D virtual worlds
    Liao, Minghui
    Song, Boyu
    Long, Shangbang
    He, Minghang
    Yao, Cong
    Bai, Xiang
    SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (02)