LangSplat: 3D Language Gaussian Splatting

被引:7
|
作者
Qin, Minghan [1 ]
Li, Wanhua [2 ]
Zhou, Jiawei [1 ]
Wang, Haoqian [1 ]
Pfister, Hanspeter [2 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Harvard Univ, Cambridge, MA USA
关键词
D O I
10.1109/CVPR52733.2024.01895
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Humans live in a 3D world and commonly use natural language to interact with a 3D scene. Modeling a 3D language field to support open-ended language queries in 3D has gained increasing attention recently. This paper introduces LangSplat, which constructs a 3D language field that enables precise and efficient open-vocabulary querying within 3D spaces. Unlike existing methods that ground CLIP language embeddings in a NeRF model, LangSplat advances the field by utilizing a collection of 3D Gaussians, each encoding language features distilled from CLIP, to represent the language field. By employing a tile-based splatting technique for rendering language features, we circumvent the costly rendering process inherent in NeRF. Instead of directly learning CLIP embeddings, LangSplat first trains a scene-wise language autoencoder and then learns language features on the scene-specific latent space, thereby alleviating substantial memory demands imposed by explicit modeling. Existing methods struggle with imprecise and vague 3D language fields, which fail to discern clear boundaries between objects. We delve into this issue and propose to learn hierarchical semantics using SAM, thereby eliminating the need for extensively querying the language field across various scales and the regularization of DINO features. Extensive experimental results show that LangSplat significantly outperforms the previous state-of-the-art method LERF by a large margin. Notably, LangSplat is extremely efficient, achieving a 199 x speedup compared to LERF at the resolution of 1440 x 1080. We strongly recommend readers to check out our video results at https://langsplat.github.io/.
引用
收藏
页码:20051 / 20060
页数:10
相关论文
共 50 条
  • [21] On the Error Analysis of 3D Gaussian Splatting and an Optimal Projection Strategy
    Huang, Letian
    Bai, Jiayang
    Guo, Jie
    Li, Yuanqi
    Guo, Yanwen
    COMPUTER VISION - ECCV 2024, PT XVII, 2025, 15075 : 247 - 263
  • [22] GS-IR: 3D Gaussian Splatting for Inverse Rendering
    Liang, Zhihao
    Zhang, Qi
    Feng, Ying
    Shan, Ying
    Jia, Kui
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 21644 - 21653
  • [23] GauLoc: 3D Gaussian Splatting-based Camera Relocalization
    Xin, Zhe
    Dai, Chengkai
    Li, Ying
    Wu, Chenming
    COMPUTER GRAPHICS FORUM, 2024, 43 (07)
  • [24] Impact of Data Capture Methods on 3D Reconstruction with Gaussian Splatting
    Rangelov, Dimitar
    Waanders, Sierd
    Waanders, Kars
    van Keulen, Maurice
    Miltchev, Radoslav
    JOURNAL OF IMAGING, 2025, 11 (02)
  • [25] Analytic-Splatting: Anti-Aliased 3D Gaussian Splatting via Analytic Integration
    Liang, Zhihao
    Zhang, Qi
    Hu, Wenbo
    Zhu, Lei
    Feng, Ying
    Jia, Kui
    COMPUTER VISION - ECCV 2024, PT XVII, 2025, 15075 : 281 - 297
  • [26] A review of recent advances in 3D Gaussian Splatting for optimization and reconstruction
    Luo, Jie
    Huang, Tianlun
    Wang, Weijun
    Feng, Wei
    IMAGE AND VISION COMPUTING, 2024, 151
  • [27] Gaussian Splatting: 3D Reconstruction and Novel View Synthesis: A Review
    Dalal, Anurag
    Hagen, Daniel
    Robbersmyr, Kjell G.
    Knausgard, Kristian Muri
    IEEE ACCESS, 2024, 12 : 96797 - 96820
  • [28] Superpixel-guided Sampling for Compact 3D Gaussian Splatting
    Kim, Myoung Gon
    Jeong, SeungWon
    Park, Seohyeon
    Han, JungHyun
    30TH ACM SYMPOSIUM ON VIRTUAL REALITY SOFTWARE AND TECHNOLOGY, VRST 2024, 2024,
  • [29] GaussianShader: 3D Gaussian Splatting with Shading Functions for Reflective Surfaces
    Jiang, Yingwenqi
    Tu, Jiadong
    Liu, Yuan
    Gao, Xifeng
    Long, Xiaoxiao
    Wang, Wenping
    Ma, Yuexin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 5322 - 5332
  • [30] 3DGSR: Implicit Surface Reconstruction with 3D Gaussian Splatting
    Lyu, Xiaoyang
    Sun, Yang-Tian
    Huang, Yi-Hua
    Wu, Xiuzhe
    Yang, Ziyi
    Chen, Yilun
    Pang, Jiangmiao
    Qi, Xiaojuan
    ACM TRANSACTIONS ON GRAPHICS, 2024, 43 (06):