LitAR: Visually Coherent Lighting for Mobile Augmented Reality

Cited: 1
Authors
Zhao, Yiqin [1 ]
Ma, Chongyang [2 ]
Huang, Haibin [2 ]
Guo, Tian [1 ]
Affiliations
[1] Worcester Polytech Inst, 100 Inst Rd, Worcester, MA USA
[2] Kuaishou Technol, 6 Shangdi West Rd Haidian, Beijing, Peoples R China
Source
PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES (IMWUT) | 2022 / Vol. 6 / No. 3
Keywords
mobile augmented reality; lighting estimation; 3D vision
DOI
10.1145/3550291
CLC number
TP [Automation Technology, Computer Technology]
Discipline code
0812
Abstract
An accurate understanding of omnidirectional environment lighting is crucial for high-quality virtual object rendering in mobile augmented reality (AR). In particular, to support reflective rendering, existing methods have leveraged deep learning models to estimate lighting or have used physical light probes to capture it, typically represented in the form of an environment map. However, these methods often fail to provide visually coherent details or require additional setup. For example, the commercial framework ARKit uses a convolutional neural network that can generate realistic environment maps; however, the corresponding reflective rendering might not match the physical environment. In this work, we present the design and implementation of a lighting reconstruction framework called LitAR that enables realistic and visually coherent rendering. LitAR addresses several challenges of supporting lighting information for mobile AR. First, to address the spatial variance problem, LitAR uses two-field lighting reconstruction, dividing the lighting reconstruction task into spatial variance-aware near-field reconstruction and directional-aware far-field reconstruction. The resulting environment map allows reflective rendering with correct color tones. Second, LitAR uses two noise-tolerant data capturing policies to ensure data quality: guided bootstrapped movement and motion-based automatic capturing. Third, to bridge the gap between mobile computation capability and the high computation requirements of lighting reconstruction, LitAR employs two novel real-time environment map rendering techniques, multi-resolution projection and anchor extrapolation. These two techniques effectively remove the need for time-consuming mesh reconstruction while maintaining visual quality. Lastly, LitAR provides several knobs that let mobile AR application developers make quality-performance trade-offs in lighting reconstruction. We evaluated LitAR using a small-scale testbed experiment and a controlled simulation. Our testbed-based evaluation shows that LitAR achieves more visually coherent rendering effects than ARKit. Our design of multi-resolution projection significantly reduces point cloud projection time from about 3 seconds to 14.6 milliseconds. Our simulation shows that LitAR, on average, achieves up to 44.1% higher PSNR than Xihe, a recent lighting estimation system, on two complex objects with physically-based materials.
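To make the multi-resolution projection idea in the abstract concrete, below is a minimal Python sketch, not the authors' implementation: it projects an anchor-centered point cloud directly onto equirectangular environment maps at coarse-to-fine resolutions, filling holes in each finer level from the coarser one instead of reconstructing a mesh. The function names (equirect_project, multires_project), the resolution schedule, and the nearest-neighbor hole filling are all assumptions for illustration.

```python
import numpy as np

def equirect_project(points, colors, height, width):
    """Splat 3D points (centered at the observation anchor, none at the
    origin) onto an equirectangular grid of the given resolution."""
    # Unit direction of each point as seen from the anchor.
    d = points / np.linalg.norm(points, axis=1, keepdims=True)
    theta = np.arctan2(d[:, 0], -d[:, 2])           # azimuth in [-pi, pi]
    phi = np.arcsin(np.clip(d[:, 1], -1.0, 1.0))    # elevation in [-pi/2, pi/2]
    u = ((theta + np.pi) / (2 * np.pi) * (width - 1)).astype(int)
    v = ((phi + np.pi / 2) / np.pi * (height - 1)).astype(int)
    env = np.zeros((height, width, 3), dtype=np.float32)
    hit = np.zeros((height, width), dtype=bool)
    env[v, u] = colors   # last write wins; a z-buffer would resolve overlaps
    hit[v, u] = True
    return env, hit

def multires_project(points, colors,
                     levels=((16, 32), (64, 128), (256, 512))):
    """Coarse-to-fine projection: fine levels keep detail where samples
    land; empty pixels fall back to the upsampled coarser level."""
    result = None
    for h, w in levels:   # each level must be an integer multiple of the last
        env, hit = equirect_project(points, colors, h, w)
        if result is not None:
            # Nearest-neighbor upsampling of the coarser map.
            up = np.repeat(np.repeat(result, h // result.shape[0], axis=0),
                           w // result.shape[1], axis=1)
            env[~hit] = up[~hit]
        result = env
    return result

# Example: project 10k random points with random colors.
pts = np.random.randn(10000, 3)
cols = np.random.rand(10000, 3).astype(np.float32)
env_map = multires_project(pts, cols)   # (256, 512, 3) environment map
```

The design intuition this sketch captures: a coarse level is cheap to fill densely and guarantees omnidirectional coverage, while finer levels add detail only where observations exist, so no surface mesh is ever built. The paper's actual pipeline (and its reported 3 s to 14.6 ms speedup) likely involves further optimizations not shown here.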
Pages: 29