VLSI Design of Light-Field Factorization for Dual-Layer Factored Display

被引:0
作者
Chen, Li-De [1 ]
Weng, Li-Qun [2 ,3 ]
Cheng, Hao-Chien [2 ,3 ]
Cheng, An-Yu [2 ,4 ]
Lin, Kai-Ping [1 ]
Huang, Chao-Tsung [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu 30013, Taiwan
[2] Natl Tsing Hua Univ, Hsinchu 30013, Taiwan
[3] Novatek Microelect Corp, Hsinchu 30076, Taiwan
[4] MediaTek Inc, Hsinchu 30078, Taiwan
关键词
Three-dimensional displays; Light fields; Random access memory; Hardware; Bandwidth; Very large scale integration; Visualization; 3-D display; computational display; light field; nonnegative matrix factorization (NMF); NONNEGATIVE MATRIX; ALGORITHMS; RESOLUTION; RANGE;
D O I
10.1109/TVLSI.2024.3414262
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This article introduces a VLSI design for light-field factorization, aimed at enhancing immersive 3-D visual experiences for computational light-field factored displays. The main design challenges are intensive memory-access demands and high computational complexity. Accordingly, we first propose half-block-based factorization (HBBF) and sparse ray sampling (SRS) to reduce DRAM bandwidth by 99% and SRAM size by 74%. Then, we devise integer hybrid quantization (INTH) to cut down computational logic by 41%, leading to improvements in die area and power efficiency. Finally, we fabricated a processor chip that incorporates 75.1 kB of SRAM and 5.9M logic gates using 40-nm CMOS technology. It can operate with three different performance modes: high quality (56.9 MPixel/s at 971 mW), balanced (62.5 MPixel/s at 442 mW), and low power (61.7 MPixel/s at 283 mW). Across these modes, its normalized energy ranges between 4.4 and 16.2 nJ/pixel. This implementation surpasses existing GPU platforms and offers an 85 $\times$ increase in processing speed and a 311 $\times$ reduction in power consumption. We also showcase a real-time computational 3-D display system with this chip, demonstrating its practical efficacy in computational 3-D display technology.
引用
收藏
页码:2093 / 2106
页数:14
相关论文
共 48 条
[1]   SIMULTANEOUS ALGEBRAIC RECONSTRUCTION TECHNIQUE (SART) - A SUPERIOR IMPLEMENTATION OF THE ART ALGORITHM [J].
ANDERSEN, AH ;
KAK, AC .
ULTRASONIC IMAGING, 1984, 6 (01) :81-94
[2]  
Balram Nikhil, 2016, Information Display, V32, P6
[3]   Algorithms and applications for approximate nonnegative matrix factorization [J].
Berry, Michael W. ;
Browne, Murray ;
Langville, Amy N. ;
Pauca, V. Paul ;
Plemmons, Robert J. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) :155-173
[4]   Accelerating decomposition of light field video for compressive multi-layer display [J].
Cao, Xuan ;
Geng, Zheng ;
Li, Tuotuo ;
Zhang, Mei ;
Zhang, Zhaoxing .
OPTICS EXPRESS, 2015, 23 (26) :34007-34022
[5]  
Chen LD, 2018, IEEE ASIAN SOLID STA, P261
[6]  
Cichocki A, 2006, LECT NOTES COMPUT SC, V4029, P548, DOI 10.1007/11785231_58
[7]   Render-based factorization for additive light field display [J].
Deng, Nianchen ;
He, Zhenyi ;
Yang, Xubo .
COMPUTER ANIMATION AND VIRTUAL WORLDS, 2021, 32 (3-4)
[8]   Nonnegative Matrix Factorization: An Analytical and Interpretive Tool in Computational Biology [J].
Devarajan, Karthik .
PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (07)
[9]   Autostereoscopic 3D displays [J].
Dodgson, NA .
COMPUTER, 2005, 38 (08) :31-+
[10]   A 4.29nJ/pixel Stereo Depth Coprocessor With Pixel Level Pipeline and Region Optimized Semi-Global Matching for IoT Application [J].
Dong, Pingcheng ;
Chen, Zhuoyu ;
Li, Zhuoao ;
Fu, Yuzhe ;
Chen, Lei ;
An, Fengwei .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (01) :334-346