Deep Reinforcement Learning-Based Camera Autofocus with Gaussian Process Regression

被引:0
作者
Wei, Li [1 ]
Jiang, Yuankun [2 ]
Li, Chenglin [1 ]
Dai, Wenrui [2 ]
Zou, Junni [2 ]
Xiong, Hongkai [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
来源
2024 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING, VCIP | 2024年
基金
中国国家自然科学基金;
关键词
Autofocus; reinforcement learning; Gaussian process regression; SHAPE;
D O I
10.1109/VCIP63160.2024.10849790
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Autofocus (AF) is a fundamental task in digital image capturing, yet current approaches often exhibit a poor performance in terms of speed or accuracy. In this paper, we propose a deep reinforcement learning (DRL)-based framework for camera AF, by incorporating a pre-trained MobileNet-v2 encoder for image feature extraction and a Gaussian process regression (GPR) module for in-focus prediction. Specially, we formulate AF as a multi-step process, in which the lens movement trajectory is dynamically generated from a DRL-based policy. By leveraging image features extracted from the MobileNet-v2 encoder, the DRL-based policy can efficiently get close to the in-focus position, which enables the focus search in a fast speed, thus overcoming the drawback of slow focusing procedure suffered by traditional search-based AF methods. A GPR-based in-focus predictor is further designed to terminate the DRL's exploration once it is able to provide a reliable prediction on the in-focus position, by exploiting the sharpness metric values of image patches captured at the DRL-generated trajectory of lens positions. In further contrast with the one-step prediction-based methods, we leverage the sequential information brought additionally by the multi-step exploration, enabling to gain a higher prediction accuracy. Experimental results demonstrate that our approach provides a significant improvement compared to both the search-based and prediction-based AF baselines.
引用
收藏
页数:5
相关论文
共 34 条
[1]   CLASS OF ALGORITHMS FOR FAST DIGITAL IMAGE REGISTRATION [J].
BARNEA, DI ;
SILVERMAN, HF .
IEEE TRANSACTIONS ON COMPUTERS, 1972, C 21 (02) :179-+
[2]  
Chan C., 2019, Photography, Mobile, and Immersive Imaging 2019, P1
[3]   A passive auto-focus camera control system [J].
Chen, Chih-Yung ;
Hwang, Rey-Chue ;
Chen, Yu-Ju .
APPLIED SOFT COMPUTING, 2010, 10 (01) :296-303
[4]  
Choi M., 2023, IEEE CVF INT C COMP, p13 112
[5]   COMPARISON OF AUTOFOCUS METHODS FOR AUTOMATED MICROSCOPY [J].
FIRESTONE, L ;
COOK, K ;
CULP, K ;
TALSANIA, N ;
PRESTON, K .
CYTOMETRY, 1991, 12 (03) :195-206
[6]   Learning Single Camera Depth Estimation using Dual-Pixels [J].
Garg, Rahul ;
Wadhwa, Neal ;
Ansari, Sameer ;
Barron, Jonathan T. .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7627-7636
[7]  
Hannah MJ, 1974, Computer Matching of Areas in Stereo Images
[8]   Modified fast climbing search auto-focus algorithm with adaptive step size searching technique for digital camera [J].
He, Jie ;
Zhou, Rongzhen ;
Hong, Zhiliang .
2003, Institute of Electrical and Electronics Engineers Inc. (49)
[9]   Learning to Autofocus [J].
Herrmann, Charles ;
Bowen, Richard Strong ;
Wadhwa, Neal ;
Garg, Rahul ;
He, Qiurui ;
Barron, Jonathan T. ;
Zabih, Ramin .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2227-2236
[10]  
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]