Deep Reinforcement Learning-Based Camera Autofocus with Gaussian Process Regression

被引：0

作者：

Wei, Li ^{[1
]}

Jiang, Yuankun ^{[2
]}

Li, Chenglin ^{[1
]}

Dai, Wenrui ^{[2
]}

Zou, Junni ^{[2
]}

Xiong, Hongkai ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai, Peoples R China

[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China

来源：

2024 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING, VCIP | 2024年

基金：

中国国家自然科学基金;

关键词：

Autofocus; reinforcement learning; Gaussian process regression; SHAPE;

D O I：

10.1109/VCIP63160.2024.10849790

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Autofocus (AF) is a fundamental task in digital image capturing, yet current approaches often exhibit a poor performance in terms of speed or accuracy. In this paper, we propose a deep reinforcement learning (DRL)-based framework for camera AF, by incorporating a pre-trained MobileNet-v2 encoder for image feature extraction and a Gaussian process regression (GPR) module for in-focus prediction. Specially, we formulate AF as a multi-step process, in which the lens movement trajectory is dynamically generated from a DRL-based policy. By leveraging image features extracted from the MobileNet-v2 encoder, the DRL-based policy can efficiently get close to the in-focus position, which enables the focus search in a fast speed, thus overcoming the drawback of slow focusing procedure suffered by traditional search-based AF methods. A GPR-based in-focus predictor is further designed to terminate the DRL's exploration once it is able to provide a reliable prediction on the in-focus position, by exploiting the sharpness metric values of image patches captured at the DRL-generated trajectory of lens positions. In further contrast with the one-step prediction-based methods, we leverage the sequential information brought additionally by the multi-step exploration, enabling to gain a higher prediction accuracy. Experimental results demonstrate that our approach provides a significant improvement compared to both the search-based and prediction-based AF baselines.

引用

页数：5

共 34 条

[1] CLASS OF ALGORITHMS FOR FAST DIGITAL IMAGE REGISTRATION [J].

BARNEA, DI ;

SILVERMAN, HF .

IEEE TRANSACTIONS ON COMPUTERS, 1972, C 21 (02) :179-+

[2]

Chan C., 2019, Photography, Mobile, and Immersive Imaging 2019, P1

[3] A passive auto-focus camera control system [J].

Chen, Chih-Yung ;

Hwang, Rey-Chue ;

Chen, Yu-Ju .

APPLIED SOFT COMPUTING, 2010, 10 (01) :296-303

[4]

Choi M., 2023, IEEE CVF INT C COMP, p13 112

[5] COMPARISON OF AUTOFOCUS METHODS FOR AUTOMATED MICROSCOPY [J].

FIRESTONE, L ;

COOK, K ;

CULP, K ;

TALSANIA, N ;

PRESTON, K .

CYTOMETRY, 1991, 12 (03) :195-206

[6] Learning Single Camera Depth Estimation using Dual-Pixels [J].

Garg, Rahul ;

Wadhwa, Neal ;

Ansari, Sameer ;

Barron, Jonathan T. .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7627-7636

[7]

Hannah MJ, 1974, Computer Matching of Areas in Stereo Images

[8] Modified fast climbing search auto-focus algorithm with adaptive step size searching technique for digital camera [J].

He, Jie ;

Zhou, Rongzhen ;

Hong, Zhiliang .

2003, Institute of Electrical and Electronics Engineers Inc. (49)

[9] Learning to Autofocus [J].

Herrmann, Charles ;

Bowen, Richard Strong ;

Wadhwa, Neal ;

Garg, Rahul ;

He, Qiurui ;

Barron, Jonathan T. ;

Zabih, Ramin .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2227-2236

[10]

Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]

← 1 2 3 4 →