The spectral line confocal sensor offers significant advantages, including submicron-level resolution, fast scanning imaging, multi-parameter measurement, and non-contact operation, and is widely used in industries such as aerospace, military, semiconductors, and new energy. However, maintaining consistent resolution across the sensor's extensive axial measurement range remains challenging because of light-field degradation in the images captured by the area-array CMOS sensor. The resulting non-uniform resolution, particularly at the edges of the range, leads to data distortion and error accumulation, degrading the system's overall accuracy and reliability. To address this limitation, this paper proposes a network architecture that integrates a Vision Transformer (ViT) with a perpendicular-attention parallel CNN, employing deep learning to restore the images captured by the sensor. This approach combines local and global information, extracting image features at multiple scales and enhancing the model's ability to capture information along different directions in the image. As a result, the sensor achieves highly uniform signal quality, enabling accurate 3D reconstruction. The effectiveness of the proposed method is experimentally validated through measurements of periodic structures on wafer and semiconductor samples.
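The core directional idea behind the perpendicular-attention branch can be illustrated with a minimal sketch: self-attention applied independently along the two spatial axes of a feature map, then fused in parallel. The NumPy code below is an assumption-laden illustration only; the function names, the (H, W, C) layout, and the additive fusion are hypothetical and not the paper's actual implementation:

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def axis_attention(x, axis):
    """Scaled dot-product self-attention along one spatial axis
    of an (H, W, C) feature map (axis=0: vertical, axis=1: horizontal)."""
    if axis == 0:
        x = x.transpose(1, 0, 2)                          # (W, H, C): attend over H
    scores = x @ x.transpose(0, 2, 1) / np.sqrt(x.shape[-1])
    out = softmax(scores) @ x                             # attention-weighted sum
    if axis == 0:
        out = out.transpose(1, 0, 2)                      # restore (H, W, C)
    return out

def perpendicular_attention(x):
    # fuse the two perpendicular attention paths (additive fusion assumed here)
    return axis_attention(x, 0) + axis_attention(x, 1)
```

Applying attention along each axis separately keeps the cost linear in the other dimension while still letting each pixel aggregate context across a full row and a full column, which is one plausible way to "capture information in various directions."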