Tire tread detection based on fusion of spatial and frequency domain features

Cited by: 0
Authors
Chen, Qiya [1]
Dong, Yude [1]
Wang, Jinbiao [1]
Yuan, Zhonghang [1]
Affiliations
[1] Hefei Univ Technol, Sch Mech Engn, Baohe Dist 193, Hefei 230009, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Tire tread; feature extraction; feature fusion; data augmentation; object recognition; similarity detection;
DOI
10.1177/09544070241278425
Chinese Library Classification (CLC) number
TH [Machinery and Instrument Industry]
Discipline classification code
0802
Abstract
Tire tread, as a primary feature of tires, presents a formidable challenge in reliable detection due to the vast variety of treads available and the scarcity of publicly available datasets. To address this issue, this paper introduces DCT-ResNet, a neural network designed for adaptive fusion of spatial and frequency domain features. This method overcomes dataset limitations through the use of generative networks and image enhancement techniques. Initially, the network captures spatial features using down-sampling layers and hidden layers. Concurrently, frequency domain features are extracted through the integration of Discrete Cosine Transform (DCT) and a specialized frequency domain network. The final step involves the use of a multi-head self-attention layer to achieve adaptive feature fusion, ensuring the reliable extraction of tire tread features. Experimental results highlight the effectiveness of the proposed approach. The DCT-ResNet network achieves impressive classification accuracies of 99% on the tire tread dataset and 97% on the CIFAR-10 dataset. Additionally, the network demonstrates a level of pattern similarity detection comparable to expert judgments. In adversarial testing, the data augmentation significantly enhances the network's robustness, allowing DCT-ResNet to outperform other methods in resistance to interference. Consequently, the method presented in this paper holds substantial practical significance for the high-reliability detection of tire treads.
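The abstract names two building blocks: frequency-domain features obtained with the Discrete Cosine Transform, and attention-based fusion of spatial and frequency features. The sketch below illustrates both ideas in plain NumPy and is not the authors' DCT-ResNet. The function names (`dct_matrix`, `dct2`, `attention_fuse`) are hypothetical, and the fusion step is a simplification: it uses single-head scaled dot-product self-attention with identity projections instead of the learned multi-head layer the abstract describes.

```python
import numpy as np


def dct_matrix(n: int) -> np.ndarray:
    """Return the orthonormal DCT-II basis matrix of size n x n."""
    k = np.arange(n)[:, None]  # frequency index (rows)
    i = np.arange(n)[None, :]  # sample index (columns)
    mat = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    mat[0, :] = np.sqrt(1.0 / n)  # the DC row uses a different scale factor
    return mat


def dct2(image: np.ndarray) -> np.ndarray:
    """Apply a 2D DCT-II to a 2D array (transform rows, then columns)."""
    h, w = image.shape
    return dct_matrix(h) @ image @ dct_matrix(w).T


def attention_fuse(spatial: np.ndarray, freq: np.ndarray) -> np.ndarray:
    """Fuse two equal-length feature vectors with self-attention.

    Assumption: the two vectors are treated as a two-token sequence and
    the query/key/value projections are the identity (no learned weights).
    """
    tokens = np.stack([spatial, freq])  # shape (2, d)
    scores = tokens @ tokens.T / np.sqrt(tokens.shape[1])
    scores = scores - scores.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=1, keepdims=True)  # row-wise softmax
    return (weights @ tokens).mean(axis=0)  # average the attended tokens


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    patch = rng.random((8, 8))           # stand-in for a grayscale tread patch
    freq_feat = dct2(patch).ravel()      # frequency-domain feature vector
    spatial_feat = patch.ravel()         # stand-in for learned spatial features
    fused = attention_fuse(spatial_feat, freq_feat)
    print(fused.shape)
```

In a trained network the spatial features would come from convolutional layers and the attention projections would be learned, so the weighting between the two domains adapts to the data; here the weights depend only on the raw dot products.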
Pages: 25