Wildlife Images Recognition Method Based on Wasserstein Distance and Correlation Alignment Transfer Learning

被引:0
作者
Zhang, Changchun [1 ]
Li, Dafang [1 ]
Zhang, Junguo [1 ]
机构
[1] School of Technology, Beijing Forestry University, State Key Laboratory of Efficient Production of Forest Resources Key Laboratory of National Forestry and Grassland Administration on Forestry Equipment and Automation, Beijing
来源
Linye Kexue/Scientia Silvae Sinicae | 2024年 / 60卷 / 08期
关键词
correlation alignment; image recognition; transfer learning; Wasserstein distance; wild animal;
D O I
10.11707/j.1001-7488.LYKX20230399
中图分类号
学科分类号
摘要
【Objective】This study aims to address the influence of complex factors such as lighting, background, and shooting scale on the accuracy of wildlife image recognition. 【 Method】 In this study, the wild animal images captured by infrared triggered cameras in the wild were used as the object: 1) Two publicly available wildlife datasets, ENA24 and NACTI, were used to construct disjoint datasets S1 and S2, comprising a total of 11 animal categories and 25 591 images. 2) To tackle domain shift issues, a ResNet50 network was utilized as a feature extraction module to build a domain adversarial network, effectively alleviating domain bias. 3) A representation learning network incorporating Wasserstein distance and correlation alignment was proposed to establish a transfer learning network for feature extraction and recognition, so as to further exploit transferable features. 【Result】The performance of different models in wildlife recognition was evaluated using the average accuracy metric. Results indicated that the average accuracy on 11 wildlife categories for eight models, namely ResNet50, DDC, DCORAL, DAN, DANN, CDAN, HAN, and JTN, was 48.4%, 51.6%, 49.6%, 52.6%, 45.2%, 50.9%, 54.6%, and 53.5%, respectively. Upon enhancing the ResNet50 base model with improved residual modules and introducing a representation learning network incorporating Wasserstein distance and correlation alignment, the average accuracy for 11 wildlife categories was improved by 2.7% compared to the existing best result with the comparative methods.【 Conclusion】 The transfer learning method based on Wasserstein distance and correlation alignment has achieved an average accuracy of 57.3% in wildlife recognition. The introduction of representation learning based on Wasserstein distance and correlation alignment can effectively improve the accuracy of the wildlife recognition model. © 2024 Chinese Society of Forestry. All rights reserved.
引用
收藏
页码:25 / 32
页数:7
相关论文
共 29 条
[1]  
Cheng Z A., Automatic recognition of terrestrial wildlife in inner mongolia based on deep convolution neural network, (2019)
[2]  
Li A Q., Research on automatic recognition method of wildlife monitoring images based on convolutional neural network, (2020)
[3]  
Qi J D, Ma Z T, Zhang D H, Et al., Wildlife image recognition in Miyun District based on BS-ResNeXt-50, Scientia Silvae Sinicae, 59, 8, pp. 112-122, (2023)
[4]  
Xie B, Wang N, Fan Y W., Correlation alignment total variation model and algorithm for style transfer, Journal of Image and Graphics, 25, 2, pp. 241-254, (2020)
[5]  
Chen P, Zhao R, He T, Et al., Unsupervised domain adaptation of bearing fault diagnosis based on join sliced Wasserstein distance, ISA transactions, 129, pp. 504-519, (2022)
[6]  
Ganin Y, Ustinova E, Ajakan H, Et al., Domain-adversarial training of neural networks, Journal of Machine Learning Research, 17, 1, pp. 2096-2030, (2016)
[7]  
He K, Zhang X, Ren S, Et al., Deep residual learning for image recognition, IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, (2016)
[8]  
Long M, Cao Y, Cao Z, Et al., Transferable representation learning with deep adaptation networks, IEEE Transactions on Pattern Analysis and Machine Intelligence, 41, 12, pp. 3071-3085, (2019)
[9]  
Long M, Cao Z, Wang J, Et al., Conditional adversarial domain adaptation, Advances in Neural Information Processing Systems, 31, (2018)
[10]  
Miao Z, Liu Z, Gaynor K M, Et al., Iterative human and automated identification of wildlife images, Nature Machine Intelligence, 3, 10, pp. 885-895, (2021)