Classification algorithm of garbage images based on novel spatial attention mechanism and transfer learning

被引：0

作者：

Gao M. ^{[1
,2
]}

Chen Y. ^{[1
]}

Zhang Z. ^{[1
,3
]}

Feng Y. ^{[1
,4
]}

Fan W. ^{[1
,5
]}

机构：

[1] School of Management Science and Engineering, Dongbei University of Finance and Economics, Dalian

[2] Center for Post-doctoral Studies of Computer Science, Northeastern University, Shenyang

[3] School of Economics and Management, Southwest Jiaotong University, Chengdu

[4] School of Information Technology and Management, University of International Business and Economics, Beijing

[5] Department of Business Analytics, University of Iowa, Iowa City

来源：

Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice | 2021年 / 41卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Attention mechanism; Fine-grained image classification; Garbage classification; Transfer learning;

D O I：

10.12011/SETP2020-1645

中图分类号：

学科分类号：

摘要：

As governments at all levels in China have started to promote mandatory garbage classification, in order to meet the standardization and automated garbage classification in all aspects of classification and recycling need a fine-grained image classification model suitable for cloud deployment with high accuracy and low latency. This article takes advantage of deep transfer learning to establish an end-to-end transfer learning network architecture GANet (garbage neural network). Aiming at the challenges of category confusion and background interference in garbage classification, this paper proposes a new pixel-level spatial attention mechanism PSATT (pixel-level spatial attention). In order to overcome the challenges of multi-class and sample imbalance, this paper proposes a label smoothing regularization loss function. In order to improve convergence speed, model stability and generalization, this paper proposes a stepped OneCycle learning rate control method, and gives a combined use strategy combining Rectified Adam (RAdam) optimization method and stochastic weight averaging. Experiments used the training data which are marked by the Shenzhen garbage classification standard and provided by the "Huawei cloud artificial intelligence competition • garbage classification challenge cup", and verified the significant effect of GANet in the garbage classification problem, and won the national second prize (2nd place). At the same time, the proposed PSATT mechanism is superior to the comparison methods with improvement on different backbone network architectures, and has good versatility. The GANet architecture, PSATT mechanism and training strategies proposed in this paper not only have important engineering reference value, but also have good academic value. © 2021, Editorial Board of Journal of Systems Engineering Society of China. All right reserved.

引用

页码：498 / 512

页数：14

共 29 条

[1] Luo J H, Wu J X., A survey on fine-grained image categorization using deep convolutional features, Acta Automatica Sinica, 43, 8, pp. 1306-1318, (2017)
[2] Guo L H, Niu X Y, Ma J, Et al., Research of face recognition algorithm using the deep tiled convolutional neural networks and Map-Reduce method, Systems Engineering-Theory & Practice, 34, S1, pp. 283-286, (2014)
[3] Pan S J, Yang Q., A survey on transfer learning, IEEE Transactions on Knowledge and Data Engineering, 22, 10, pp. 1345-1359, (2010)
[4] Wu J J, Liu G N, Wang J Y, Et al., Data intelligence: Trends and challenges, Systems Engineering-Theory & Practice, 40, 8, pp. 2116-2149, (2020)
[5] Tan M, Le Q V., EfficientNet: Rethinking model scaling for convolutional neural networks, International Conference on Machine Learning, pp. 6105-6114, (2019)
[6] Szegedy C, Vanhoucke V, Ioffe S, Et al., Rethinking the inception architecture for computer vision, Conference on Computer Vision and Pattern Recognition, pp. 2818-2826, (2016)
[7] Lin T Y, Goyal P, Girshick R, Et al., Focal loss for dense object detection, IEEE Transactions on Pattern Analysis & Machine Intelligence, 42, 2, pp. 318-327, (2020)
[8] Garipov T, Izmailov P, Podoprikhin D, Et al., Loss surfaces, mode connectivity, and fast ensembling of DNNs, Neural Information Processing Systems, pp. 8789-8798, (2018)
[9] Izmailov P, Podoprikhin D, Garipov T, Et al., Averaging weights leads to wider optima and better generalization, Uncertainty in Artificial Intelligence, pp. 876-885, (2018)
[10] Liu L, Jiang H, He P, Et al., On the variance of the adaptive learning rate and beyond, International Conference on Learning Representations, (2020)

← 1 2 3 →