Learning fusion feature representation for garbage image classification model in human-robot interaction

被引:23
作者
Li, Xi [1 ,2 ]
Li, Tian [2 ]
Li, Shaoyi [1 ,2 ]
Tian, Bin [1 ,2 ]
Ju, Jianping [1 ]
Liu, Tingting [1 ]
Liu, Hai [1 ]
机构
[1] Nanchang Inst Sci & Technol, Sch Informat & Artificial Intelligence, Nanchang 330108, Peoples R China
[2] Wuhan Inst Technol, Sch Elect & Informat Engn, Wuhan 430075, Peoples R China
关键词
Infrared imaging; Image classification; Group convolution; Channel shuffle; CBAM; Label smoothing; NETWORK;
D O I
10.1016/j.infrared.2022.104457
中图分类号
TH7 [仪器、仪表];
学科分类号
0804 ; 080401 ; 081102 ;
摘要
Garbage image classification often suffers from three aspect challenges: complex image background, same-shape category, and low-quality image. The existing machine vision methods have excellent learning capabilities. However, they require powerful computational resources. In this work, an efficient garbage image classification network (GScbamKL-Net) is proposed in this work to address the problems mentioned. The proposed network is designed from the following three aspects. First, the new network unit with group convolution and channel shuffle is designed. This unit can significantly reduce the number of parameters of the model and achieve good performance. Second, the CBAM attention mechanism, which can extract key features by weighting the output features in space and channel, is added to the network unit. Furthermore, the LeakyReLu function is introduced as the activation function model. A label smoothing function is constructed as the loss function. It can mitigate the errors and effects of sample imbalance and obtain a good nonlinear transformation effect. The normal garbage images and garbage images using infrared imaging technology were tested respectively. Experimental results show that the proposed GScbam-Net has excellent classification performance while maintaining its lightweight.
引用
收藏
页数:8
相关论文
共 38 条
[31]  
Szegedy C., 2015, PROC CVPR IEEE, P1, DOI DOI 10.1109/CVPR.2015.7298594
[32]   Rethinking the Inception Architecture for Computer Vision [J].
Szegedy, Christian ;
Vanhoucke, Vincent ;
Ioffe, Sergey ;
Shlens, Jon ;
Wojna, Zbigniew .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2818-2826
[33]   A Novel Framework for Trash Classification Using Deep Transfer Learning [J].
Vo, Anh H. ;
Le Hoang Son ;
Minh Thanh Vo ;
Tuong Le .
IEEE ACCESS, 2019, 7 :178631-178639
[34]   CBAM: Convolutional Block Attention Module [J].
Woo, Sanghyun ;
Park, Jongchan ;
Lee, Joon-Young ;
Kweon, In So .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :3-19
[35]  
Yang HS, 2018, ADV MATER SCI ENG, V2018, DOI [10.1155/2018/6464036, 10.1155/2018/5732352]
[36]  
Yang M., 2016, CLASSIFICATION TRASH, P3
[37]   ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices [J].
Zhang, Xiangyu ;
Zhou, Xinyu ;
Lin, Mengxiao ;
Sun, Ran .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6848-6856
[38]   Intelligent garbage classification system based on improve MobileNetV3-Large [J].
Zhao, Yi ;
Huang, Hancheng ;
Li, Zhixiang ;
Yiwang, Huang ;
Lu, Manjie .
CONNECTION SCIENCE, 2022, 34 (01) :1299-1321