COCCI: Context-Driven Clothing Classification Network

Times cited: 0

Authors:

Jiang, Minghua [1,2]

Liu, Shuqing [1]

Shi, Yankang [1]

Du, Chenghu [1]

Tang, Guangyu [1]

Liu, Li [1,2]

Peng, Tao [1,2]

Hu, Xinrong [1,2]

Yu, Feng [1,2]

Affiliations:

[1] Wuhan Text Univ, Sch Comp Sci & Artificial Intelligence, Wuhan 430200, Peoples R China

[2] Engn Res Ctr Hubei Prov Clothing Informat, Wuhan 430200, Peoples R China

Source:

ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT I | 2024 / Vol. 14495

Funding:

National Natural Science Foundation of China;

Keywords:

Clothing classification; Knowledge distillation; Attention mechanism; Apparel parsing and understanding;

DOI:

10.1007/978-3-031-50069-5_7

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Clothing classification serves as a fundamental task for clothing retrieval, clothing recommendation, etc. In this task, there are two inherent challenges: suppressing complex backgrounds outside the clothing region and disentangling the features of shape-similar clothing samples. These challenges arise from insufficient attention to key distinctions of clothing, which hinders the accuracy of clothing classification. In addition, the high computational resource requirement of some complex and large-scale models decreases inference efficiency. To tackle these challenges, we propose a new COntext-driven Clothing ClassIfication network (COCCI), which improves inference accuracy while reducing model complexity. First, we design a self-adaptive attention fusion (SAAF) module to enhance category-exclusive clothing features and prevent misclassification by suppressing ineffective features from confusing image contexts. Second, we propose a novel multi-scale feature aggregation (MSFA) module to establish spatial context correlations by using multi-scale clothing features. This helps disentangle the features of shape-similar clothing samples. Finally, we introduce knowledge distillation to extract reliable teacher knowledge from complex datasets, which helps student models learn clothing features with rich representation information, thereby improving generalization while reducing model complexity. In comparison to state-of-the-art networks trained with one single model, our method achieves SOTA performance on the widely-used clothing classification benchmark.


Pages: 69-80

Page count: 12
