Semantic image representation for image recognition and retrieval using multilayer variational auto-encoder, InceptionNet and low-level image features

被引:1
作者
Giveki, Davar [1 ]
Esfandyari, Sajad [1 ]
机构
[1] Malayer Univ, Dept Comp Engn, Malayer, Iran
关键词
Image representation; Image recognition; Content-based image retrieval; Deep learning; FEATURE FUSION; SCENE; CLASSIFICATION; PERFORMANCE; INFORMATION; ATTENTION; NETWORK; CNN;
D O I
10.1007/s11227-024-06792-5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel image descriptor that enhances performance in image recognition and retrieval by combining deep learning and handcrafted features. Our method integrates high-level semantic features extracted via InceptionResNet-V2 with color and texture features to create a comprehensive representation of image content. The descriptor's effectiveness is demonstrated through extensive experiments across a range of image recognition and retrieval tasks. Our approach is tested on six benchmark datasets, including Corel-1 K, VS, OT, QT, SUN-397, and ILSVRC-2012 for single-label classification, and COCO and NUS-WIDE for multi-label classification, achieving high performances. The results establish that the proposed method is versatile and robust, excelling in single-label and multi-label recognition as well as image retrieval tasks, and outperforms several state-of-the-art methods. This work provides a significant advancement in image representation, with broad applicability in various computer vision domains.
引用
收藏
页数:40
相关论文
共 100 条
[11]  
Chua T-S, 2009, P ACM INT C IM VID R, P1
[12]   Contrastively enforcing distinctiveness for multi-label image classification [J].
Dao, Son D. ;
Zhao, He ;
Phung, Dinh ;
Cai, Jianfei .
NEUROCOMPUTING, 2023, 555
[13]  
Dehghani Mostafa, P MACHINE LEARNING R
[14]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[15]   Semantic Fisher Scores for Task Transfer: Using Objects to Classify Scenes [J].
Dixit, Mandar ;
Li, Yunsheng ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (12) :3102-3118
[16]   Multilayer Convolutional Feature Aggregation Algorithm for Image Retrieval [J].
Dong, Rongsheng ;
Liu, Ming ;
Li, Fengying .
MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019
[17]  
Fein-Ashley J, 2024, Arxiv, DOI arXiv:2409.16897
[18]  
Howard AG, 2017, Arxiv, DOI [arXiv:1704.04861, DOI 10.48550/ARXIV.1704.04861]
[19]   DeepHSAR: Semi-supervised fine-grained learning for multi-label human sexual activity recognition [J].
Gangwar, Abhishek ;
Gonzalez-Castro, Victor ;
Alegre, Enrique ;
Fidalgo, Eduardo ;
Martinez-Mendoza, Alicia .
INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (05)
[20]  
Giveki D., 2022, J Mach Vis Image Process, V9, P81