Sports image retrieval based on hand-drawn sketches

被引:0
作者
Fu, Lijia [1 ]
Aili, Shabaaiti [2 ]
机构
[1] Hanjiang Normal Univ, Sch Phys Educ, Shiyan, Peoples R China
[2] Hanjiang Normal Univ, Sch Educ, Shiyan, Peoples R China
关键词
Sketch retrieval; Sports; Heterogeneous networks; Cross-modal matching; Similarity learning;
D O I
10.1007/s44443-025-00068-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sketch-based sports retrieval provides an intuitive and efficient interactive way for sports video analysis and tactical research. This problem is very challenging, on the one hand, in how to group annotations and, on the other hand, in how to perform cross-modal alignment between sketches and real image shapes. In addition, sketches are usually highly abstract, resulting in weak semantic information expression ability, further affecting the retrieval effect. To address the above problems, we first constructed the largest sports sketch training data to date, including multiple sports such as basketball and football. Secondly, in order to effectively bridge the gap between sketches and real images, we proposed a cross-modal sports retrieval framework based on heterogeneous networks to effectively bridge the visual gap between sketches and videos. In addition, we introduced label word vectors to enhance the feature expression ability of sketches using semantic information and improve the semantic consistency of cross-modal matching. Finally, we designed a joint loss function for similarity learning, which improves retrieval accuracy and enhances model generalization ability by maximizing inter-class similarity and minimizing intra-class similarity. Experimental results show that the proposed method outperforms existing methods in terms of retrieval accuracy and computational efficiency, providing a new intelligent solution for sports analysis.
引用
收藏
页数:14
相关论文
共 35 条
[1]  
Shih H.-C., A survey of content-aware video analysis for sports, IEEE Trans Circuits Syst Video Technol, 28, 5, pp. 1212-1231, (2017)
[2]  
Patel S.H., Kamdar D., Survey on sport video analysis and event detection, Int J Auton Adapt Commun Syst, 18, 1, pp. 67-98, (2025)
[3]  
Liu J., Huang G., Hyyppa J., Li J., Gong X., Jiang X., A survey on location and motion tracking technologies, methodologies and applications in precision sports, Expert Syst Appl, 229, (2023)
[4]  
Qian H., [Retracted] Optimization of Intelligent Management and Monitoring System of Sports Training Hall Based on Internet of Things, Wirel Commun Mob Comput, 2021, 1, (2021)
[5]  
Liang K., Et al., Gunrock 2.0: a user adaptive social conversational system, (2020)
[6]  
Dong J., Et al., ChemDes: an integrated web-based platform for molecular descriptor and fingerprint computation, J Cheminform, 7, pp. 1-10, (2015)
[7]  
Rossetto L., Et al., Interactive video retrieval in the age of deep learning–detailed evaluation of VBS 2019, IEEE Trans Multimedia, 23, pp. 243-256, (2020)
[8]  
Zhou W., Jia J., Jiang W., Huang C., Sketch augmentation-driven shape retrieval learning framework based on convolutional neural networks, IEEE Trans Visual Comput Graphics, 27, 8, pp. 3558-3570, (2020)
[9]  
Xu W., Yuan K., Li W., Ding W., An emerging fuzzy feature selection method using composite entropy-based uncertainty measure and data distribution, IEEE Trans Emerg Top Comput Intell, 7, 1, pp. 76-88, (2022)
[10]  
Liang S., Dai W., Wei Y., Uncertainty learning for noise resistant sketch-based 3d shape retrieval, IEEE Trans Image Process, 30, pp. 8632-8643, (2021)