Frequency-Domain Guided Image Classification With Large Model Assistance

被引:0
作者
Hua, Xia [1 ]
Han, Lei [1 ]
机构
[1] China Univ Petr East China, Dept Phys Educ, Qingdao 266580, Shandong, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Frequency-domain analysis; Sports; Image classification; Training; Accuracy; Convolution; Random forests; Nearest neighbor methods; Image synthesis; Image recognition; Sport image classification; frequency-domain guided; large model assistance;
D O I
10.1109/ACCESS.2024.3500099
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image classification technology has made significant advancements, but methods tailored for specific image classification, such as sport image, remain inadequate. This is primarily constrained by two factors. First, the image quality of sport images varies greatly, and many rare sports have very few images. Second, most sport image classification techniques directly transfer image classification methods from other domains, with few approaches specifically designed based on the unique features of sport images. To address this problem, we devise a Frequency-domain Guided sport image classification method with Large model Assistance, named FGLA. Specifically, we design two main modules for FGLA. The first module combines Fourier Transform and Wavelet Transform to embed the frequency-domain information into the original image as additional channels, which converts the image into a three-dimensional cuboid, incorporating more comprehensive information. This allows the model to assign different weights to each channel, thereby enhancing image classification performance. The second module leverages the powerful image generation capabilities of large models to augment the dataset with more images, especially for rare sports. Additionally, it directly generates frequency-domain images to enhance the generalization of the classification model. We analyze the effectiveness of FGLA across many models and several classic sports datasets. The results indicate that FGLA achieves the highest accuracy in sport image classification. Moreover, this method also demonstrates strong generalization capabilities and can be adapted to other image classification tasks.
引用
收藏
页码:186246 / 186254
页数:9
相关论文
共 27 条
  • [1] Wu X., Zhan C., Lai Y.-K., Cheng M.-M., Yang J., IP102: A large-scale benchmark dataset for insect pest recognition, in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 8779-8788, (2019)
  • [2] Ali M.A., Haque M., Alam S.B., Rahman R., Amin M., Kobashi S., Medical personal protective equipment detection using YOLOV7, in Proc. Int. Conf. Mach. Learn. Cybern. (ICMLC), pp. 242-247, (2023)
  • [3] Lian X., Sports training image blur and deduplication algorithms based on visual sensing, in Proc. 3rd Int. Conf. Distrib. Comput. Electr. Circuits Electron. (ICDCECE), pp. 1-4, (2024)
  • [4] Zhou L., Rivinius M., Johnson C.R., Weiskopf D., Photographic high-dynamic-range scalar visualization, IEEE Trans. Vis. Comput. Graphics, 26, 6, pp. 2156-2167, (2020)
  • [5] Mohan N., Jain V., Performance analysis of support vector machine in diabetes prediction, in Proc. 4th Int. Conf. Electron., Commun. Aerosp. Technol. (ICECA), pp. 1-3, (2020)
  • [6] Zhao W.-L., Wang H., Ngo C.-W., Approximate k-NN graph construction: A generic online approach, IEEE Trans. Multimedia, 24, pp. 1909-1921, (2022)
  • [7] Sotnikov D., Lyly M., Salmi T., Prediction of 2G HTS tape quench behavior by random forest model trained on 2-D FEM simulations, IEEE Trans. Appl. Supercond., 33, 5, pp. 1-5, (2023)
  • [8] Jyothi V., Reddy G.S.P., Kavya M.S., Reddy K.H., Reddy P.S., Study of image forgery detection using scale invariant feature transform, in Proc. Int. Conf. Sustain. Comput. Data Commun. Syst. (ICSCDS), pp. 669-673, (2023)
  • [9] Zhang B., Jiao Y., Ma Z., Li Y., Zhu J., An efficient image matching method using speed up robust features, in Proc. IEEE Int. Conf. Mechatronics Autom., pp. 553-558, (2014)
  • [10] He R., Hu B.-G., Zheng W.-S., Kong X.-W., Robust principal component analysis based on maximum correntropy criterion, IEEE Trans. Image Process., 20, 6, pp. 1485-1494, (2011)