Automatic Annotation of Training Datasets in Computer Vision Using Machine Learning Methods

被引：0

作者：

Zhuravlyov, A. K. ^{[1
]}

Grigorian, K. A. ^{[1
]}

机构：

[1] Kazan Fed Univ, Kazan 420008, Russia

来源：

AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS | 2024年 / 58卷 / SUPPL5期

关键词：

computer vision; machine learning; automatic data annotation; training datasets; image segmentat-ion;

D O I：

10.3103/S0005105525700347

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper addresses the automatic annotation of training datasets in the field of computer vision using machine learning methods. Data annotation is a key stage in the development and training of deep learning models, but creating labeled data often requires significant time and labor. This paper proposes a mechanism for automatic annotation based on the use of convolutional neural networks and active learning methods. The proposed methodology includes the analysis and evaluation of existing approaches to automatic annotation. The effectiveness of the proposed solutions is assessed using publicly available datasets. The results demonstrate that the proposed method significantly reduces the time required for data annotation, although operator intervention is still necessary. The literature review presents an analysis of modern annotation methods and existing automatic systems, providing a better understanding of the context and advantages of the proposed approach. The conclusion discusses the study achievements, its limitations, and possible directions for future research in this field.

引用

页码：S279 / S282

页数：4

共 16 条

[1] Semi-automatic Annotation of Objects in Visual-Thermal Video [J].

Berg, Amanda ;

Johnander, Joakim ;

de Gevigney, Flavie Durand ;

Ahlberg, Jorgen ;

Felberg, Michael .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :2242-2251

[2] Automatic image annotation method based on a convolutional neural network with threshold optimization [J].

Cao, Jianfang ;

Zhao, Aidi ;

Zhang, Zibang .

PLOS ONE, 2020, 15 (09)

[3]

Cityscapes dataset, ABOUT US

[4]

COCO Dataset, About us

[5]

Council J., 2019, The Wall Street Journal, V28

[6]

docs.ultralytics, Ultralytics: YOLOv8 Docs

[7] The PASCAL Visual Object Classes Challenge: A Retrospective [J].

Everingham, Mark ;

Eslami, S. M. Ali ;

Van Gool, Luc ;

Williams, Christopher K. I. ;

Winn, John ;

Zisserman, Andrew .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136

[8] Automatic lung nodule detection using a 3D deep convolutional neural network combined with a multi-scale prediction strategy in chest CTs [J].

Gu, Yu ;

Lu, Xiaoqi ;

Yang, Lidong ;

Zhang, Baohua ;

Yu, Dahua ;

Zhao, Ying ;

Gao, Lixin ;

Wu, Liang ;

Zhou, Tao .

COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 103 :220-231

[9]

Kirillov A., 2023, Segment Anything, P4015, DOI DOI 10.1109/ICCV51070.2023.00371

[10] Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection [J].

Levine, Sergey ;

Pastor, Peter ;

Krizhevsky, Alex ;

Ibarz, Julian ;

Quillen, Deirdre .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2018, 37 (4-5) :421-436

← 1 2 →