APPROACH TO THE AUTOMATIC CREATION OF AN ANNOTATED DATASET FOR THE DETECTION, LOCALIZATION AND CLASSIFICATION OF BLOOD CELLS IN AN IMAGE

Cited by: 1
Authors
Kovalenko, S. M. [1]
Kutsenko, O. S. [2]
Kovalenko, S. V. [3]
Kovalenko, A. S. [4]
Affiliations
[1] Natl Tech Univ, Dept Software Engn & Management Intelligent Techno, Kharkiv Polytech Inst, Kharkiv, Ukraine
[2] Natl Tech Univ, Dept Syst Anal & Informat Analyt Technol, Kharkiv Polytech Inst, Kharkiv, Ukraine
[3] Natl Tech Univ, Dept Syst Anal & Informat Analyt Technol, Kharkiv Polytech Inst, Kharkiv, Ukraine
[4] Natl Tech Univ, Dept Syst Anal & Informat Analyt Technol, Kharkiv Polytech Inst, Kharkiv, Ukraine
Keywords
computer vision; object detection; object localization; digital image processing; classification
DOI
10.15588/1607-3274-2024-1-12
CLC number
TP3 [Computing technology, computer technology]
Subject classification code
0812
Abstract
Context. The paper considers the problem of automating the creation of an annotated dataset for further use in a system for detecting, localizing and classifying blood cells in an image using deep learning. The subject of the research is the processes of digital image processing for object detection and localization.
Objective. The aim of this study is to create a pipeline of digital image processing methods that can automatically generate an annotated set of blood smear images. This set will then be used to train and validate deep learning models, significantly reducing the time required by machine learning specialists.
Method. The proposed approach for object detection and localization is based on digital image processing methods such as filtering, thresholding, binarization, contour detection, and filling. The pipeline for detection and localization includes the following steps: noise reduction; conversion to the HSV color model; defining a mask for white blood cells and platelets; detecting the contours of white blood cells and platelets; determining the coordinates of the upper left and lower right corners of white blood cells and platelets; calculating the area of the region inside the bounding box; saving the obtained data; determining the most common color in the image; filling the contours of leukocytes and platelets with that color; defining a mask for red blood cells; detecting the contours of red blood cells; determining the coordinates of the upper left and lower right corners of red blood cells; calculating the area of the region within the bounding box; entering data about the found objects into a dataframe; saving the dataframe to a .csv file for future use. Given the unlabeled image dataset and the generated .csv file, any researcher should be able to recreate the labeled dataset using image processing libraries.
Results. The developed approach was implemented in software for creating an annotated dataset of blood smear images.
Conclusions. The study proposes and justifies an approach to automatically create a set of annotated data. The pipeline is tested on a set of unlabeled data, and a labeled set is obtained, consisting of cell images and a .csv file with the attributes "file name", "type", "xmin", "ymin", "xmax", "ymax", "area", where "xmin", "ymin", "xmax", "ymax" are the coordinates of the bounding box for each object. The numbers of correctly recognized, incorrectly recognized, and unrecognized objects are counted manually, and metrics are calculated to assess the accuracy and quality of object detection and localization.
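The steps listed under Method can be illustrated with a minimal sketch. This is not the authors' implementation: it uses only the Python standard library, the HSV thresholds in `WBC_RANGE` and `RBC_RANGE` are hypothetical values chosen for a synthetic image, the helper names (`hsv_mask`, `bounding_boxes`, `annotate`, `to_csv`) are invented for illustration, and 4-connected region labelling stands in for contour detection. Noise reduction and the separation of white blood cells from platelets are omitted, and filling paints only the masked pixels rather than whole contours. A real pipeline would more likely rely on an image processing library such as OpenCV.

```python
import colorsys
import csv
import io
from collections import Counter, deque

# Hypothetical HSV ranges (each channel scaled to 0..1); real thresholds depend on the stain.
WBC_RANGE = ((0.60, 0.35, 0.20), (0.85, 1.00, 1.00))  # purple: leukocytes and platelets
RBC_RANGE = ((0.90, 0.15, 0.40), (1.00, 1.00, 1.00))  # pinkish red: erythrocytes
COLUMNS = ["file name", "type", "xmin", "ymin", "xmax", "ymax", "area"]


def hsv_mask(image, lo, hi):
    """Boolean mask: True where a pixel's HSV value lies in [lo, hi] on every channel.

    `image` is a list of rows, each row a list of (r, g, b) tuples with values 0..255.
    """
    mask = []
    for row in image:
        mask_row = []
        for r, g, b in row:
            hsv = colorsys.rgb_to_hsv(r / 255, g / 255, b / 255)
            mask_row.append(all(l <= c <= h for c, l, h in zip(hsv, lo, hi)))
        mask.append(mask_row)
    return mask


def bounding_boxes(mask):
    """Label 4-connected regions and return (xmin, ymin, xmax, ymax) for each one."""
    height, width = len(mask), len(mask[0])
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y0 in range(height):
        for x0 in range(width):
            if not mask[y0][x0] or seen[y0][x0]:
                continue
            seen[y0][x0] = True
            queue = deque([(x0, y0)])
            xmin = xmax = x0
            ymin = ymax = y0
            while queue:  # breadth-first flood fill over the region
                x, y = queue.popleft()
                xmin, xmax = min(xmin, x), max(xmax, x)
                ymin, ymax = min(ymin, y), max(ymax, y)
                for nx, ny in ((x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)):
                    if 0 <= nx < width and 0 <= ny < height and mask[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((nx, ny))
            boxes.append((xmin, ymin, xmax, ymax))
    return boxes


def make_rows(file_name, cell_type, boxes):
    """Turn boxes into records; area counts the pixels inside the inclusive box."""
    return [
        {"file name": file_name, "type": cell_type, "xmin": x0, "ymin": y0,
         "xmax": x1, "ymax": y1, "area": (x1 - x0 + 1) * (y1 - y0 + 1)}
        for x0, y0, x1, y1 in boxes
    ]


def annotate(file_name, image):
    """Detect purple objects first, paint them over, then detect red blood cells."""
    wbc_mask = hsv_mask(image, *WBC_RANGE)
    rows = make_rows(file_name, "WBC_or_platelet", bounding_boxes(wbc_mask))
    # Paint detected pixels with the most common colour so they cannot enter the RBC mask.
    background = Counter(px for row in image for px in row).most_common(1)[0][0]
    filled = [[background if hit else px for px, hit in zip(row, hits)]
              for row, hits in zip(image, wbc_mask)]
    rbc_mask = hsv_mask(filled, *RBC_RANGE)
    return rows + make_rows(file_name, "RBC", bounding_boxes(rbc_mask))


def to_csv(rows):
    """Serialise the records with the attribute names given in the abstract."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    # Synthetic 8x8 "smear": grey background, one purple blob, one pinkish-red blob.
    bg, wbc, rbc = (240, 240, 240), (120, 60, 180), (220, 120, 140)
    img = [[bg] * 8 for _ in range(8)]
    for y in range(1, 3):
        for x in range(1, 4):
            img[y][x] = wbc
    for y in range(4, 7):
        for x in range(5, 7):
            img[y][x] = rbc
    print(to_csv(annotate("smear_001.png", img)))
```

Processing the purple objects first and painting them with the most common colour mirrors the order of steps in the abstract: once leukocytes and platelets are filled in, a second mask only has red blood cells left to find.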
Pages: 128-139
Page count: 12