Using Segmentation to Improve Machine Learning Performance in Human-in-the-Loop Systems

被引：0

作者：

Carneiro, Davide ^{[1
,2
]}

Carvalho, Mariana ^{[1
,2
]}

机构：

[1] Politecn Porto, CIICESI, Escola Super Tecnol Gestao, Porto, Portugal

[2] Univ Minho, Algoritmi Ctr, Dept Informat, Braga, Portugal

来源：

INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2 | 2023年 / 543卷

关键词：

Active learning; Segmentation; Human-in-the-loop;

D O I：

10.1007/978-3-031-16078-3_28

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The expectations of Machine Learning systems are becoming increasingly demanding, namely in what concerns the diversity of applications, the expected accuracy, and the pressure for results. However, there are cases in which Human experts are needed to label the data, which may have a significant cost in terms of human resources and time. In these cases, it is often best to learn on-the-fly, without expecting for the whole data to be labeled. Often, it is desirable to guide the Human annotators into focusing on the more relevant instances: this constitutes the so-called active learning. In this paper we propose an approach in which a clustering algorithm is used to find groups of similar instances. Then, the procedure is guided with the objective of favoring the annotation of the groups that are under-represented in the labeled dataset. Results show that this approach leads to models that are, over time, more accurate and reliable.

引用

页码：413 / 428

页数：16