Active Learning in Multimedia Annotation and Retrieval: A Survey

Cited by: 168

Authors:

Wang, Meng [1]

Hua, Xian-Sheng [1]

Affiliations:

[1] Microsoft Res Asia, Beijing Sigma Ctr, Beijing 100080, Peoples R China

Source:

ACM Transactions on Intelligent Systems and Technology | 2011 / Vol. 2 / No. 2

Keywords:

Algorithms; Experimentation; Human Factors; Active learning; image annotation; video annotation; content-based image retrieval; sample selection; model learning;

DOI:

10.1145/1899412.1899414

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Active learning is a machine learning technique that selects the most informative samples for labeling and uses them as training data. It has been widely explored in the multimedia research community for its ability to reduce human annotation effort. In this article, we provide a survey of efforts to leverage active learning in multimedia annotation and retrieval. We mainly focus on two application domains: image/video annotation and content-based image retrieval. We first briefly introduce the principle of active learning and then analyze the sample selection criteria. We categorize the existing sample selection strategies used in multimedia annotation and retrieval into five criteria: risk reduction, uncertainty, diversity, density, and relevance. We then introduce several classification models used in active learning-based multimedia annotation and retrieval, including semi-supervised learning, multilabel learning, and multiple instance learning. We also discuss several future trends in this research direction, in particular cost analysis of human annotation and large-scale interactive multimedia annotation.
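Of the five selection criteria named in the abstract, uncertainty is the simplest to illustrate. The following is a minimal sketch, not the survey's own formulation: it assumes a binary classifier has already produced a probability for each unlabeled sample, and it picks the samples whose probability lies closest to 0.5. The function names `uncertainty_scores` and `select_most_uncertain` and the toy probabilities are hypothetical, introduced only for illustration.

```python
def uncertainty_scores(probs):
    """Score each sample by how close its predicted probability is to 0.5.

    Assumption: `probs` holds binary-classifier outputs in [0, 1].
    The score is 1.0 at p = 0.5 and 0.0 at p = 0 or p = 1.
    """
    return [1.0 - abs(2.0 * p - 1.0) for p in probs]


def select_most_uncertain(probs, k):
    """Return the indices of the k samples with the highest uncertainty score."""
    scores = uncertainty_scores(probs)
    ranked = sorted(range(len(probs)), key=lambda i: scores[i], reverse=True)
    return ranked[:k]


# Toy pool of unlabeled samples (hypothetical classifier outputs).
pool_probs = [0.95, 0.52, 0.10, 0.45, 0.80]

# Indices of the two samples that would be sent to a human annotator.
selected = select_most_uncertain(pool_probs, 2)
print(selected)
```

In a full active learning loop, the selected samples would be labeled, added to the training set, and the classifier retrained before the next round of selection. The other criteria (risk reduction, diversity, density, relevance) would replace or be combined with the scoring function.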

Pages: 21

References: 86
