Large-scale zero-shot learning in the wild: Classifying zoological illustrations

Cited by: 6

Authors:

Stork, Lise [1]

Weber, Andreas [2]

van den Herik, Jaap [1,3]

Plaat, Aske [1]

Verbeek, Fons [1]

Wolstencroft, Katherine [1]

Affiliations:

[1] Leiden Inst Adv Comp Sci, Niels Bohrweg 1, NL-2333 CA Leiden, Netherlands

[2] Univ Twente, Drienerlolaan 5, NL-7522 NB Enschede, Netherlands

[3] Leiden Ctr Data Sci, Leiden, Netherlands

Source:

ECOLOGICAL INFORMATICS | 2021 / Volume 62

Keywords:

Zero-shot learning; Biodiversity; Natural history; Hierarchical learning; Fine-grained object recognition; Small samples; CLASSIFICATION;

DOI:

10.1016/j.ecoinf.2021.101222

Chinese Library Classification (CLC) number:

Q14 [Ecology (Bioecology)];

Discipline classification code:

071012 ; 0713 ;

Abstract:

In this paper we analyse the classification of zoological illustrations. Historically, zoological illustrations were the modus operandi for the documentation of new species, and now serve as crucial sources for long-term ecological and biodiversity research. By employing computational methods for classification, the data can be made amenable to research. Automated species identification is challenging due to the long-tailed nature of the data, and the millions of possible classes in the species taxonomy. Success commonly depends on large training sets with many examples per class, but images from only a subset of classes are digitally available, and many images are unlabelled, since labelling requires domain expertise. We explore zero-shot learning to address the problem, where features are learned from classes with medium to large samples, which are then transferred to recognise classes with few or no training samples. We specifically explore how distributed, multi-modal background knowledge from data providers, such as the Global Biodiversity Information Facility (GBIF), iNaturalist, and the Biodiversity Heritage Library (BHL), can be used to share knowledge between classes for zero-shot learning. We train a prototypical network for zero-shot classification, and introduce fused prototypes (FP) and hierarchical prototype loss (HPL) to optimise the model. Finally, we analyse the performance of the model for use in real-world applications. The experimental results are encouraging, indicating potential for use of such models in an expert support system, but also express the difficulty of our task, showing a necessity for research into computer vision methods that are able to learn from small samples.

Pages: 15
