Thinking like a naturalist: Enhancing computer vision of citizen science images by harnessing contextual data

被引:60
作者
Terry, J. Christopher D. [1 ,2 ]
Roy, Helen E. [1 ]
August, Tom A. [1 ]
机构
[1] NERC Ctr Ecol & Hydrol, Wallingford, Oxon, England
[2] Univ Oxford, Dept Zool, Oxford, England
来源
METHODS IN ECOLOGY AND EVOLUTION | 2020年 / 11卷 / 02期
基金
英国自然环境研究理事会;
关键词
citizen science; computer vision; convolutional neural network; ladybird; machine learning; metadata; naturalists; species identification; LADYBIRD;
D O I
10.1111/2041-210X.13335
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
The accurate identification of species in images submitted by citizen scientists is currently a bottleneck for many data uses. Machine learning tools offer the potential to provide rapid, objective and scalable species identification for the benefit of many aspects of ecological science. Currently, most approaches only make use of image pixel data for classification. However, an experienced naturalist would also use a wide variety of contextual information such as the location and date of recording. Here, we examine the automated identification of ladybird (Coccinellidae) records from the British Isles submitted to the UK Ladybird Survey, a volunteer-led mass participation recording scheme. Each image is associated with metadata; a date, location and recorder ID, which can be cross-referenced with other data sources to determine local weather at the time of recording, habitat types and the experience of the observer. We built multi-input neural network models that synthesize metadata and images to identify records to species level. We show that machine learning models can effectively harness contextual information to improve the interpretation of images. Against an image-only baseline of 48.2%, we observe a 9.1 percentage-point improvement in top-1 accuracy with a multi-input model compared to only a 3.6% increase when using an ensemble of image and metadata models. This suggests that contextual data are being used to interpret an image, beyond just providing a prior expectation. We show that our neural network models appear to be utilizing similar pieces of evidence as human naturalists to make identifications. Metadata is a key tool for human naturalists. We show it can also be harnessed by computer vision systems. Contextualization offers considerable extra information, particularly for challenging species, even within small and relatively homogeneous areas such as the British Isles. Although complex relationships between disparate sources of information can be profitably interpreted by simple neural network architectures, there is likely considerable room for further progress. Contextualizing images has the potential to lead to a step change in the accuracy of automated identification tools, with considerable benefits for large-scale verification of submitted records.
引用
收藏
页码:303 / 315
页数:13
相关论文
共 48 条
  • [21] A multi-access identification key based on colour patterns in ladybirds (Coleoptera, Coccinellidae)
    Jouveau, Severin
    Delaunay, Mathilde
    Vignes-Lebbe, Regine
    Nattier, Romain
    [J]. ZOOKEYS, 2018, (758) : 55 - 73
  • [22] Kingma D., 2014, arXiv
  • [23] Deeper Depth Prediction with Fully Convolutional Residual Networks
    Laina, Iro
    Rupprecht, Christian
    Belagiannis, Vasileios
    Tombari, Federico
    Navab, Nassir
    [J]. PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 239 - 248
  • [24] A review of unsupervised feature learning and deep learning for time-series modeling
    Langkvist, Martin
    Karlsson, Lars
    Loutfi, Amy
    [J]. PATTERN RECOGNITION LETTERS, 2014, 42 : 11 - 24
  • [25] Mac Aodha O., 2019, PRESENCE ONLY GEOGRA
  • [26] Ant genera identification using an ensemble of convolutional neural networks
    Marques, Alan Caio R.
    Raimundo, Marcos M.
    Cavalheiro, Ellen Marianne B.
    Salles, Luis F. P.
    Lyra, Christiano
    Von Zuben, Fernando J.
    [J]. PLOS ONE, 2018, 13 (01):
  • [27] A survey on image-based insect classification
    Martineau, Chloe
    Conte, Donatello
    Raveaux, Romain
    Arnault, Ingrid
    Munier, Damien
    Venturini, Gilles
    [J]. PATTERN RECOGNITION, 2017, 65 : 273 - 284
  • [28] Effective Training of Convolutional Neural Networks for Insect Image Recognition
    Martineau, Maxime
    Raveaux, Romain
    Chatelain, Clement
    Conte, Donatello
    Venturini, Gilles
    [J]. ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2018, 2018, 11182 : 426 - 437
  • [29] Met Office, 2012, MET OFF INT DAT ARCH
  • [30] Miao Z., 2018, 450189 BIORXIV, P450189, DOI [10.1101/450189, DOI 10.1101/450189]