Subitizing with Variational Autoencoders

被引：0

作者：

Wever, Rijnder ^{[1
]}

Runia, Tom F. H. ^{[1
]}

机构：

[1] Univ Amsterdam, Intelligent Sensory Informat Syst, Amsterdam, Netherlands

来源：

COMPUTER VISION - ECCV 2018 WORKSHOPS, PT III | 2019年 / 11131卷

关键词：

Object counting; Numerosity; Variational autoencoders; VISUAL SENSE; NUMBER; PARIETAL; REPRESENTATION; NUMEROSITY;

D O I：

10.1007/978-3-030-11015-4_47

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Numerosity, the number of objects in a set, is a basic property of a given visual scene. Many animals develop the perceptual ability to subitize: the near-instantaneous identification of the numerosity in small sets of visual items. In computer vision, it has been shown that numerosity emerges as a statistical property in neural networks during unsupervised learning from simple synthetic images. In this work, we focus on more complex natural images using unsupervised hierarchical neural networks. Specifically, we show that variational autoencoders are able to spontaneously perform subitizing after training without supervision on a large amount of images from the Salient Object Subitizing dataset. While our method is unable to outperform supervised convolutional networks for subitizing, we observe that the networks learn to encode numerosity as a basic visual property. Moreover, we find that the learned representations are likely invariant to object area; an observation in alignment with studies on biological neural networks in cognitive neuroscience.

引用

页码：617 / 627

页数：11

共 45 条

[1]

[Anonymous], 2016, Deep learning. vol

[2]

[Anonymous], 2015, arXiv: Learning

[3]

[Anonymous], 2015, 3 INT C LEARN REPR I

[4]

[Anonymous], 2003, ICCV

[5]

[Anonymous], 2015, ICCV

[6]

[Anonymous], 2014, ICLR

[7] Counting in the Wild [J].

Arteta, Carlos ;

Lempitsky, Victor ;

Zisserman, Andrew .

COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 :483-498

[8]

Boski M, 2017, 2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS)

[9] A visual sense of number [J].

Burr, David ;

Ross, John .

CURRENT BIOLOGY, 2008, 18 (06) :425-428

[10] The devil is in the details: an evaluation of recent feature encoding methods [J].

Chatfield, Ken ;

Lempitsky, Victor ;

Vedaldi, Andrea ;

Zisserman, Andrew .

PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,

← 1 2 3 4 5 →