A Sensorimotor Perspective on Contrastive Multiview Visual Representation Learning

被引：0

作者：

Laflaquiere, Alban ^{[1
]}

机构：

[1] AI Lab, SoftBank Robot Europe, F-75015 Paris, France

来源：

IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS | 2022年 / 14卷 / 02期

关键词：

Task analysis; Visualization; Robot sensing systems; Training; Machine learning; Semantics; Deep learning; Artificial perception; contrastive multiview learning; representation learning; sensorimotor; unsupervised learning; CORTEX; EXPERIENCE; MODULATION; TOPOLOGY; AGENTS;

D O I：

10.1109/TCDS.2021.3086267

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The contrastive multiview visual representation learning (CMVRL) framework has recently gained a lot of traction in the unsupervised representation learning literature. Combining a simple data augmentation strategy and a contrastive learning objective, it has been able to generate representations that compare favorably to their supervised counterparts on common downstream visual tasks. The theoretical understanding of this empirical success is currently an active area of research. In this article, we propose a sensorimotor perspective on the various components of the framework. We show how it can be interpreted as building representations that geometrically embed the stable semantic content that a situated agent experiences on short spatiotemporal scales when actively exploring its environment. We also discuss the relevance of the approach in light of contemporary active, dynamical, and hierarchical theories of perception. Finally, we extrapolate this sensorimotor perspective to outline promising future research directions that could push the state of the art further and help better understand how an autonomous agent could develop useful visual representations in an unsupervised fashion.

引用

页码：269 / 278

页数：10

共 102 条

[81]

Schwarzer Max, 2020, PROC INT C LEARN REP

[82] Time-Contrastive Networks: Self-Supervised Learning from Multi-View Observation [J].

Sermanet, Pierre ;

Lynch, Corey ;

Hsu, Jasmine ;

Levine, Sergey .

2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :486-487

[83]

Srinivas A., 2020, Curl: Contrastive unsupervised representations for reinforcement learning

[84]

Terekhov AV, 2014, J IEEE I C DEVELOP L, P368, DOI 10.1109/DEVLRN.2014.6983009

[85]

Tian Yonglong, 2020, Advances in Neural Information Processing Systems, V33

[86]

Tian Yonglong, 2019, P EUR C COMP VIS

[87]

Tschannen M., 2019, On Mutual Information Maximization for Representation Learning

[88]

van den Oord Aaron, 2018, CoRR, DOI 10.48550/arxiv.1807.03748

[89]

Vincent P., 2008, P 25 INT C MACH LEAR, DOI DOI 10.1145/1390156.1390294

[90]

Wang T., 2020, INT C MACHINE LEARNI, P9929

← 2 3 4 5 6 7 8 9 10 11 →