Associative embedding for team discrimination

被引：12

作者：

Istasse, Maxime ^{[1
]}

Moreau, Julien ^{[1
]}

De Vleeschouwer, Christophe ^{[1
]}

机构：

[1] UCLouvain ICTEAM, Louvain La Neuve, Belgium

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019) | 2019年

关键词：

D O I：

10.1109/CVPRW.2019.00303

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Assigning team labels to players in a sport game is not a trivial task when no prior is known about the visual appearance of each team. Our work builds on a Convolutional Neural Network (CNN) to learn a descriptor, namely a pixel-wise embedding vector, that is similar for pixels depicting players from the same team, and dissimilar when pixels correspond to distinct teams. The advantage of this idea is that no per-game learning is needed, allowing efficient team discrimination as soon as the game starts. In principle, the approach follows the associative embedding framework introduced in [22] to differentiate instances of objects. Our work is however different in that it derives the embeddings from a lightweight segmentation network and, more fundamentally, because it considers the assignment of the same embedding to unconnected pixels, as required by pixels of distinct players from the same team. Excellent results, both in terms of team labelling accuracy and generalization to new games/arenas, have been achieved on panoramic views of a large variety of basketball games involving players interactions and occlusions. This makes our method a good candidate to integrate team separation in many CNN-based sport analytics pipelines.

引用

页码：2477 / 2486

页数：10

共 40 条

[1] Sparsity Driven People Localization with a Heterogeneous Network of Cameras [J].

Alahi, Alexandre ;

Jacques, Laurent ;

Boursier, Yannick ;

Vandergheynst, Pierre .

JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2011, 41 (1-2) :39-58

[2]

[Anonymous], 2016, NIPS

[3]

[Anonymous], 2018, BRIT MACH VIS C BMVC

[4] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].

Badrinarayanan, Vijay ;

Kendall, Alex ;

Cipolla, Roberto .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495

[5]

Bialkowski A., 2014, REPRESENTING TEAM BE, P247

[6] Variational Inference for Dirichlet Process Mixtures [J].

Blei, David M. ;

Jordan, Michael I. .

BAYESIAN ANALYSIS, 2006, 1 (01) :121-143

[7]

Carr P, 2012, LECT NOTES COMPUT SC, V7572, P864, DOI 10.1007/978-3-642-33718-5_62

[8] Learning Online Smooth Predictors for Realtime Camera Planning using Recurrent Decision Trees [J].

Chen, Jianhui ;

Le, Hoang M. ;

Carr, Peter ;

Yue, Yisong ;

Little, James J. .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4688-4696

[9] A bottom-up approach based on semantics for the interpretation of the main camera stream in soccer games [J].

Cioppa, A. ;

Deliege, A. ;

Van Droogenbroeck, M. .

PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :1846-1855

[10] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

← 1 2 3 4 →