SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos

Cited by: 79
Authors
Deliege, Adrien [1]
Cioppa, Anthony [1]
Giancola, Silvio [2]
Seikavandi, Meisam J. [3]
Dueholm, Jacob V. [3]
Nasrollahi, Kamal [3, 4]
Ghanem, Bernard [2]
Moeslund, Thomas B. [3]
Van Droogenbroeck, Marc [1]
Affiliations
[1] Univ Liege, Liege, Belgium
[2] KAUST, Thuwal, Saudi Arabia
[3] Aalborg Univ, Aalborg, Denmark
[4] Milestone Syst, Brondby, Denmark
Source
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021 | 2021
Keywords
REPLAY DETECTION;
DOI
10.1109/CVPRW53098.2021.00508
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Understanding broadcast videos is a challenging task in computer vision, as it requires generic reasoning capabilities to appreciate the content offered by the video editing. In this work, we propose SoccerNet-v2, a novel large-scale corpus of manual annotations for the SoccerNet [24] video dataset, along with open challenges to encourage more research in soccer understanding and broadcast production. Specifically, we release around 300k annotations within SoccerNet's 500 untrimmed broadcast soccer videos. We extend current tasks in the realm of soccer to include action spotting, camera shot segmentation with boundary detection, and we define a novel replay grounding task. For each task, we provide and discuss benchmark results, reproducible with our open-source adapted implementations of the most relevant works in the field. SoccerNet-v2 is presented to the broader research community to help push computer vision closer to automatic solutions for more general video understanding and production purposes.
Pages: 4503-4514
Page count: 12
Related papers
89 entries in total
[31]   The "something something" video database for learning and evaluating visual common sense [J].
Goyal, Raghav ;
Kahou, Samira Ebrahimi ;
Michalski, Vincent ;
Materzynska, Joanna ;
Westphal, Susanne ;
Kim, Heuna ;
Haenel, Valentin ;
Fruend, Ingo ;
Yianilos, Peter ;
Mueller-Freitag, Moritz ;
Hoppe, Florian ;
Thurau, Christian ;
Bax, Ingo ;
Memisevic, Roland .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5843-5851
[32]   AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions [J].
Gu, Chunhui ;
Sun, Chen ;
Ross, David A. ;
Vondrick, Carl ;
Pantofaru, Caroline ;
Li, Yeqing ;
Vijayanarasimhan, Sudheendra ;
Toderici, George ;
Ricco, Susanna ;
Sukthankar, Rahul ;
Schmid, Cordelia ;
Malik, Jitendra .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6047-6056
[33]   View-independent action recognition: a hybrid approach [J].
Hashemi, Seyed Mohammad ;
Rahmati, Mohammad .
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (12) :6755-6775
[34]  
He K., 2015, C COMPUTER VISION PA, DOI 10.1109/CVPR.2016.90
[35]  
Hershey S, 2017, INT CONF ACOUST SPEE, P131, DOI 10.1109/ICASSP.2017.7952132
[36]   Sports Field Localization via Deep Structured Models [J].
Homayounfar, Namdar ;
Fidler, Sanja ;
Urtasun, Raquel .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4012-4020
[37]  
Hu YC, 2007, 2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, P1555
[38]   Associative embedding for team discrimination [J].
Istasse, Maxime ;
Moreau, Julien ;
De Vleeschouwer, Christophe .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, :2477-2486
[39]  
Jackman Simeon, 2019, THESIS
[40]  
Jain Hiteshi, 2020, CORR