ASTRA: An Action Spotting TRAnsformer for Soccer Videos

被引:3
|
作者
Xarles, Artur [1 ,2 ]
Escalera, Sergio [1 ,2 ,3 ]
Moeslund, Thomas B. [3 ]
Clapes, Albert [1 ,2 ]
机构
[1] Univ Barcelona, Barcelona, Spain
[2] Comp Vis Ctr, Barcelona, Spain
[3] Aalborg Univ, Aalborg, Denmark
来源
PROCEEDINGS OF THE 6TH INTERNATIONAL WORKSHOP ON MULTIMEDIA CONTENT ANALYSIS IN SPORTS, MMSPORTS 2023 | 2023年
关键词
computer vision; action spotting; transformer encoder-decoder; uncertainty estimation; balanced mixup;
D O I
10.1145/3606038.3616153
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we introduce ASTRA, a Transformer-based model designed for the task of Action Spotting in soccer matches. ASTRA addresses several challenges inherent in the task and dataset, including the requirement for precise action localization, the presence of a long-tail data distribution, non-visibility in certain actions, and inherent label noise. To do so, ASTRA incorporates (a) a Transformer encoder-decoder architecture to achieve the desired output temporal resolution and to produce precise predictions, (b) a balanced mixup strategy to handle the long-tail distribution of the data, (c) an uncertainty-aware displacement head to capture the label variability, and (d) input audio signal to enhance detection of non-visible actions. Results demonstrate the effectiveness of ASTRA, achieving a tight Average-mAP of 66.82 on the test set. Moreover, in the SoccerNet 2023 Action Spotting challenge, we secure the 3rd position with an Average-mAP of 70.21 on the challenge set.
引用
收藏
页码:93 / 102
页数:10
相关论文
共 36 条
  • [1] A Transformer-based System for Action Spotting in Soccer Videos
    Zhu, He
    Liang, Junwei
    Lin, Chengzhi
    Zhang, Jun
    Hu, Jianming
    PROCEEDINGS OF THE 5TH ACM INTERNATIONAL WORKSHOP ON MULTIMEDIA CONTENT ANALYSIS IN SPORTS, MMSPORTS 2022, 2022, : 103 - 109
  • [2] SpotFormer: A Transformer-based Framework for Precise Soccer Action Spotting
    Cao, Mengqi
    Yang, Min
    Zhang, Guozhen
    Li, Xiaotian
    Wu, Yilu
    Wu, Gangshan
    Wang, Limin
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [3] TEMPORALLY PRECISE ACTION SPOTTING IN SOCCER VIDEOS USING DENSE DETECTION ANCHORS
    Soares, Joao V. B.
    Shah, Avijit
    Biswas, Topojoy
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2796 - 2800
  • [4] Soccer-CLIP: Vision Language Model for Soccer Action Spotting
    Shin, Yoonho
    Park, Sanghoon
    Han, Youngsub
    Jeon, Byoung-Ki
    Lee, Soonyoung
    Kang, Byung Jun
    IEEE ACCESS, 2025, 13 : 44354 - 44365
  • [5] Action Search: Spotting Actions in Videos and Its Application to Temporal Action Localization
    Alwassel, Humam
    Heilbron, Fabian Caba
    Ghanem, Bernard
    COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 253 - 269
  • [6] OSL-ActionSpotting: A Unified Library for Action Spotting in Sports Videos
    Benzakour, Yassine
    Cabado, Bruno
    Giancola, Silvio
    Cioppa, Anthony
    Ghanem, Bernard
    Van Droogenbroeck, Marc
    2024 IEEE INTERNATIONAL WORKSHOP ON SPORT, TECHNOLOGY AND RESEARCH, STAR 2024, 2024, : 132 - 137
  • [7] A Graph-Based Method for Soccer Action Spotting Using Unsupervised Player Classification
    Cartas, Alejandro
    Ballester, Coloma
    Haro, Gloria
    PROCEEDINGS OF THE 5TH ACM INTERNATIONAL WORKSHOP ON MULTIMEDIA CONTENT ANALYSIS IN SPORTS, MMSPORTS 2022, 2022, : 93 - 102
  • [8] Transformer-based fall detection in videos
    Nunez-Marcos, Adrian
    Arganda-Carreras, Ignacio
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 132
  • [9] Online Multi-player Tracking in Monocular Soccer Videos
    Herrmann, Michael
    Hoernig, Martin
    Radig, Bernd
    2014 AASRI CONFERENCE ON SPORTS ENGINEERING AND COMPUTER SCIENCE (SECS 2014), 2014, 8 : 30 - 37
  • [10] Vision Transformer-Based Tailing Detection in Videos
    Lee, Jaewoo
    Lee, Sungjun
    Cho, Wonki
    Siddiqui, Zahid Ali
    Park, Unsang
    APPLIED SCIENCES-BASEL, 2021, 11 (24):