Detection of Tennis Events from Acoustic Data

被引:6
作者
Baughman, Aaron [1 ]
Morales, Eduardo [2 ]
Reiss, Gary [3 ]
Greco, Nancy [2 ]
Hammer, Stephen [3 ]
Wang, Shiqiang [2 ]
机构
[1] IBM Corp, Res Triangle Pk, NC USA
[2] IBM Res, Yorktown Hts, NY USA
[3] IBM Corp, Atlanta, GA USA
来源
PROCEEDINGS OF THE 2ND INTERNATIONAL WORKSHOP ON MULTIMEDIA CONTENT ANALYSIS IN SPORTS, MMSPORTS 2019 | 2019年
关键词
Acoustic classification; event detection; machine learning; multimedia system; sports;
D O I
10.1145/3347318.3355520
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Professional tennis is a fast-paced sport with serves and hits that can reach speeds of over 100 mph and matches lasting long in duration. For example, in 13 years of Grand Slam data, there were 454 matches with an average of 3 sets that lasted 40 minutes. The fast pace and long duration of tennis matches make tracking the time boundaries of each tennis point in a match challenging. The visual aspect of a tennis match is highly diverse because of its variety in angles, occlusions, resolutions, contrast and colors, but the sound component is relatively stable and consistent. In this paper, we present a system that detects events such as ball hits and point boundaries in a tennis match from sound data recorded in the match. We first describe the sound processing pipeline that includes preprocessing, feature extraction, basic (atomic) event detection, and point boundary detection. Then, we describe the overall cloud-based system architecture. Afterwards, we describe the user interface that includes a tool for data labeling to efficiently generate the training dataset, and a workbench for sound and model management. The performance of our system is evaluated in experiments with real-world tennis sound data. Our proposed pipeline can detect atomic tennis events with an F1-score of 92.39% and point boundaries with average precision and recall values of around 80%. This system can be very useful for tennis coaches and players to find and extract game highlights with specific characteristics, so that they can analyze these highlights and establish their play strategy.
引用
收藏
页码:91 / 99
页数:9
相关论文
共 22 条
[1]  
[Anonymous], 2019, Cloudant
[2]  
[Anonymous], 2016, Acoustic Scene Recognition with Deep Learning
[3]  
[Anonymous], 2019, Hawk-Eye Line-Calling System
[4]  
[Anonymous], 2018, TUT Acoustic Scene Classification
[5]  
Bagnall A., 2018, The UEA & UCR time series classification repository
[6]  
Bagnall A, 2014, Arxiv, DOI arXiv:1406.4757
[7]   Environmental Sound Recognition With Time-Frequency Audio Features [J].
Chu, Selina ;
Narayanan, Shrikanth ;
Kuo, C. -C. Jay .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (06) :1142-1158
[8]  
Decroos T, 2017, AAAI CONF ARTIF INTE, P1302
[9]  
Duxans Helenca, 2009, 2009 IEEE INT S BROA, P1
[10]  
Gharib S, 2018, IEEE INT WORKS MACH