Semantic analysis based on fusion of audio/visual features for soccer video

Cited by: 2
Authors
Wang, Zengkai [1]
Affiliation
[1] Jiaxing Univ, Coll Math Phys & Informat Engn, Jiaxing 314001, Peoples R China
Source
PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY | 2021 / Vol. 183
Keywords
Semantic analysis; event detection; soccer video;
DOI
10.1016/j.procs.2021.02.098
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
To analyze the semantic content of soccer video, audio and visual features are effectively extracted. Based on the theory that variations in video content cause fluctuations in viewers' affection, a highlight time curve (HTC) is generated by fusing affection arousal factors to reveal the excitement of the game. The semantic boundaries of highlights are determined by the HTC combined with domain knowledge of soccer video. With the help of distinguishable highlight feature vectors (HFVs), highlights are classified into goal, shoot, and foul. Compared with existing work, the main contributions of this paper are as follows. A novel Hough-transform-based whistle detection algorithm is proposed and achieves more effective performance. A robust goalmouth detection algorithm is presented and contributes to the highlight classification phase. Highlights with semantic boundaries are accurately extracted and classified. Experiments conducted on real-world soccer videos demonstrate the good performance of the proposed framework. (C) 2021 The Authors. Published by Elsevier B.V.
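The abstract gives no formulas for the HTC, so the following is only an illustrative sketch of the general idea of fusing per-frame arousal factors into one curve and reading candidate highlight spans from it. The factor names, the weighted-sum fusion, the moving-average smoothing, and the fixed threshold are all assumptions made for illustration and are not taken from the paper, which determines boundaries using soccer domain knowledge.

```python
from typing import List, Sequence, Tuple


def min_max_normalize(values: Sequence[float]) -> List[float]:
    """Scale one factor to [0, 1]; a constant factor maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def moving_average(values: Sequence[float], window: int) -> List[float]:
    """Centered moving average; the window is truncated at both ends."""
    half = window // 2
    out = []
    for i in range(len(values)):
        start = max(0, i - half)
        end = min(len(values), i + half + 1)
        out.append(sum(values[start:end]) / (end - start))
    return out


def highlight_time_curve(
    factors: Sequence[Sequence[float]],
    weights: Sequence[float],
    window: int = 3,
) -> List[float]:
    """Hypothetical fusion: weighted mean of normalized factors, then smoothing."""
    normalized = [min_max_normalize(f) for f in factors]
    total = sum(weights)
    fused = [
        sum(w * f[t] for w, f in zip(weights, normalized)) / total
        for t in range(len(normalized[0]))
    ]
    return moving_average(fused, window)


def highlight_segments(
    curve: Sequence[float], threshold: float
) -> List[Tuple[int, int]]:
    """Return inclusive (start, end) index pairs where the curve >= threshold."""
    segments = []
    start = None
    for t, v in enumerate(curve):
        if v >= threshold and start is None:
            start = t
        elif v < threshold and start is not None:
            segments.append((start, t - 1))
            start = None
    if start is not None:
        segments.append((start, len(curve) - 1))
    return segments


# Made-up per-frame factors (assumed names, not from the paper).
audio_energy = [0.1, 0.1, 0.9, 0.9, 0.1, 0.1]
motion_activity = [2.0, 2.0, 8.0, 8.0, 2.0, 2.0]
curve = highlight_time_curve([audio_energy, motion_activity], [1.0, 1.0], window=1)
segments = highlight_segments(curve, threshold=0.5)
```

Normalizing before fusion keeps a factor with a large numeric range from dominating the curve. In a real system the threshold step would be replaced by whatever boundary rules the domain knowledge supplies.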
Pages: 563-571
Page count: 9