A novel framework for semantic annotation and personalized retrieval of sports video

被引:84
作者
Xu, Changsheng [1 ]
Wang, Jinjun [1 ]
Lu, Hanqing [2 ]
Zhang, Yifan [2 ]
机构
[1] Media Semant, Inst Infocomm Res, Singapore 119613, Singapore
[2] Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Beijing, Peoples R China
关键词
annotation; event detection; personalized retrieval; sports video analysis; summarization;
D O I
10.1109/TMM.2008.917346
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sports video annotation is important for sports video semantic analysis such as event detection and personalization. In this paper, we propose a novel approach for sports video semantic annotation and personalized retrieval. Different from the state of the art sports video analysis methods which heavily rely on audio/visual features, the proposed approach incorporates web-casting text into sports video analysis. Compared with previous approaches, the contributions of our approach include the following. 1) The event detection accuracy is significantly improved due to the incorporation of web-casting text analysis. 2) The proposed approach is able to detect exact event boundary and extract event semantics that are very difficult or impossible to be handled by previous approaches. 3) The proposed method is able to create personalized summary from both general and specific point of view related to particular game, event, player or team according to user's preference. We present the framework of our approach and details of text analysis, video analysis, text/video alignment, and personalized retrieval. The experimental results on event boundary detection in sports video are encouraging and comparable to the manually selected events. The evaluation on personalized retrieval is effective in helping meet users' expectations.
引用
收藏
页码:421 / 436
页数:16
相关论文
共 49 条
  • [1] [Anonymous], P ACM MM
  • [2] [Anonymous], 1995, P IEEE INT C MULT CO
  • [3] Semantic annotation of soccer videos: automatic highlights identification
    Assfalg, E
    Bertini, M
    Colombo, C
    Del Bimbo, A
    Nunziati, W
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2003, 92 (2-3) : 285 - 305
  • [4] Event based indexing of broadcasted sports video by intermodal collaboration
    Babaguchi, N
    Kawai, Y
    Kitahashi, T
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2002, 4 (01) : 68 - 75
  • [5] BABAGUCHI N, 2003, P PAC RIM C MULT SIN, P940
  • [6] BABAGUCHI N, 2000, P ACM MULT 2000 WORK, P205
  • [7] BERTINIA M, 2005, P 2 EUR SEM WEB C HE
  • [8] CHESHIRE D, 1990, COMPLETE BOOK VIDEO
  • [9] A unified framework for semantic shot classification in sports video
    Duan, LY
    Xu, M
    Tian, Q
    Xu, CS
    Jin, JS
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2005, 7 (06) : 1066 - 1083
  • [10] Duan LY, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS, P709