Towards a Scene-based Video Annotation Framework

被引:1
|
作者
Getahun, Fekade [1 ]
Birara, Mekuanent [2 ]
机构
[1] Univ Addis Ababa, Dept Comp Sci, Addis Ababa, Ethiopia
[2] Hawassa Univ, Dept Comp Sci & IT, Awasa, Ethiopia
来源
2015 11TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS (SITIS) | 2015年
关键词
Video annotation; Scene level video annotation; Concept formulation; Concept normalization; Video summarization; Event prediction and Shot boundary detection;
D O I
10.1109/SITIS.2015.123
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The amount of video in the web is huge and searching specific part of a video can be achieved using a content or text based video indexing, grouping, searching and retrieval approaches. To realize content based video searching, in this work we propose scene based video annotation for the identification and labeling of events and objects in a video with a descriptive text. Video annotation requires a knowledge base to define semantic meaning of events and objects in the video. Manual and semi-supervised video annotation approaches fail as both require expertise for the correct identification and labeling of video concepts. Annotation requires a great deal of concept dependency and relatedness processing to give descriptive statement to a scene in the video. This paper introduces a novel scene based video annotation framework to provide scene level semantic description of videos. The framework uses audio component of the scene to support event and object identification and has proper filtering and normalization. The framework provides concept relatedness, concept formulation, shot and scene level video annotations. To validate the capability of the proposed framework, we developed a prototype that shows a scene level video annotation. The framework is evaluated using a standard video processing evaluation dataset for its accuracy in event and object prediction, and the overall accuracy and usability of the system is evaluated using human ratings. The proposed approach exhibits 81% accuracy in object and event prediction and an average user rating of 3.47 out of 4 in overall system evaluation.
引用
收藏
页码:306 / 313
页数:8
相关论文
共 50 条
  • [1] SCENE-BASED AUTOMATIC IMAGE ANNOTATION
    Tariq, Amara
    Foroosh, Hassan
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 3047 - 3051
  • [2] Scene-based Video Analytics Studio
    Liao, Chia-Wei
    Chan, Kai-Hsuan
    Chang, Wen-Tsung
    Tu, Sheng-Tsung
    2014 IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2014, : 304 - 310
  • [3] Scene-based scalable video summarization
    Li, Y
    Kuo, CCJ
    Tretter, D
    INTERNET MULTIMEDIA MANAGEMENT SYSTEMS IV, 2003, 5242 : 34 - 45
  • [4] Scene-based video watermarking for broadcasting systems
    Choi, UC
    Choi, YH
    Kim, DC
    Choi, TS
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2004, PT 4, 2004, 3046 : 271 - 280
  • [5] A Scene-Based Augmented Reality Framework for Exhibits
    Li-Chee-Ming, Julien
    Wu, Zheng
    Tan, Randy
    Tan, Ryan
    Khan, Naimul Mefraz
    Ye, Andy
    Guan, Ling
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2019, PT I, 2019, 11662 : 287 - 296
  • [6] A blind scene-based watermarking for video copyright protection
    Masoumi, Majid
    Amiri, Shervin
    AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2013, 67 (06) : 528 - 535
  • [7] Scene-based nonuniformity correction with video sequences and registration
    Hardie, RC
    Hayat, MM
    Armstrong, E
    Yasuda, B
    APPLIED OPTICS, 2000, 39 (08) : 1241 - 1250
  • [8] A Scene-Based Video Watermarking Technique Using SVMs
    Yen, Shwu-Huey
    Chang, Hsiao-Wei
    Wang, Chia-Jen
    Wang, Patrick S.
    Chang, Mel-Chueh
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 1283 - +
  • [9] Scene-based nonuniformity correction with video sequences and registration
    Hardie, Russell C.
    Hayat, Majeed M.
    Armstrong, Earnest
    Yasuda, Brian
    Applied Optics, 2000, 39 (08): : 1241 - 1250
  • [10] An accurate scene-based traffic model for MPEG video stream
    Zhang, Q
    Lin, C
    Yin, H
    Dai, QH
    ICECS 2003: PROCEEDINGS OF THE 2003 10TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS, VOLS 1-3, 2003, : 228 - 231