Towards a Scene-based Video Annotation Framework

被引:1
作者
Getahun, Fekade [1 ]
Birara, Mekuanent [2 ]
机构
[1] Univ Addis Ababa, Dept Comp Sci, Addis Ababa, Ethiopia
[2] Hawassa Univ, Dept Comp Sci & IT, Awasa, Ethiopia
来源
2015 11TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS (SITIS) | 2015年
关键词
Video annotation; Scene level video annotation; Concept formulation; Concept normalization; Video summarization; Event prediction and Shot boundary detection;
D O I
10.1109/SITIS.2015.123
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The amount of video in the web is huge and searching specific part of a video can be achieved using a content or text based video indexing, grouping, searching and retrieval approaches. To realize content based video searching, in this work we propose scene based video annotation for the identification and labeling of events and objects in a video with a descriptive text. Video annotation requires a knowledge base to define semantic meaning of events and objects in the video. Manual and semi-supervised video annotation approaches fail as both require expertise for the correct identification and labeling of video concepts. Annotation requires a great deal of concept dependency and relatedness processing to give descriptive statement to a scene in the video. This paper introduces a novel scene based video annotation framework to provide scene level semantic description of videos. The framework uses audio component of the scene to support event and object identification and has proper filtering and normalization. The framework provides concept relatedness, concept formulation, shot and scene level video annotations. To validate the capability of the proposed framework, we developed a prototype that shows a scene level video annotation. The framework is evaluated using a standard video processing evaluation dataset for its accuracy in event and object prediction, and the overall accuracy and usability of the system is evaluated using human ratings. The proposed approach exhibits 81% accuracy in object and event prediction and an average user rating of 3.47 out of 4 in overall system evaluation.
引用
收藏
页码:306 / 313
页数:8
相关论文
共 50 条
  • [21] Studying Natural User Interfaces for Smart Video Annotation towards Ubiquitous Environments
    Rodrigues, Rui
    Madeira, Rui Neves
    Correia, Nuno
    PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS MULTIMEDIA, MUM 2021, 2021, : 158 - 168
  • [22] Visualization-Based Active Learning for Video Annotation
    Liao, Hongsen
    Chen, Li
    Song, Yibo
    Ming, Hao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (11) : 2196 - 2205
  • [23] A NOVEL VIDEO ANNOTATION FRAMEWORK USING NEAR-DUPLICATE SEGMENT DETECTION
    Chou, Chien-Li
    Chen, Hua-Tsung
    Hsu, Chun-Chieh
    Lee, Suh-Yin
    2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2015,
  • [24] Multimodal Video Annotation for Retrieval and Discovery of Newsworthy Video in a News Verification Scenario
    Nixon, Lyndon
    Apostolidis, Evlampios
    Markatopoulou, Foteini
    Patras, Ioannis
    Mezaris, Vasileios
    MULTIMEDIA MODELING (MMM 2019), PT I, 2019, 11295 : 143 - 155
  • [25] A Collaborative Video Annotation System Based on Semantic Web Technologies
    Marco Grassi
    Christian Morbidoni
    Michele Nucci
    Cognitive Computation, 2012, 4 : 497 - 514
  • [26] A Collaborative Video Annotation System Based on Semantic Web Technologies
    Grassi, Marco
    Morbidoni, Christian
    Nucci, Michele
    COGNITIVE COMPUTATION, 2012, 4 (04) : 497 - 514
  • [27] Video annotation based on adaptive annular spatial partition scheme
    Ding, Guiguang
    Zhang, Lu
    Li, Xiaoxu
    IEICE ELECTRONICS EXPRESS, 2010, 7 (01): : 7 - 12
  • [28] Object tracking for video annotation
    Zhang, SQ
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XXVII, PTS 1AND 2, 2004, 5558 : 804 - 814
  • [29] WEB-BASED COLLABORATIVE VIDEO ANNOTATION SYSTEM AND ANNOTATION RECOMMEND METHOD FOR ENGLISH CONVERSATION LEARNING
    Chou, C. M.
    Huang, Y. M.
    4TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE (INTED 2010), 2010, : 4345 - 4345
  • [30] Video annotation and analytics in CourseMapper
    Chatti M.A.
    Marinov M.
    Sabov O.
    Laksono R.
    Sofyan Z.
    Fahmy Yousef A.M.
    Schroeder U.
    Smart Learning Environments, 3 (1)