Towards a Scene-based Video Annotation Framework

被引:1
作者
Getahun, Fekade [1 ]
Birara, Mekuanent [2 ]
机构
[1] Univ Addis Ababa, Dept Comp Sci, Addis Ababa, Ethiopia
[2] Hawassa Univ, Dept Comp Sci & IT, Awasa, Ethiopia
来源
2015 11TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS (SITIS) | 2015年
关键词
Video annotation; Scene level video annotation; Concept formulation; Concept normalization; Video summarization; Event prediction and Shot boundary detection;
D O I
10.1109/SITIS.2015.123
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The amount of video in the web is huge and searching specific part of a video can be achieved using a content or text based video indexing, grouping, searching and retrieval approaches. To realize content based video searching, in this work we propose scene based video annotation for the identification and labeling of events and objects in a video with a descriptive text. Video annotation requires a knowledge base to define semantic meaning of events and objects in the video. Manual and semi-supervised video annotation approaches fail as both require expertise for the correct identification and labeling of video concepts. Annotation requires a great deal of concept dependency and relatedness processing to give descriptive statement to a scene in the video. This paper introduces a novel scene based video annotation framework to provide scene level semantic description of videos. The framework uses audio component of the scene to support event and object identification and has proper filtering and normalization. The framework provides concept relatedness, concept formulation, shot and scene level video annotations. To validate the capability of the proposed framework, we developed a prototype that shows a scene level video annotation. The framework is evaluated using a standard video processing evaluation dataset for its accuracy in event and object prediction, and the overall accuracy and usability of the system is evaluated using human ratings. The proposed approach exhibits 81% accuracy in object and event prediction and an average user rating of 3.47 out of 4 in overall system evaluation.
引用
收藏
页码:306 / 313
页数:8
相关论文
共 50 条
[41]   Surgical Performance Analysis and Classification Based on Video Annotation of Laparoscopic Tasks [J].
Loukas, Constantinos ;
Gazis, Athanasios ;
Kanakis, Meletios A. .
JSLS-JOURNAL OF THE SOCIETY OF LAPAROENDOSCOPIC SURGEONS, 2020, 24 (04)
[42]   sketch based Video Annotation and organization system in Distributed teaching Environment [J].
Chen, Jia ;
Ma, Cui-xia ;
Wang, Hong-an ;
Yang, Hai-yan ;
Teng, Dong-xing .
INTERNATIONAL JOURNAL OF DISTRIBUTED SYSTEMS AND TECHNOLOGIES, 2010, 1 (04) :27-41
[43]   Video Object Annotation, Navigation, and Composition [J].
Goldman, Dan B. ;
Gonterman, Chris ;
Curless, Brian ;
Salesin, David ;
Seitz, Steven M. .
UIST 2008: PROCEEDINGS OF THE 21ST ANNUAL ACM SYMPOSIUM ON USER INTERFACE SOFTWARE AND TECHNOLOGY, 2008, :3-12
[44]   A Distributed Automatic Video Annotation Platform [J].
Islam, Md Anwarul ;
Uddin, Md Azher ;
Lee, Young-Koo .
APPLIED SCIENCES-BASEL, 2020, 10 (15)
[45]   Instawatch: A freeware tool for video annotation [J].
Bermudez Ruiz, Francisco Javier .
PROCEEDINGS OF THE XVII INTERNATIONAL CONFERENCE ON HUMAN COMPUTER INTERACTION INTERACCION 2016, 2016,
[46]   Towards machine vision-based video analysis in smart cities: a survey, framework, applications and open issues [J].
Sabha, Ambreen ;
Selwal, Arvind .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (22) :62107-62158
[47]   A Description Scheme for Video Overview Based on Scene Detection and Face Clustering [J].
Tang, Boyuan ;
Chen, Weiting .
JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2021, 30 (01)
[48]   SCENE RETRIEVAL FOR VIDEO SUMMARIZATION BASED ON TEXT-TO-IMAGE GAN [J].
Yanagi, Rintaro ;
Togo, Ren ;
Ogawa, Takahiro ;
Haseyama, Miki .
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, :1825-1829
[49]   A multi-level Video Annotation Tool based on XML-dictionaries [J].
Kounoudes, Anastasis ;
Tsapatsoulis, Nicolas ;
Theodosiou, Zenonas ;
Milis, Marios .
MATHEMATICAL METHODS, COMPUTATIONAL TECHNIQUES, NON-LINEAR SYSTEMS, INTELLIGENT SYSTEMS, 2008, :455-+
[50]   A collaborative approach for semantic time-based video annotation using gamification [J].
Viana, Paula ;
Pinto, Jose Pedro .
HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2017, 7