A Flexible Object-of-Interest Annotation Framework for Online Video Portals

Cited by: 0
Author
Sorschag, Robert [1]
Affiliation
[1] Vienna Univ Technol, Inst Software Technol & Interact Syst, Favoritenstr 9-11, A-1040 Vienna, Austria
Source
FUTURE INTERNET | 2012 / Vol. 4 / Issue 01
Keywords
video annotation; video sharing; object recognition
DOI
10.3390/fi4010179
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline Classification Code
0812
Abstract
In this work, we address the use of object recognition techniques to annotate what is shown where in online video collections. These annotations make it possible to retrieve specific video scenes for object-related text queries, which is not possible with the manually generated metadata used by current portals. We are not the first to present object annotations generated with content-based analysis methods. However, the proposed framework has several features that offer good prospects for its application in real video portals. Firstly, it can easily be used as a background module in any video environment. Secondly, it is not based on a fixed analysis chain but on an extensive recognition infrastructure that can be used with all kinds of visual features, matching, and machine learning techniques. New recognition approaches can be integrated into this infrastructure with low development costs, and the recognition approaches in use can be reconfigured even on a running system. Thus, the framework might also benefit from future advances in computer vision. Thirdly, we present an automatic selection approach to support the use of different recognition strategies for different objects. Finally, visual analysis can be performed efficiently in distributed, multi-processor environments, and a database schema is presented to store the resulting video annotations as well as the offline-generated low-level features in a compact form. We achieve promising results in an annotation case study and in the instance search task of the TRECVID 2011 challenge.
Pages: 179-215
Page count: 37