An Ontology for Generating Descriptions about Natural Outdoor Scenes

Cited by: 0
Authors
Nwogu, Ifeoma [1]
Zhou, Yingbo [2]
Brown, Christopher [1]
Affiliations
[1] Univ Rochester, 601 Elmwood Ave, Rochester, NY 14627 USA
[2] SUNY Buffalo, Buffalo, NY 14260 USA
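Source
2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS) | 2011
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract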
We present an image ontology useful for generating descriptive texts about highly unconstrained natural outdoor images, taken under many different conditions - lighting, varying viewpoints, etc. The ontology pre-defines the visual content we are interested in describing. Unlike other image description techniques, which tend to be purely object-centric, we utilize a holistic scene ontology for description. The primitive units defined by the ontology are extracted from an image via stochastic processes. Similarly, attributes of the units, also specified by the ontology, are evaluated. Binary and tertiary relationships between relevant primitives are also evaluated. The values, attributes and relationships of the primitive units are combined, based on a pre-defined set of production rules, in such a way as to generate rich, descriptive sentences about the image. Evaluation strategies are implemented to quantitatively test the meaningfulness of the generated sentences. Results indicate that the proposed scene ontology aids in generating highly relevant, naturalistic and meaningful sentences describing natural outdoor images.
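The rule-based combination step described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the `Primitive` and `Relation` types, the `RULES` templates, and the example scene are all hypothetical, and the primitives are written by hand rather than extracted from an image by stochastic processes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Primitive:
    """Hypothetical primitive unit: a scene label plus its attributes."""
    label: str
    attributes: tuple = ()


@dataclass(frozen=True)
class Relation:
    """Hypothetical relationship over two or three primitives."""
    predicate: str
    args: tuple


# Assumed production rules, keyed by the number of primitives involved.
# These templates are illustrative only, not the paper's rule set.
RULES = {
    2: "{0} is {pred} {1}",
    3: "{0} is {pred} {1} and {2}",
}


def noun_phrase(p):
    """Combine a primitive's attributes and label into a noun phrase."""
    return " ".join(["the", *p.attributes, p.label])


def describe(relations):
    """Apply the matching template to each relation and return sentences."""
    sentences = []
    for r in relations:
        template = RULES.get(len(r.args))
        if template is None:
            continue  # no rule covers this arity, so skip the relation
        phrases = [noun_phrase(a) for a in r.args]
        text = template.format(*phrases, pred=r.predicate)
        sentences.append(text[0].upper() + text[1:] + ".")
    return sentences


# Hand-written example scene (assumed, not from the paper's data).
sky = Primitive("sky", ("cloudy",))
mountain = Primitive("mountain", ("rocky",))
lake = Primitive("lake")

for sentence in describe([
    Relation("above", (sky, mountain)),
    Relation("between", (lake, mountain, sky)),
]):
    print(sentence)
```

In this sketch the ontology's role is played by the fixed vocabulary of primitives, attributes, and predicates; a fuller system would also need the extraction and evaluation stages that the abstract mentions.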
Pages: 8