Word sense disambiguation with pictures

被引:31
作者
Barnard, K [1 ]
Johnson, M [1 ]
机构
[1] Univ Arizona, Dept Comp Sci, Tucson, AZ 85721 USA
关键词
word sense disambiguation; image auto-annotation; region labeling; statistical models;
D O I
10.1016/j.artint.2005.04.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce using images for word sense disambiguation, either alone, or in conjunction with traditional text based methods. The approach is based on a recently developed method for automatically annotating images by using a statistical model for the joint probability for image regions and words. The model itself is learned from a data base of images with associated text. To use the model for word sense disambiguation, we constrain the predicted words to be possible senses for the word under consideration. When word prediction is constrained to a narrow set of choices (such as possible senses), it can be quite reliable. We report on experiments using the resulting sense probabilities as is, as well as augmenting a state of the art text based word sense disambiguation algorithm. In order to evaluate our approach, we developed a new corpus, ImCor, which consists of a substantive portion of the Corel image data set associated with disambiguated text drawn from the SemCor corpus. Our experiments using this corpus suggest that visual information can be very useful in disambiguating word senses. It also illustrates that associated non-textual information such as image data can help ground language meaning. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:13 / 30
页数:18
相关论文
共 50 条
[1]  
AGIRRE E, 1996, P 16 INT C COMP LING, P16
[2]  
AGIRRE E, 1995, P 1 INT C REC ADV NA
[3]  
[Anonymous], 1991, P 29 ANN M ASS COMP, DOI DOI 10.3115/981344.981378
[4]  
[Anonymous], 1998, P 1 INT C LANG RES E
[5]  
Bar-Hillel Y., 1960, Advances in computers, V1, P91, DOI [DOI 10.1016/S0065-2458(08)60607-5, 10.1016/S0065-2458(08)60607-5]
[6]  
Barnard K, 2003, PROC CVPR IEEE, P675
[7]   Matching words and pictures [J].
Barnard, K ;
Duygulu, P ;
Forsyth, D ;
de Freitas, N ;
Blei, DM ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) :1107-1135
[8]  
Barnard K, 2001, PROC CVPR IEEE, P434
[9]  
Barnard K, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL II, PROCEEDINGS, P408, DOI 10.1109/ICCV.2001.937654
[10]  
BARNARD K, 2003, P HLT NAACL 2003 WOR, P1