Phrase-based Image Captioning

被引:0
|
作者
Lebret, Remi [1 ,2 ]
Pinheiro, Pedro O. [1 ,2 ]
Collobert, Ronan [3 ]
机构
[1] Idiap Res Inst, Martigny, Switzerland
[2] Ecole Polytech Fed Lausanne EPFL, Lausanne, Switzerland
[3] Facebook AI Res, Menlo Pk, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generating a novel textual description of an image is an interesting problem that connects computer vision and natural language processing. In this paper, we present a simple model that is able to generate descriptive sentences given a sample image This model has a strong focus on the syntax of the descriptions. We train a purely bilinear model that learns a metric between an image representation (generated from a previously trained Convolutional Neural Network) and phrases that are used to described them. The system is then able to infer phrases from a given image sample. Based on caption syntax statistics, we propose a simple language model that can produce relevant descriptions for a given test image using the phrases inferred. Our approach, which is considerably simpler than state-of-the-art models, achieves comparable results in two popular datasets for the task: Flickr30k and the recently proposed Microsoft COCO.
引用
收藏
页码:2085 / 2094
页数:10
相关论文
共 50 条
  • [41] Rule-based reordering constraints for phrase-based SMT
    Goh, Chooi-Ling
    Onishi, Takashi
    Sumita, Eiichiro
    Proceedings of the 15th International Conference of the European Association for Machine Translation, EAMT 2011, 2011, : 113 - 120
  • [42] A Phrase-Based Method for Hierarchical Clustering of Web Snippets
    Li, Zhao
    Wu, Xindong
    PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1947 - 1948
  • [43] Czech-English phrase-based machine translation
    Bojar, Ondrej
    Matusov, Evgeny
    Ney, Hermann
    Lect. Notes Comput. Sci., (214-224):
  • [44] Improving semistatic compression via phrase-based modeling
    Brisaboa, Nieves R.
    Farina, Antonio
    Navarro, Gonzalo
    Parama, Jose R.
    INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (04) : 545 - 559
  • [45] Leveraging External Knowledge for Phrase-based Topic Modeling
    Xu, Mingyang
    Yang, Ruixin
    Ranshous, Stephen
    Li, Shijie
    Samatova, Nagiza F.
    2017 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2017, : 29 - 32
  • [46] English to Bodo Phrase-Based Statistical Machine Translation
    Islam, Md Saiful
    Purkayastha, Bipul Syam
    ADVANCED COMPUTING AND COMMUNICATION TECHNOLOGIES, 2018, 562 : 207 - 217
  • [47] Phrase-Based Presentation Slides Generation for Academic Papers
    Wang, Sida
    Wan, Xiaojun
    Du, Shikang
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 196 - 202
  • [48] Phrase-based Hierarchical Method for Clustering Search Results
    Yang Ke
    Han Baoming
    Li Zujie
    PROCEEDINGS OF THE THIRD INTERNATIONAL SYMPOSIUM ON TEST AUTOMATION & INSTRUMENTATION, VOLS 1 - 4, 2010, : 1430 - 1435
  • [49] A Hybrid Phrase-based/Statistical Speech Translation System
    Stallard, David
    Choi, Fred
    Krstovski, Kriste
    Natarajan, Prem
    Prasad, Rohit
    Saleem, Shirin
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 757 - 760
  • [50] Improvements in Statistical Phrase-Based Interactive Machine Translation
    Cai, Dongfeng
    Zhang, Hua
    Ye, Na
    2013 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2013), 2013, : 91 - 94