Phrase-based Image Captioning

Cited by: 0
Authors
Lebret, Remi [1, 2]
Pinheiro, Pedro O. [1, 2]
Collobert, Ronan [3]
Affiliations
[1] Idiap Res Inst, Martigny, Switzerland
[2] Ecole Polytech Fed Lausanne EPFL, Lausanne, Switzerland
[3] Facebook AI Res, Menlo Pk, CA USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Generating a novel textual description of an image is an interesting problem that connects computer vision and natural language processing. In this paper, we present a simple model that is able to generate descriptive sentences given a sample image. This model has a strong focus on the syntax of the descriptions. We train a purely bilinear model that learns a metric between an image representation (generated from a previously trained Convolutional Neural Network) and the phrases that are used to describe it. The system is then able to infer phrases from a given image sample. Based on caption syntax statistics, we propose a simple language model that can produce relevant descriptions for a given test image using the inferred phrases. Our approach, which is considerably simpler than state-of-the-art models, achieves comparable results on two popular datasets for the task: Flickr30k and the recently proposed Microsoft COCO.
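As a rough illustration of the bilinear scoring idea mentioned in the abstract, the sketch below ranks candidate phrases for one image by the score x^T W v, where x is an image feature vector, v is a phrase vector, and W is a weight matrix. This is not the authors' implementation: the function names, the toy vectors, the matrix values, and the top-k selection are all assumptions made for illustration, and the record does not say how the features or weights are obtained.

```python
from typing import Dict, List, Sequence

Vector = Sequence[float]
Matrix = Sequence[Sequence[float]]


def bilinear_score(image_vec: Vector, weights: Matrix, phrase_vec: Vector) -> float:
    """Return image_vec^T * weights * phrase_vec (hypothetical helper)."""
    total = 0.0
    for i, x in enumerate(image_vec):
        for j, y in enumerate(phrase_vec):
            total += x * weights[i][j] * y
    return total


def top_phrases(
    image_vec: Vector,
    weights: Matrix,
    phrase_vecs: Dict[str, Vector],
    k: int = 2,
) -> List[str]:
    """Rank phrases by bilinear score and keep the k highest (hypothetical helper)."""
    scored = {
        phrase: bilinear_score(image_vec, weights, vec)
        for phrase, vec in phrase_vecs.items()
    }
    return sorted(scored, key=scored.get, reverse=True)[:k]


# Toy values, assumed for illustration only: a 2-d image feature,
# a 2x3 weight matrix, and 3-d phrase vectors.
IMAGE = [1.0, 0.0]
W = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
PHRASES = {
    "a dog": [0.9, 0.1, 0.0],
    "on the grass": [0.5, 0.5, 0.0],
    "a red car": [0.0, 1.0, 0.0],
}

if __name__ == "__main__":
    print(top_phrases(IMAGE, W, PHRASES, k=2))
```

In a trained system the weight matrix would be learned from image and caption pairs, and the selected phrases would then be passed to a language model that assembles them into a sentence, as the abstract describes.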
Pages: 2085-2094
Page count: 10