Phrase-based Image Captioning

被引:0
|
作者
Lebret, Remi [1 ,2 ]
Pinheiro, Pedro O. [1 ,2 ]
Collobert, Ronan [3 ]
机构
[1] Idiap Res Inst, Martigny, Switzerland
[2] Ecole Polytech Fed Lausanne EPFL, Lausanne, Switzerland
[3] Facebook AI Res, Menlo Pk, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generating a novel textual description of an image is an interesting problem that connects computer vision and natural language processing. In this paper, we present a simple model that is able to generate descriptive sentences given a sample image This model has a strong focus on the syntax of the descriptions. We train a purely bilinear model that learns a metric between an image representation (generated from a previously trained Convolutional Neural Network) and phrases that are used to described them. The system is then able to infer phrases from a given image sample. Based on caption syntax statistics, we propose a simple language model that can produce relevant descriptions for a given test image using the phrases inferred. Our approach, which is considerably simpler than state-of-the-art models, achieves comparable results in two popular datasets for the task: Flickr30k and the recently proposed Microsoft COCO.
引用
收藏
页码:2085 / 2094
页数:10
相关论文
共 50 条
  • [1] phi-LSTM: A Phrase-Based Hierarchical LSTM Model for Image Captioning
    Tan, Ying Hua
    Chan, Chee Seng
    COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 101 - 117
  • [2] Phrase-based image caption generator with hierarchical LSTM network
    Tan, Ying Hua
    Chan, Chee Seng
    NEUROCOMPUTING, 2019, 333 : 86 - 100
  • [3] Statistical phrase-based translation
    Koehn, P
    Och, FJ
    Marcu, D
    HLT-NAACL 2003: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2003, : 127 - 133
  • [4] Hierarchical phrase-based translation
    Chiang, David
    COMPUTATIONAL LINGUISTICS, 2007, 33 (02) : 201 - 228
  • [5] A Comparative Study on Applying Hierarchical Phrase-based and Phrase-based on Thai-Chinese Translation
    Luekhong, Prasert
    Sukhauta, Rattasit
    Porkaew, Peerachet
    Ruangrajitpakorn, Taneth
    Supnithi, Thepchai
    2012 SEVENTH INTERNATIONAL CONFERENCE ON KNOWLEDGE, INFORMATION AND CREATIVITY SUPPORT SYSTEMS (KICSS 2012), 2012, : 126 - 133
  • [6] On the Cost of Phrase-Based Ranking
    Petri, Matthias
    Moffat, Alistair
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 931 - 934
  • [7] A PHRASE-BASED MATCHING FUNCTION
    GALBIATI, G
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1991, 42 (01): : 36 - 48
  • [8] Integrating Phrase Inseparability in Phrase-Based Model
    Shi, Lixin
    Nie, Jian-Yun
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 708 - 709
  • [9] Statistical phrase-based speech translation
    Mathias, Lambert
    Byrne, William
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 561 - 564
  • [10] Improved techniques for phrase-based translation
    Ruiz Costa-Jussa, Marta
    Fonollosa, Jose A. R.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2005, (35): : 351 - 356