Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures (Extended Abstract)

被引:0
|
作者
Bernardi, Raffaella [1 ]
Cakici, Ruket [2 ]
Elliott, Desmond [3 ]
Erdem, Aykut [4 ]
Erdem, Erkut [4 ]
Ikizler-Cinbis, Nazli [4 ]
Keller, Frank [5 ]
Muscat, Adrian [6 ]
Plank, Barbara [7 ]
机构
[1] Univ Trento, Trento, Italy
[2] Middle East Tech Univ, Ankara, Turkey
[3] Univ Amsterdam, Amsterdam, Netherlands
[4] Hacettepe Univ, Ankara, Turkey
[5] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[6] Univ Malta, Msida, Malta
[7] Univ Groningen, Groningen, Netherlands
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic image description generation is a challenging problem that has recently received a large amount of interest from the computer vision and natural language processing communities. In this survey, we classify the known approaches based on how they conceptualise this problem and provide a review of existing models, highlighting their advantages and disadvantages. Moreover, we give an overview of the benchmark image-text datasets and the evaluation measures that have been developed to assess the quality of machine-generated descriptions. Finally we explore future directions in the area of automatic image description.
引用
收藏
页码:4970 / 4974
页数:5
相关论文
共 50 条
  • [31] Towards automatic product generation from meteosat images
    Gaertner, V.
    European Space Agency Bulletin, 1995, (82):
  • [32] Automatic Spreadsheet Generation from Conceptual Models
    Antunes, Leo
    Correa, Alexandre
    Barros, Marcio
    2015 29TH BRAZILIAN SYMPOSIUM ON SOFTWARE ENGINEERING, 2015, : 140 - 149
  • [33] Learning Probabilistic logic models from probabilistic examples (Extended abstract)
    Chen, Jianzhong
    Muggleton, Stephen
    Santos, Jose
    INDUCTIVE LOGIC PROGRAMMING, 2008, 4894 : 22 - +
  • [35] Automatic Implementations Synthesis of Secure Protocols and Attacks from Abstract Models
    Sivelle, Camille
    Debbah, Lorys
    Puys, Maxime
    Lafourcade, Pascal
    Franco-Rondisson, Thibault
    SECURE IT SYSTEMS, NORDSEC 2022, 2022, 13700 : 234 - 252
  • [36] Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques
    Chrupala, Grzegorz
    Journal of Artificial Intelligence Research, 2022, 73 : 673 - 707
  • [37] Automatic generation of subject-specific finite element models of the spine from magnetic resonance images
    Kok, Joeri
    Shcherbakova, Yulia M.
    Schlosser, Tom P. C.
    Seevinck, Peter R.
    van der Velden, Tijl A.
    Castelein, Rene M.
    Ito, Keita
    van Rietbergen, Bert
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2023, 11
  • [38] Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques
    Chrupala, Grzegorz
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 73 : 673 - 707
  • [39] A Survey of Automatic Code Generation from Natural Language
    Shin, Jiho
    Nam, Jaechang
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2021, 17 (03): : 537 - 555
  • [40] A comprehensive survey on shadow removal from document images: datasets, methods, and opportunities
    Bingshu Wang
    Changping Li
    Wenbin Zou
    Yongjun Zhang
    Xuhang Chen
    C.L. Philip Chen
    Vicinagearth, 2 (1):