Automatic image captioning combining natural language processing and deep neural networks

被引:11
|
作者
Rinaldi, Antonio M. [1 ]
Russo, Cristiano [1 ]
Tommasino, Cristian [1 ]
机构
[1] Univ Naples Federico II, Dept Elect Engn & Informat Technol, IKNOS LAB Intelligent & Knowledge Syst LUPT, Via Claudio 21, I-80125 Naples, Italy
关键词
Object detection; Image captioning; Deep neural networks; Semantic-instance segmentation;
D O I
10.1016/j.rineng.2023.101107
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
An image contains a lot of information that humans can detect in a very short time. Image captioning aims to detect this information by describing the image content through image and text processing techniques. One of the peculiarities of the proposed approach is the combination of multiple networks to catch as many distinct features as possible from a semantic point of view. In this work, our goal is to prove that a combination strategy of existing methods can efficiently improve the performance in the object detection tasks concerning the performance achieved by each tested individually. This approach involves using different deep neural networks that perform two levels of hierarchical object detection in an image. The results are combined and used by a captioning module that generates image captions through natural language processing techniques. Several experimental results are reported and discussed to show the effectiveness of our framework. The combination strategy has also improved, showing a gain in precision over single models.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Natural Language Processing with Optimal Deep Learning-Enabled Intelligent Image Captioning System
    Marzouk, Radwa
    Alabdulkreem, Eatedal
    Nour, Mohamed K.
    Al Duhayyim, Mesfer
    Othman, Mahmoud
    Zamani, Abu Sarwar
    Yaseen, Ishfaq
    Motwakel, Abdelwahed
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (02): : 4435 - 4451
  • [2] Paragraph Image Captioning with Deep Fully Convolutional Neural Networks
    Li R.-F.
    Liang H.-Y.
    Feng F.-X.
    Zhang G.-W.
    Wang X.-J.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2019, 42 (06): : 155 - 161
  • [3] Deep Learning for automatically describing images in natural language - Image Captioning
    Hotaran, Anca Mihaela
    Vrejoiu, Mihnea Horia
    ROMANIAN JOURNAL OF INFORMATION TECHNOLOGY AND AUTOMATIC CONTROL-REVISTA ROMANA DE INFORMATICA SI AUTOMATICA, 2020, 30 (01): : 87 - 100
  • [4] Lighting Search Algorithm With Convolutional Neural Network-Based Image Captioning System for Natural Language Processing
    Alnashwan, Rana Othman
    Chelloug, Samia Allaoua
    Almalki, Nabil Sharaf
    Issaoui, Imene
    Motwakel, Abdelwahed
    Sayed, Ahmed
    IEEE ACCESS, 2023, 11 : 142643 - 142651
  • [5] Deep image captioning using an ensemble of CNN and LSTM based deep neural networks
    Alzubi, Jafar A.
    Jain, Rachna
    Nagrath, Preeti
    Satapathy, Suresh
    Taneja, Soham
    Gupta, Paras
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (04) : 5761 - 5769
  • [6] A survey on deep neural network-based image captioning
    Liu, Xiaoxiao
    Xu, Qingyang
    Wang, Ning
    VISUAL COMPUTER, 2019, 35 (03) : 445 - 470
  • [7] Bridging auditory perception and natural language processing with semantically informed deep neural networks
    Esposito, Michele
    Valente, Giancarlo
    Plasencia-Calana, Yenisel
    Dumontier, Michel
    Giordano, Bruno L.
    Formisano, Elia
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [8] Hierarchical Deep Neural Network for Image Captioning
    Yuting Su
    Yuqian Li
    Ning Xu
    An-An Liu
    Neural Processing Letters, 2020, 52 : 1057 - 1067
  • [9] A survey on deep neural network-based image captioning
    Xiaoxiao Liu
    Qingyang Xu
    Ning Wang
    The Visual Computer, 2019, 35 : 445 - 470
  • [10] Image Captioning using Deep Neural Architectures
    Shah, Parth
    Bakrola, Vishvajit
    Pati, Supriya
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,