Automatic image captioning combining natural language processing and deep neural networks

被引:13
作者
Rinaldi, Antonio M. [1 ]
Russo, Cristiano [1 ]
Tommasino, Cristian [1 ]
机构
[1] Univ Naples Federico II, Dept Elect Engn & Informat Technol, IKNOS LAB Intelligent & Knowledge Syst LUPT, Via Claudio 21, I-80125 Naples, Italy
关键词
Object detection; Image captioning; Deep neural networks; Semantic-instance segmentation;
D O I
10.1016/j.rineng.2023.101107
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
An image contains a lot of information that humans can detect in a very short time. Image captioning aims to detect this information by describing the image content through image and text processing techniques. One of the peculiarities of the proposed approach is the combination of multiple networks to catch as many distinct features as possible from a semantic point of view. In this work, our goal is to prove that a combination strategy of existing methods can efficiently improve the performance in the object detection tasks concerning the performance achieved by each tested individually. This approach involves using different deep neural networks that perform two levels of hierarchical object detection in an image. The results are combined and used by a captioning module that generates image captions through natural language processing techniques. Several experimental results are reported and discussed to show the effectiveness of our framework. The combination strategy has also improved, showing a gain in precision over single models.
引用
收藏
页数:14
相关论文
共 50 条
[41]   Sensory processing and categorization in cortical and deep neural networks [J].
Pinotsis, Dimitris A. ;
Siegel, Markus ;
Miller, Earl K. .
NEUROIMAGE, 2019, 202
[42]   Exploiting Deep Neural Networks for Digital Image Compression [J].
Hussain, Farhan ;
Jeong, Jechang .
2015 2ND WORLD SYMPOSIUM ON WEB APPLICATIONS AND NETWORKING (WSWAN), 2015,
[43]   Image Inpainting Algorithm Based on Deep Neural Networks [J].
Lyu, Jianfeng ;
Shao, Lizhen ;
Lei, Xuemei .
Computer Engineering and Applications, 2023, 59 (20) :1-12
[44]   Deep Neural Networks for Efficient Image Caption Generation [J].
Rai, Riddhi ;
Guruprasad, Navya Shimoga ;
Tumuluru, Shreya Sindhu .
ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT II, 2024, 2091 :247-260
[45]   Medical Image Segmentation Using Deep Neural Networks [J].
Aydin, Musa .
29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
[46]   Image response regression via deep neural networks [J].
Zhang, Daiwei ;
Li, Lexin ;
Sripada, Chandra ;
Kang, Jian .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2024, 85 (05) :1589-1614
[47]   Deep neural networks and image classification in biological vision [J].
Leek, E. Charles ;
Leonardis, Ales ;
Heinke, Dietmar .
VISION RESEARCH, 2022, 197
[48]   Improvement of Image Compression Performance by Deep Neural Networks [J].
Vasylenko, Dmytro ;
Stirenko, Sergii ;
Gordienko, Yuri .
IEEE EUROCON 2021 - 19TH INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES, 2021, :135-139
[49]   IMAGE RECONSTRUCTION FOR QUANTA IMAGE SENSORS USING DEEP NEURAL NETWORKS [J].
Choi, Joon Hee ;
Elgendy, Omar A. ;
Chan, Stanley H. .
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, :6543-6547
[50]   Automatic Comparative Chest Radiography Using Deep Neural Networks [J].
Sony, Redwan ;
Isaac, Carolyn V. ;
Vanbaarle, Alexis ;
Devota, Clara J. ;
Fenton, Todd ;
Hefner, Joseph T. ;
Ross, Arun .
IEEE ACCESS, 2025, 13 :4398-4410