Oppositional Harris Hawks Optimization with Deep Learning-Based Image Captioning

被引:0
作者
Kavitha, V. R. [1 ]
Nimala, K. [2 ]
Beno, A. [3 ]
Ramya, K. C. [4 ]
Kadry, Seifedine [5 ]
Kang, Byeong-Gwon [6 ]
Nam, Yunyoung [7 ]
机构
[1] Prathyusha Engn Coll, Dept Comp Sci & Engn, Thiruvallur 602025, India
[2] SRM Inst Sci & Technol, Dept Networking & Commun, Chennai, Tamil Nadu, India
[3] Dr Sivanthi Aditanar Coll Engn, Dept Elect & Commun Engn, Tiruchendur 628215, India
[4] Sri Krishna Coll Engn & Technol, Dept Elect & Elect Engn, Coimbatore 641008, Tamil Nadu, India
[5] Noroff Univ Coll, Dept Appl Data Sci, Kristiansand, Norway
[6] Soonchunhyang Univ, Dept Informat & Commun Engn, Asan, South Korea
[7] Soonchunhyang Univ, Dept Comp Sci & Engn, Asan, South Korea
来源
COMPUTER SYSTEMS SCIENCE AND ENGINEERING | 2023年 / 44卷 / 01期
关键词
Image captioning; natural language processing; artificial intelligence; machine learning; deep learning; FUSION;
D O I
10.32604/csse.2023.024553
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Image Captioning is an emergent topic of research in the domain of artificial intelligence (AI). It utilizes an integration of Computer Vision (CV) and Natural Language Processing (NLP) for generating the image descriptions. It finds use in several application areas namely recommendation in editing applications, utilization in virtual assistance, etc. The development of NLP and deep learning (DL) models find useful to derive a bridge among the visual details and textual semantics. In this view, this paper introduces an Oppositional Harris Hawks Optimization with Deep Learning based Image Captioning (OHHODLIC) technique. The OHHO-DLIC technique involves the design of distinct levels of pre-processing. Moreover, the feature extraction of the images is carried out by the use of EfficientNet model. Furthermore, the image captioning is performed by bidirectional long short term memory (BiLSTM) model, comprising encoder as well as decoder. At last, the oppositional Harris Hawks optimization (OHHO) based hyperparameter tuning process is performed for effectively adjusting the hyperparameter of the EfficientNet and BiLSTM models. The experimental analysis of the OHHO-DLIC technique is carried out on the Flickr 8k Dataset and a comprehensive comparative analysis highlighted the better performance over the recent approaches.
引用
收藏
页码:579 / 593
页数:15
相关论文
共 18 条
  • [1] Automatic Image Captioning Based on ResNet50 and LSTM with Soft Attention
    Chu, Yan
    Yue, Xiao
    Yu, Lei
    Sergei, Mikhailov
    Wang, Zhengkui
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020
  • [2] Image captioning using DenseNet network and adaptive attention
    Deng, Zhenrong
    Jiang, Zhouqin
    Lan, Rushi
    Huang, Wenming
    Luo, Xiaonan
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 85
  • [3] Bidirectional Grid Long Short-Term Memory (BiGridLSTM): A Method to Address Context-Sensitivity and Vanishing Gradient
    Fei, Hongxiao
    Tan, Fengyun
    [J]. ALGORITHMS, 2018, 11 (11)
  • [4] Opposition-based learning Harris hawks optimization with advanced transition rules: principles and analysis
    Gupta, Shubham
    Deep, Kusum
    Heidari, Ali Asghar
    Moayedi, Hossein
    Wang, Mingjing
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 158
  • [5] Harris hawks optimization: Algorithm and applications
    Heidari, Ali Asghar
    Mirjalili, Seyedali
    Faris, Hossam
    Aljarah, Ibrahim
    Mafarja, Majdi
    Chen, Huiling
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 97 : 849 - 872
  • [6] A Comprehensive Survey of Deep Learning for Image Captioning
    Hossain, Md Zakir
    Sohel, Ferdous
    Shiratuddin, Mohd Fairuz
    Laga, Hamid
    [J]. ACM COMPUTING SURVEYS, 2019, 51 (06)
  • [7] Karpathy A, 2015, PROC CVPR IEEE, P3128, DOI 10.1109/CVPR.2015.7298932
  • [8] Deep learning model for real-time image compression in Internet of Underwater Things (IoUT)
    Krishnaraj, N.
    Elhoseny, Mohamed
    Thenmozhi, M.
    Selim, Mahmoud M.
    Shankar, K.
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2020, 17 (06) : 2097 - 2111
  • [9] Optimal deep learning model for classification of lung cancer on CT images
    Lakshmanaprabu, S. K.
    Mohanty, Sachi Nandan
    Shankar, K.
    Arunkumar, N.
    Ramirez, Gustavo
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 92 : 374 - 382
  • [10] Automated detection and classification of fundus diabetic retinopathy images using synergic deep learning model
    Shankar, K.
    Sait, Abdul Rahaman Wahab
    Gupta, Deepak
    Lakshmanaprabu, S. K.
    Khanna, Ashish
    Pandey, Hari Mohan
    [J]. PATTERN RECOGNITION LETTERS, 2020, 133 : 210 - 216