A lightweight convolutional neural network for large-scale Chinese image caption

被引:0
|
作者
Dexin Zhao
Ruixue Yang
Shutao Guo
机构
[1] Tianjin University of Technology,Tianjin Key Laboratory of Intelligence Computing and Novel Software Technology
来源
Optoelectronics Letters | 2021年 / 17卷
关键词
A;
D O I
暂无
中图分类号
学科分类号
摘要
Image caption is a high-level task in the area of image understanding, in which most of the models adopt a convolutional neural network (CNN) to extract image features assigning a recurrent neural network (RNN) to generate sentences. Researchers tend to design complex networks with deeper layers to improve the performance of feature extraction in recent years. Increasing the size of the network could obtain features of high quality, but it is not an efficient way in terms of computational cost. A large number of parameters brought by CNN makes the research difficult to apply in human daily life. In order to reduce the information loss of the convolutional process with less cost, we propose a lightweight convolutional neural network, named as Bifurcate-CNN (B-CNN). Furthermore, recent works are devoted to generating captions in English, in this paper, we develop an image caption model that generates descriptions in Chinese. Compared with Inception-v3, the depth of our model is shallower with fewer parameters, and the computational cost is lower. Evaluated on the AI CHALLENGER dataset, we prove that our model can enhance the performance, improving BLEU-4 from 46.1 to 49.9 and CIDEr from 142.5 to 156.6 respectively.
引用
收藏
页码:361 / 366
页数:5
相关论文
共 50 条
  • [21] A LIGHTWEIGHT CONVOLUTIONAL NEURAL NETWORK FOR BITEMPORAL IMAGE CHANGE DETECTION
    Wang, Rongfang
    Ding, Fan
    Chen, Jia-Wei
    Jiao, Licheng
    Wang, Liang
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 2551 - 2554
  • [22] A Lightweight Conditional Convolutional Neural Network for Hyperspectral Image Classification
    Wu, Linfeng
    Wang, Huajun
    Wang, Huiqing
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2023, 89 (07): : 413 - 423
  • [23] ConvUNeXt: A Lightweight Convolutional Neural Network for Watercolor Image Translation
    Su, Hao
    Huang, Jiamian
    Ito, Yasuaki
    Nakano, Koji
    2022 TENTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING WORKSHOPS, CANDARW, 2022, : 127 - 133
  • [24] Lightweight Label Propagation for Large-Scale Network Data
    Li, Yu-Feng
    Liang, De-Ming
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (05) : 2071 - 2082
  • [25] Large-scale Video Classification with Convolutional Neural Networks
    Karpathy, Andrej
    Toderici, George
    Shetty, Sanketh
    Leung, Thomas
    Sukthankar, Rahul
    Fei-Fei, Li
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1725 - 1732
  • [26] Lightweight Label Propagation for Large-Scale Network Data
    Liang, De-Ming
    Li, Yu-Feng
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3421 - 3427
  • [27] LARGE-SCALE WEAKLY SUPERVISED AUDIO CLASSIFICATION USING GATED CONVOLUTIONAL NEURAL NETWORK
    Xu, Yong
    Kong, Qiuqiang
    Wang, Wenwu
    Plumbley, Mark D.
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 121 - 125
  • [28] A Lightweight Framework for Fast Image Retrieval on Large-Scale Image Datasets
    Chen, Renhai
    Li, Wenwen
    Rao, Guozheng
    Feng, Zhiyong
    2020 9TH IEEE NON-VOLATILE MEMORY SYSTEMS AND APPLICATIONS SYMPOSIUM (NVMSA 2020), 2020, : 42 - 47
  • [29] A Convolutional Neural Network to Perform Object Detection and Identification in Visual Large-Scale Data
    Ayachi, Riadh
    Said, Yahia
    Atri, Mohamed
    BIG DATA, 2021, 9 (01) : 41 - 52
  • [30] Automatic Classification of Large-Scale Respiratory Sound Dataset Based on Convolutional Neural Network
    Minami, Koki
    Lu, Huimin
    Kim, Hyoungseop
    Mabu, Shingo
    Hirano, Yasushi
    Kido, Shoji
    2019 19TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2019), 2019, : 804 - 807