MULTI-SCALE CROPPING MECHANISM FOR REMOTE SENSING IMAGE CAPTIONING

被引:0
|
作者
Zhang, Xueting [1 ,2 ]
Wang, Qi [1 ,2 ]
Chen, Shangdong [3 ]
Li, Xuelong [1 ,2 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Ctr Opt IMagery Anal & Learning OPTIMAL, Xian 710072, Shaanxi, Peoples R China
[3] Northwest Univ, Sch Informat Sci & Technol, Xian 710072, Shaanxi, Peoples R China
来源
2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019) | 2019年
基金
中国国家自然科学基金;
关键词
Remote sensing image; image captioning; encoder-decoder; multi-scale cropping;
D O I
10.1109/igarss.2019.8900503
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
With the rapid development of artificial satellite, a large number of high resolution remote sensing images can be easily obtained now. Recently, remote sensing image captioning, which aims to generate accurate and concise descriptive sentences for remote sensing images, has been promoted by template-based model and encoder-decoder model with several related datasets released. Based on an encoder-decoder model, we propose a training mechanism of multi-scale cropping for remote sensing image captioning in this paper, which can extract more fine-grained information from remote sensing images and enhance the generalization performance of the base model. The experimental results on two datasets UCM-captions and Sydney-captions demonstrate that the proposed approach availably improves the performances in describing high resolution remote sensing images.
引用
收藏
页码:10039 / 10042
页数:4
相关论文
共 50 条
  • [1] Multi-scale Attentive Fusion Network for Remote Sensing Image Change Captioning
    Chen, Cai
    Wang, Yi
    Yap, Kim-Hui
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [2] Remote sensing image target detection combining multi-scale and attention mechanism
    Zhang Y.-Z.
    Guo W.
    Cai Z.-Q.
    Li W.-B.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2022, 56 (11): : 2215 - 2223
  • [3] MC-Net: multi-scale contextual information aggregation network for image captioning on remote sensing images
    Huang, Haiyan
    Shao, Zhenfeng
    Cheng, Qimin
    Huang, Xiao
    Wu, Xiaoping
    Li, Guoming
    Tan, Li
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2023, 16 (02) : 4848 - 4866
  • [4] Remote Sensing Image Retrieval Based on Multi-scale Pooling and Norm Attention Mechanism
    Ge, Yun
    Ma, Lin
    Ye, Famao
    Chu, Jun
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2022, 44 (02) : 543 - 551
  • [5] Deep Multi-Scale Transformer for Remote Sensing Image Restoration
    Li, Yanting
    2024 5TH INTERNATIONAL CONFERENCE ON GEOLOGY, MAPPING AND REMOTE SENSING, ICGMRS 2024, 2024, : 138 - 142
  • [6] Multi-scale uncertainty evaluation of remote sensing image classification
    Zhao Quan-hua
    Song Wei-dong
    Bao Yong
    2009 JOINT URBAN REMOTE SENSING EVENT, VOLS 1-3, 2009, : 1210 - 1215
  • [7] Multi-scale segmentation of the high resolution remote sensing image
    Zhong, C
    Zhao, ZM
    Yan, DM
    Chen, RX
    IGARSS 2005: IEEE International Geoscience and Remote Sensing Symposium, Vols 1-8, Proceedings, 2005, : 3682 - 3684
  • [8] Topological Equivalence in Multi-Scale Remote Sensing Image Segmentation
    Liu, Xiaoli
    Zhu, Guobin
    Li, Jinggang
    Li, Xue
    SENSORS, MECHATRONICS AND AUTOMATION, 2014, 511-512 : 510 - +
  • [9] Remote sensing scene image classification model based on multi-scale features and attention mechanism
    Wang, Guowei
    Xu, Haixia
    Wang, Xinyu
    Yuan, Liming
    Wen, Xianbin
    JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (04)
  • [10] Object Detection of Remote Sensing Image Based on Multi-Scale Feature Fusion and Attention Mechanism
    Du, Zuoqiang
    Liang, Yuan
    IEEE ACCESS, 2024, 12 : 8619 - 8632