Few-Shot Aerial Image Semantic Segmentation Leveraging Pyramid Correlation Fusion

Cited by: 9
Authors
Ao, Wei [1]
Zheng, Shunyi [1]
Meng, Yan [2]
Gao, Zhi [1]
Affiliations
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
[2] Hubei Univ, Sch Artificial Intelligence, Wuhan 430062, Peoples R China
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023 / Vol. 61
Keywords
Distance correlation; few-shot semantic segmentation (FSS); meta-learning; remote-sensing image processing; semantic correspondence; DEEP; NETWORK; CLASSIFICATION; AGGREGATION;
DOI
10.1109/TGRS.2023.3328339
Chinese Library Classification (CLC) number
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification code
0708; 070902;
Abstract
Few-shot semantic segmentation (FSS) has gained significant attention due to its ability to segment novel objects using only a limited number of labeled samples, thereby addressing the problem of overfitting caused by a lack of training data. Although this technique is widely studied in the field of computer vision, there are few methods for remote-sensing images. Prevalent FSS methods achieve remarkable results on natural images, but they are difficult to apply to remote-sensing image processing because they rarely take into account the large differences in scale and resolution among remote-sensing images. Consequently, it is hard for them to obtain correct semantic guidance from a few annotated remote-sensing images. To tackle these problems, this article proposes the pyramid correlation fusion network (PCFNet), which improves the ability to mine helpful information by calculating multiscale pixel-wise semantic correspondence. In particular, the dual-distance correlation (DDC) module is designed to simultaneously compute the cosine similarity and Euclidean distance between query features and support features, producing adequate guidance information to determine the category of each pixel. Moreover, to improve segmentation accuracy for small objects, the scale-aware cross-entropy loss (SACELoss) is introduced to dynamically assign loss weights according to the actual sizes of objects. This enables smaller objects to be assigned larger weight values and thus receive more attention during training. Comprehensive experiments on both the iSAID-5^i and DLRSD-5^i datasets demonstrate that our method outperforms state-of-the-art FSS methods. Our code is available at https://github.com/TinyAway/PCFNet.
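The two ideas named in the abstract can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation (see the repository linked above for that). The function names, the representation of features as plain lists of per-pixel vectors, and in particular the weighting rule `2 - fraction` are assumptions chosen only for illustration; the abstract does not state the actual SACELoss formula.

```python
import math


def cosine_similarity(u, v, eps=1e-8):
    """Cosine similarity between two feature vectors (eps avoids division by zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + eps)


def euclidean_distance(u, v):
    """Euclidean (L2) distance between two feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def dual_distance_correlation(query_feats, support_feats):
    """Compare every query pixel with every support pixel.

    Each argument is a list of per-pixel feature vectors. Returns two maps of
    shape (num_query_pixels x num_support_pixels): cosine similarity and
    Euclidean distance. Illustrative only; the real DDC module may differ.
    """
    cos_map = [[cosine_similarity(q, s) for s in support_feats] for q in query_feats]
    dist_map = [[euclidean_distance(q, s) for s in support_feats] for q in query_feats]
    return cos_map, dist_map


def scale_aware_weight(object_pixels, image_pixels):
    """ASSUMED weighting rule: smaller objects get weights closer to 2.

    The abstract only says smaller objects receive larger weights; the linear
    rule used here is a placeholder, not the published formula.
    """
    fraction = object_pixels / image_pixels
    return 2.0 - fraction


def scale_aware_cross_entropy(probs, labels, eps=1e-8):
    """Binary cross-entropy with foreground pixels scaled by scale_aware_weight.

    probs: predicted foreground probability per pixel; labels: 0 or 1 per pixel.
    """
    weight = scale_aware_weight(sum(labels), len(labels))
    total = 0.0
    for p, y in zip(probs, labels):
        ce = -(y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps))
        total += (weight if y == 1 else 1.0) * ce
    return total / len(labels)
```

For example, `dual_distance_correlation([[1.0, 0.0]], [[1.0, 0.0], [0.0, 1.0]])` compares one query pixel with two support pixels, and `scale_aware_weight(10, 1000)` yields a larger weight than `scale_aware_weight(500, 1000)`, which mirrors the abstract's statement that smaller objects receive more attention during training.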
Pages: 1 / 12
Number of pages: 12