Evaluation of deep learning algorithm for crop identification based on GF-6 time series images

被引:0
|
作者
Chen S. [1 ]
Liu J. [1 ]
机构
[1] Institute of Agricultural Resource and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing
来源
Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering | 2021年 / 37卷 / 15期
关键词
Crops; Deep learning; GF-6; Heilongjiang; Recognition; Remote sensing; Time series;
D O I
10.11975/j.issn.1002-6819.2021.15.020
中图分类号
学科分类号
摘要
Crop type mapping is one of the most important tools with medium and high spatial resolution satellite images in monitoring services of modern agriculture. Taking Heilongjiang Province of northeast China as a study area, this study aims to evaluate the state-of-the-art deep learning in crop type classification. A comparison was made on the Convolution Neural Network (CNN), Recurrent Neural Network (RNN), and Attention Mechanism (AM) for the application in crop type classification, while the traditional random forest (RF) model was also used as the control. Six models of deep learning were Temporal Convolutional Neural Network (TempCNN), Multi Scale 1D Residual Network (MSResNet), Omniscale Convolutional Neural Network (OmniscaleCNN), Long Short-Term Memory (LSTM), STAR Recurrent Neural Network (StarRNN), and Transformer. The specific procedure was as follows. First, GF-6 wide-field view image time series was acquired between April and November in the Lindian and Hailun study area, northeast of China, in order to extract the features of three types of crops at different growth stages. The resulting image time series used in the Lindian and the Hailun was composed of 41 and 48 GF-6 images, respectively. The preprocessing workflow included RPC correction, radiometric calibration, convert to top-of-atmospheric and surface reflectance using 6S atmospheric correction. The image interpolation and global min-max normalization were also applied to fill the empty pixel, further improving the convergence speed and stability of neural networks. The ground truth data was manually labelled using a field survey combined with GF-2 high-resolution image to generate datasets for train and evaluation. The datasets included six crops, such as rice, maize, soybean, water, urban and rest, covering 2 003 629 pixels in Lindian, 935 679 pixels in Hailun. Second, all models were trained and evaluated in Lindian, according to the differences between CNN, RNN, AM, and RF. All models achieved an overall accuracy of 93%-95%, and F1-score above 89%, 84%, and 97% for soybean, maize, and rice, respectively, where three major crops were from both study areas. Thirdly, the trained model in Lindian was transferred to that in Hailun, where the overall classification accuracy of each model declined between 7.2% to 41.0%, due to the differences of land cover classes and temporal composition of the data. Among CNNs, the accuracy of MSResNet barely changed to recognize three types of crops after transfer. Since OmniScaleCNN was automatically adjusted the size of the convolution filter, the accuracy of OmniScaleCNN after the transfer was better than that of TempCNN. A forget gate was utilized in the LSTM and StarRNN among RNNs, in order to avoid gradient explosion and disappearance in the classification, where the overall accuracy declined less than 10% after transfer. However, the accuracy of attention-based Transformer and RF dropped significantly. All models performed better on the distribution of three types of crops under the condition that the spatial location and temporal composition of data remain unchanged, in terms of visual analysis of classified images. Two CNN or RNN models were expected to accurately identify the general distribution of all land cover classes, under the varying spatial location and temporal composition. Furthermore, the run time of each deep learning was within 1 h, less than 6.2 times of random forest. Time consumption in the whole process was associated with the model training, as well as the image treatment for the Hailun study area covering an area of about 10 000 km². Correspondingly, deep learning presented a high-precision and large-scale crop mapping, in terms of classification accuracy and operating efficiency, particularly that the transfer learning performed better than before. © 2021, Editorial Department of the Transactions of the Chinese Society of Agricultural Engineering. All right reserved.
引用
收藏
页码:161 / 168
页数:7
相关论文
共 36 条
  • [11] Zhao Hongwei, Chen Zhongxin, Jiang Hao, Et al., Early growing stage crop species identification in southern China based on sentinel-1A time series imagery and one-dimensional CNN, Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 36, 3, pp. 169-177, (2020)
  • [12] Pelletier C, Webb G I, Petitjean F., Temporal convolutional neural network for the classification of satellite image time series, Remote Sensing, 11, 5, pp. 3154-3166, (2019)
  • [13] Heming L, Qi L., Hyperspectral imagery classification using sparse representations of convolutional neural network features, Remote Sensing, 8, 2, pp. 99-110, (2016)
  • [14] Ji S, Zhang Z, Zhang C, Et al., Learning discriminative spatiotemporal features for precise crop classification from multi-temporal satellite images, International Journal of Remote Sensing, 41, 8, pp. 3162-3174, (2020)
  • [15] Shunping J, Chi Z, Anjian X, Et al., 3D convolutional neural networks for crop classification with multi-temporal remote sensing images, Remote Sensing, 10, 2, pp. 75-89, (2018)
  • [16] Ismail H, Forestier G, Weber J, Et al., Deep learning for time series classification: A review, Data Mining and Knowledge Discovery, 24, 2, pp. 57-69, (2019)
  • [17] Zhong L, Hu L, Zhou H., Deep learning based multi-temporal crop classification, Remote Sensing of Environment, 221, 3, pp. 430-443, (2018)
  • [18] Russwurm M, Korner M., Temporal vegetation modelling using long short-term memory networks for crop identification from medium-resolution multi-spectral satellite images, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 11-19, (2017)
  • [19] Yang Zeyu, Zhang Hongyan, Ming Jin, Et al., Extraction of winter rapeseed from high-resolution remote sensing imagery via deep learning, Bulletin of Surveying and Mapping, 522, 9, pp. 113-116, (2020)
  • [20] Ienco D, Gaetano R, Dupaquier C, Et al., Land cover classification via multitemporal spatial data by deep recurrent neural networks, IEEE Geoscience and Remote Sensing Letters, 14, 10, pp. 1685-1689, (2017)