Semi-supervised rotation forest based on ensemble margin theory for the classification of hyperspectral image with limited training data

被引:33
作者
Feng, Wei [1 ]
Quan, Yinghui [1 ]
Dauphin, Gabriel [2 ]
Li, Qiang [3 ]
Gao, Lianru [4 ]
Huang, Wenjiang [4 ]
Xia, Junshi [5 ]
Zhu, Wentao [1 ]
Xing, Mengdao [6 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[2] Univ Paris XIII, Inst Galilee, Lab Informat Proc & Transmiss, Sorbonne Paris Cite,L2TI, Paris, France
[3] Northwestern Polytech Univ, Sch Phys Sci & Technol, Xian 710129, Peoples R China
[4] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Digital Earth Sci, Beijing 100094, Peoples R China
[5] RIKEN, Geoinformat Unit, Ctr Adv Intelligence Project, Tokyo 1030027, Japan
[6] Xidian Univ, Acad Adv Interdisciplinary Res, Xian 710071, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Rotation forest; Classification; Hyperspectral image; Semi-supervised learning; Ensemble margin; SEMISUPERVISED CLASSIFICATION;
D O I
10.1016/j.ins.2021.06.059
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an adaptive semi-supervised rotation forest (SSRoF) algorithm is proposed for the classification of hyperspectral images with limited training data. Our proposition is based on Rotation Forest (RoF), a classifying technique that has proved to be remarkably accurate in the context of high-dimensional data. It is adapted to the semi-supervised context, by increasing the number of training instances in the learning stage, with high-quality unlabeled samples mined using ensemble margin. SMOTE is adopted to overcome the class imbalance problem. Out-Of-Bag (OOB) instances are used in a second phase to figure out the optimal number of samples to be added to the training set. Five ensemble methods and five semi-supervised methods are employed as comparisons. The results on three real hyperspectral remote sensing datasets demonstrate the effectiveness of the proposed method. (c) 2021 Published by Elsevier Inc.
引用
收藏
页码:611 / 638
页数:28
相关论文
共 46 条
  • [1] Alshawabkeh M., 2013, Hypothesis margin based weighting for feature selection using boosting: theory, algorithms and applications"
  • [2] Semi-supervised learning with connectivity-driven convolutional neural networks
    Amorim, Willian Paraguassu
    Rosa, Gustavo Henrique
    Thomazella, Rogerio
    Cogo Castanho, Jose Eduardo
    Lofrano Dotto, Fabio Romano
    Rodrigues Junior, Oswaldo Pons
    Marana, Aparecido Nilceu
    Papa, Joao Paulo
    [J]. PATTERN RECOGNITION LETTERS, 2019, 128 : 16 - 22
  • [3] Batista GE., 2004, ACM SIGKDD EXPL NEWS, V6, P20, DOI [DOI 10.1145/1007730.1007735, 10.1145/1007730.1007735]
  • [4] Dynamic Random Forests
    Bernard, Simon
    Adam, Sebastien
    Heutte, Laurent
    [J]. PATTERN RECOGNITION LETTERS, 2012, 33 (12) : 1580 - 1586
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1007/BF00058655
  • [7] Breiman L., 1984, STAT PROBABILITY SER, DOI 10.1201/9781315139470
  • [8] Semisupervised classification of hyperspectral images by SVMs optimized in the primal
    Chi, Mingmin
    Bruzzone, Lorenzo
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2007, 45 (06): : 1870 - 1880
  • [9] Handling data irregularities in classification: Foundations, trends, and future challenges
    Das, Swagatam
    Datta, Shounak
    Chaudhuri, Bidyut B.
    [J]. PATTERN RECOGNITION, 2018, 81 : 674 - 693
  • [10] Boosting with Lexicographic Programming: Addressing Class Imbalance without Cost Tuning
    Datta, Shounak
    Nag, Sayak
    Das, Swagatam
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (05) : 883 - 897