Radon Cumulative Distribution Transform Subspace Modeling for Image Classification

被引:15
作者
Shifat-E-Rabbi, Mohammad [1 ]
Yin, Xuwang [2 ]
Rubaiyat, Abu Hasnat Mohammad [2 ]
Li, Shiying [1 ]
Kolouri, Soheil [3 ]
Aldroubi, Akram [4 ]
Nichols, Jonathan M. [5 ]
Rohde, Gustavo K. [1 ,2 ]
机构
[1] Univ Virginia, Dept Biomed Engn, Charlottesville, VA 22908 USA
[2] Univ Virginia, Dept Elect & Comp Engn, Charlottesville, VA 22904 USA
[3] Vanderbilt Univ, Dept Comp Sci, Nashville, TN 37212 USA
[4] Vanderbilt Univ, Dept Math, Nashville, TN 37212 USA
[5] US Naval Res Lab, Washington, DC 20375 USA
关键词
R-CDT; Nearest subspace; Image classification; Generative model; CONVOLUTIONAL NEURAL-NETWORKS;
D O I
10.1007/s10851-021-01052-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new supervised image classification method applicable to a broad class of image deformation models. The method makes use of the previously described Radon Cumulative Distribution Transform (R-CDT) for image data, whose mathematical properties are exploited to express the image data in a form that is more suitable for machine learning. While certain operations such as translation, scaling, and higher-order transformations are challenging to model in native image space, we show the R-CDT can capture some of these variations and thus render the associated image classification problems easier to solve. The method-utilizing a nearest-subspace algorithm in the R-CDT space-is simple to implement, non-iterative, has no hyper-parameters to tune, is computationally efficient, label efficient, and provides competitive accuracies to state-of-the-art neural networks for many types of classification problems. In addition to the test accuracy performances, we show improvements (with respect to neural network-based methods) in terms of computational efficiency (it can be implemented without the use of GPUs), number of training samples needed for training, as well as out-of-distribution generalization. The Python code for reproducing our results is available at Shifat-E-Rabbi et al. (Python code implementing the Radon cumulative distribution transform subspace model for image classification. https://github.com/rohdelab/rcdt_ns_classifier).
引用
收藏
页码:1185 / 1203
页数:19
相关论文
共 51 条
[1]  
[Anonymous], KAGGLE SIGN LANGUAGE
[2]   Classification of Hyperspectral Images With Regularized Linear Discriminant Analysis [J].
Bandos, Tatyana V. ;
Bruzzone, Lorenzo ;
Camps-Valls, Gustavo .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2009, 47 (03) :862-873
[3]   Detecting and visualizing cell phenotype differences from microscopy images using transportbased morphometry [J].
Basu, Saurav ;
Kolouri, Soheil ;
Rohde, Gustavo K. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 (09) :3448-3453
[4]  
Bosch A, 2007, IEEE I CONF COMP VIS, P1863
[5]  
Bracewell, 1986, FOURIER TRANSFORM IT
[6]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[7]   Random Forest and Rotation Forest for fully polarized SAR image classification using polarimetric and spatial features [J].
Du, Peijun ;
Samat, Alim ;
Waske, Bjoern ;
Liu, Sicong ;
Li, Zhenhong .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2015, 105 :38-53
[8]  
Fitzgerald CE., 2018, MED IMAGING IMAGE PR, V10574, P105742
[9]   Artificial neural networks (the multilayer perceptron) - A review of applications in the atmospheric sciences [J].
Gardner, MW ;
Dorling, SR .
ATMOSPHERIC ENVIRONMENT, 1998, 32 (14-15) :2627-2636
[10]  
Hadid A, 2007, 2007 FIRST ACM/IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERAS, P96