Multimodal Data Fusion using Signal/Image Processing Methods for Multi-Class Machine Learning

被引:1
作者
Richards, Casey J. [1 ]
Valliani, Nawal [1 ]
Johnson, Benjamin A. [1 ]
Wong, Nelson Ka Ki [1 ]
Pennati, Angelo [1 ]
Saeed, Amir K. [1 ]
Rodriguez, Benjamin M. [1 ]
机构
[1] Johns Hopkins Univ, Whiting Sch Engn, 3400 N Charles St, Baltimore, MD 21218 USA
来源
SIGNAL PROCESSING, SENSOR/INFORMATION FUSION, AND TARGET RECOGNITION XXXII | 2023年 / 12547卷
关键词
statistical signal processing; linear discriminant analysis; feature generation; feature ranking; transforms; classification;
D O I
10.1117/12.2664987
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the world progresses further into the digital era, we see a growing utility for combining datasets gathered on different devices and receivers as well as on varying time ranges, for use in machine learning. However, machine learning classification introduces a requirement for standardized data, which in turn hampers the ability to utilize diverse sets of data at a given timestamp. In this paper, we investigate the application of various signal pre-processing techniques (Daubecheis wavelet, discrete cosine and discrete fourier transform among others) for multi-modal, multi-class machine learning. Following the pre-processing, the multi-faceted signals are represented solely by features generated from first order statistics, eigen decomposition, and linear discriminant. Utilizing these generated features, as opposed to the signals themselves, these diverse datasets may now be combined as input to machine learning methods. Furthermore, we apply Fisher's linear discriminant ratio and Random Forest feature importance metrics for feature ranking and feature space reduction followed by a comparison of the approaches. Our work demonstrates that dissimilar datasets with common classes may be combined using the proposed methods with a classification accuracy >= 95%. This paper demonstrates that the feature space may be reduced by approximately 60% with <= 5% loss in classification accuracy, and in some cases, a slight increase in classification accuracy.
引用
收藏
页数:17
相关论文
共 23 条
[11]   Analysis of a complex of statistical variables into principal components [J].
Hotelling, H .
JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1933, 24 :417-441
[12]  
Jackson Z., 2018, Free spoken digit dataset (fsdd)
[13]   THE APPLICATION OF ELECTRONIC-COMPUTERS TO FACTOR-ANALYSIS [J].
KAISER, HF .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :141-151
[14]   Multisensor data fusion: A review of the state-of-the-art [J].
Khaleghi, Bahador ;
Khamis, Alaa ;
Karray, Fakhreddine O. ;
Razavi, Saiedeh N. .
INFORMATION FUSION, 2013, 14 (01) :28-44
[15]   Decision trees: a recent overview [J].
Kotsiantis, S. B. .
ARTIFICIAL INTELLIGENCE REVIEW, 2013, 39 (04) :261-283
[16]  
Pedregosa F, 2011, J MACH LEARN RES, V12, P2825, DOI 10.1145/2786984.2786995
[17]  
Rao K.R., 1990, DISCRETE COSINE TRAN
[18]  
Rodriguez B, 2020, ALGORITHMS DATA SCI
[19]  
Rogers J, 2006, LECT NOTES COMPUT SC, V3940, P173
[20]  
Saki F, 2016, INT CONF ACOUST SPEE, P2204, DOI 10.1109/ICASSP.2016.7472068