Multimodal Data Fusion using Signal/Image Processing Methods for Multi-Class Machine Learning

被引：1

作者：

Richards, Casey J. ^{[1
]}

Valliani, Nawal ^{[1
]}

Johnson, Benjamin A. ^{[1
]}

Wong, Nelson Ka Ki ^{[1
]}

Pennati, Angelo ^{[1
]}

Saeed, Amir K. ^{[1
]}

Rodriguez, Benjamin M. ^{[1
]}

机构：

[1] Johns Hopkins Univ, Whiting Sch Engn, 3400 N Charles St, Baltimore, MD 21218 USA

来源：

SIGNAL PROCESSING, SENSOR/INFORMATION FUSION, AND TARGET RECOGNITION XXXII | 2023年 / 12547卷

关键词：

statistical signal processing; linear discriminant analysis; feature generation; feature ranking; transforms; classification;

D O I：

10.1117/12.2664987

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As the world progresses further into the digital era, we see a growing utility for combining datasets gathered on different devices and receivers as well as on varying time ranges, for use in machine learning. However, machine learning classification introduces a requirement for standardized data, which in turn hampers the ability to utilize diverse sets of data at a given timestamp. In this paper, we investigate the application of various signal pre-processing techniques (Daubecheis wavelet, discrete cosine and discrete fourier transform among others) for multi-modal, multi-class machine learning. Following the pre-processing, the multi-faceted signals are represented solely by features generated from first order statistics, eigen decomposition, and linear discriminant. Utilizing these generated features, as opposed to the signals themselves, these diverse datasets may now be combined as input to machine learning methods. Furthermore, we apply Fisher's linear discriminant ratio and Random Forest feature importance metrics for feature ranking and feature space reduction followed by a comparison of the approaches. Our work demonstrates that dissimilar datasets with common classes may be combined using the proposed methods with a classification accuracy >= 95%. This paper demonstrates that the feature space may be reduced by approximately 60% with <= 5% loss in classification accuracy, and in some cases, a slight increase in classification accuracy.

引用

页数：17

共 23 条

[1] DISCRETE COSINE TRANSFORM [J].

AHMED, N ;

NATARAJAN, T ;

RAO, KR .

IEEE TRANSACTIONS ON COMPUTERS, 1974, C 23 (01) :90-93

[2]

Ajayi GO, 2019, 2019 SOUTHERN AFRICAN UNIVERSITIES POWER ENGINEERING CONFERENCE/ROBOTICS AND MECHATRONICS/PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA (SAUPEC/ROBMECH/PRASA), P135, DOI [10.1109/robomech.2019.8704783, 10.1109/RoboMech.2019.8704783]

[3]

[Anonymous], 2018, Free spoken digit dataset (fsdd)

[4]

AstroDave W. C., 2012, Digit recognizer

[5] Multimodal Machine Learning: A Survey and Taxonomy [J].

Baltrusaitis, Tadas ;

Ahuja, Chaitanya ;

Morency, Louis-Philippe .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (02) :423-443

[6]

Baron Fourier J. B. J., 1878, The analytical theory of heat

[7]

Bracewell R., 1965, The Fourier Transform and its Applications

[8] Random forests [J].

Breiman, L .

MACHINE LEARNING, 2001, 45 (01) :5-32

[9]

Edla Damodar Reddy, 2018, Procedia Computer Science, V132, P1523, DOI 10.1016/j.procs.2018.05.116

[10]

Farid H, 2002, 2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, P905

← 1 2 3 →