Dimension Reduction of Multidimensional Structured and Unstructured Datasets through Ensemble Learning of Neural Embeddings

被引:1
作者
Alvarado-Perez, Juan Carlos [1 ,2 ]
Garcia, Miguel Angel [3 ]
Puig, Domenec [4 ]
机构
[1] Univ Rovira i Virgili, Dept Engn Informat & Matemat, Carrera 20A 14-54 Ctr, Pasto 52001, Colombia
[2] CESMAG Univ, Dept Engn, Carrera 20A 14-Ctr 54, Pasto 52001, Colombia
[3] Autonomous Univ Madrid, Dept Elect & Commun Technol, Francisco Tomas & Valiente 11, Madrid 28049, Spain
[4] Univ Rovira i Virgili, Dept Engn Informat & Matemat, Paisos Catalans 26, E-43007 Tarragona, Spain
关键词
cluster inductions; dimensionality reductions; ensemble learning; manifold approximations; online processing; topological preservations; unsupervised deep networks; INTELLIGENCE; GRAPH;
D O I
10.1002/aisy.202400178
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dimension reduction aims to project a high-dimensional dataset into a low-dimensional space. It tries to preserve the topological relationships among the original data points and/or induce clusters. NetDRm, an online dimensionality reduction method based on neural ensemble learning that integrates different dimension reduction methods in a synergistic way, is introduced. NetDRm is designed for datasets of multidimensional points that can be either structured (e.g., images) or unstructured (e.g., point clouds, tabular data). It starts by training a collection of deep residual encoders that learn the embeddings induced by multiple dimension reduction methods applied to the input dataset. Subsequently, a dense neural network integrates the generated encoders by emphasizing topological preservation or cluster induction. Experiments conducted on widely used multidimensional datasets (point-cloud manifolds, image datasets, tabular record datasets) show that the proposed method yields better results in terms of topological preservation (RNX$R_{\text{NX}}$ curves), cluster induction (V measure), and classification accuracy than the most relevant dimension reduction methods. Dimension reduction maps high-dimensional data into low-dimensional space, preserving topological relationships and inducing clusters. NetDRm is an online method based on neural ensemble learning that integrates various reduction methods. It uses deep encoders and a dense neural network to improve topological preservation, cluster induction, and classification accuracy, surpassing relevant methods.image (c) 2024 WILEY-VCH GmbH
引用
收藏
页数:24
相关论文
共 55 条
[1]   Dimensionality Reduction, Modelling, and Optimization of Multivariate Problems Based on Machine Learning [J].
Alswaitti, Mohammed ;
Siddique, Kamran ;
Jiang, Shulei ;
Alomoush, Waleed ;
Alrosan, Ayat .
SYMMETRY-BASEL, 2022, 14 (07)
[2]  
Amid E., PREPRINT
[3]  
[Anonymous], PREPRINT
[4]   Laplacian eigenmaps for dimensionality reduction and data representation [J].
Belkin, M ;
Niyogi, P .
NEURAL COMPUTATION, 2003, 15 (06) :1373-1396
[5]   SLISEMAP: supervised dimensionality reduction through local explanations [J].
Bjorklund, Anton ;
Makela, Jarmo ;
Puolamaki, Kai .
MACHINE LEARNING, 2023, 112 (01) :1-43
[6]  
Borg I., 2005, Modern multidimensional scaling: Theory and applications, DOI DOI 10.1007/0-387-28981-X
[7]   VECTOR VALUED REPRODUCING KERNEL HILBERT SPACES OF INTEGRABLE FUNCTIONS AND MERCER THEOREM [J].
Carmeli, Claudio ;
De Vito, Ernesto ;
Toigo, Alessandro .
ANALYSIS AND APPLICATIONS, 2006, 4 (04) :377-408
[8]   Performance evaluation of dimensionality reduction techniques on hyperspectral data for mineral exploration [J].
Deepa, C. ;
Shetty, Amba ;
Narasimhadhan, A., V .
EARTH SCIENCE INFORMATICS, 2023, 16 (01) :25-36
[9]   Classification and interaction in random forests [J].
Denisko, Danielle ;
Hoffman, Michael M. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (08) :1690-1692
[10]   Artificial Intelligence and Business Value: a Literature Review [J].
Enholm, Ida Merete ;
Papagiannidis, Emmanouil ;
Mikalef, Patrick ;
Krogstie, John .
INFORMATION SYSTEMS FRONTIERS, 2022, 24 (05) :1709-1734