Fast multidimensional scaling on big geospatial data using neural networks

被引:2
作者
Mademlis, Ioannis [1 ]
Voulgaris, Georgios [1 ]
Pitas, Ioannis [1 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, AUTH Campus, GR-54124 Thessaloniki, Greece
基金
欧盟地平线“2020”;
关键词
Multidimensional scaling; Approximate MDS; Incremental MDS; Big data; Multilayer perceptron; Geospatial mapping; DIMENSIONALITY;
D O I
10.1007/s12145-023-01004-9
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper presents a fast approximation method for Multidimensional Scaling (MDS)-based dimensionality reduction on large cartography datasets. Since MDS preserves data point distances, it is useful in application domains where geolocation data are critical. Typical relevant tasks include smartphone user behavioral pattern extraction, animal motion tracking over long distances, or distributed sensor data monitoring. The input to MDS is a data distance matrix employed for reducing data point dimensionality under distance constraints. Similar procedures are crucial for analyzing and revealing the original hidden data structure, as well as for data visualization, feature extraction, or compression. For N data points, MDS has a computational complexity that exceeds O(N-2), e.g., for several hundred thousands or millions of data points. The proposed method allows fast approximate MDS calculation on million-point datasets in less than a minute on a simple laptop, by sampling a small subset of the original dataset, performing regular MDS on it and training a neural regressor to learn the desired MDS mapping. Quantitative and qualitative empirical evaluation of the proposed fast MLP-MDS algorithm on a geospatial data mapping task, i.e., on reducing 3D Earth surface points (longitude, latitude, radius) to 2D maps, has resulted in promising findings and small approximation errors. The benefits are even greater in incremental settings, where new data points are obtained and projected over time. Unlike regular MDS or competing approximations, this is trivially supported in MLP-MDS due to the latter's model-based nature.
引用
收藏
页码:2241 / 2249
页数:9
相关论文
共 26 条
  • [1] [Anonymous], 2004, Sparse multidimensional scaling using land-mark points
  • [2] Borg Ingwer, 2005, Springer Series in Statistics, DOI DOI 10.18637/JSS.V014.B04
  • [3] Brummelen G., 2012, Heavenly Mathematics: The Forgotten Art of Spherical Trigonometry, V1st
  • [4] From local scenarios to national maps: a participatory framework for envisioning the future of Tanzania
    Capitani, Claudia
    Mukama, Kusaga
    Mbilinyi, Boniface
    Malugu, Isaac O.
    Munishi, Pantaleo K. T.
    Burgess, Neil D.
    Platts, Philip J.
    Sallu, Susannah M.
    Marchant, Robert
    [J]. ECOLOGY AND SOCIETY, 2016, 21 (03):
  • [5] Deep learning multidimensional projections
    Espadoto, Mateus
    Tomita Hirata, Nina Sumiko
    Telea, Alexandru C.
    [J]. INFORMATION VISUALIZATION, 2020, 19 (03) : 247 - 269
  • [6] Reducing the dimensionality of data with neural networks
    Hinton, G. E.
    Salakhutdinov, R. R.
    [J]. SCIENCE, 2006, 313 (5786) : 504 - 507
  • [7] Local Affine Multidimensional Projection
    Joia, Paulo
    Paulovich, Fernando V.
    Coimbra, Danilo
    Cuminato, Jose Alberto
    Nonato, Luis Gustavo
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2011, 17 (12) : 2563 - 2571
  • [8] Jong Youl Choi, 2010, Proceedings 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing (CCGrid), P331, DOI 10.1109/CCGRID.2010.104
  • [9] Kakaletsis E., 2021, SIG PROCESS IMAGE CO, V99, P116
  • [10] Kakaletsis E, 2021, P EUR SIGN PROC C EU