Effectively using unsupervised machine learning in next generation astronomical surveys

被引:10
|
作者
Reis, I. [1 ]
Rotman, M. [2 ]
Poznanski, D. [1 ]
Prochaska, J. X. [3 ]
Wolf, L. [2 ,4 ]
机构
[1] Tel Aviv Univ, Sch Phys & Astron, IL-69978 Tel Aviv, Israel
[2] Tel Aviv Univ, Sch Comp Sci, IL-69978 Tel Aviv, Israel
[3] Univ Calif Santa Cruz, CO Lick Observ, 156 High St, Santa Cruz, CA 95064 USA
[4] Facebook AI Res, Tel Aviv, Israel
关键词
D O I
10.1016/j.ascom.2020.100437
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
In recent years many works have shown that unsupervised Machine Learning (ML) can help detect unusual objects and uncover trends in large astronomical datasets, but a few challenges remain. We show here, for example, that different methods, or even small variations of the same method, can produce significantly different outcomes. While intuitively somewhat surprising, this can naturally occur when applying unsupervised ML to highly dimensional data, where there can be many reasonable yet different answers to the same question. In such a case the outcome of any single unsupervised ML method should be considered a sample from a conceivably wide range of possibilities. We therefore suggest an approach that eschews finding an optimal outcome, instead facilitating the production and examination of many valid ones. This can be achieved by incorporating unsupervised ML into data visualization portals. We present here such a portal that we are developing, applied to the sample of SDSS spectra of galaxies. The main feature of the portal is interactive 2D maps of the data. Different maps are constructed by applying dimensionality reduction to different subspaces of the data, so that each map contains different information that in turn gives a different perspective on the data. The interactive maps are intuitive to use, and we demonstrate how peculiar objects and trends can be detected by means of a few button clicks. We believe that including tools in this spirit in next generation astronomical surveys will be important for making unexpected discoveries, either by professional astronomers or by citizen scientists, and will generally enable the benefits of visual inspection even when dealing with very complex and extensive datasets. Our portal is available online at galaxyportal.space. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Galaxy morphological classification in deep-wide surveys via unsupervised machine learning
    Martin, G.
    Kaviraj, S.
    Hocking, A.
    Read, S. C.
    Geach, J. E.
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2020, 491 (01) : 1408 - 1426
  • [32] MEMS for the next generation of giant astronomical telescopes
    Gavel, Donald
    MEMS/MOEMS COMPONENTS AND THEIR APPLICATIONS III, 2006, 6113
  • [33] Unsupervised and semi-supervised learning: the next frontier in machine learning for plant systems biology
    Yan, Jun
    Wang, Xiangfeng
    PLANT JOURNAL, 2022, 111 (06): : 1527 - 1538
  • [34] Cheetah: A fast unsupervised learning technique to provision next generation network services
    Lahlou, Laaziz
    Kara, Nadjia
    Arouch, Mohssine
    Edstrom, Claes
    PROCEEDINGS OF THE 2020 6TH INTERNATIONAL WORKSHOP ON CONTAINER TECHNOLOGIES AND CONTAINER CLOUDS (WOC '20), 2020, : 19 - 24
  • [35] IoT Device Identification Using Unsupervised Machine Learning
    Koball, Carson
    Rimal, Bhaskar P.
    Wang, Yong
    Salmen, Tyler
    Ford, Connor
    INFORMATION, 2023, 14 (06)
  • [36] Keratoconus severity identification using unsupervised machine learning
    Yousefi, Siamak
    Yousefi, Ebrahim
    Takahashi, Hidenori
    Hayashi, Takahiko
    Tampo, Hironobu
    Inoda, Satoru
    Arai, Yusuke
    Asbell, Penny
    PLOS ONE, 2018, 13 (11):
  • [37] Ranking online retailers using unsupervised machine learning
    Sharma, Himanshu
    Anubha, Anubha
    OPSEARCH, 2024,
  • [38] Classifying the clouds of Venus using unsupervised machine learning
    Mittendorf, J.
    Molaverdikhani, K.
    Ercolano, B.
    Giovagnoli, A.
    Grassi, T.
    ASTRONOMY AND COMPUTING, 2024, 49
  • [39] Clustering Seismocardiographic Events using Unsupervised Machine Learning
    Gamage, Peshala T.
    Azad, Md Khurshidul.
    Taebi, Amirtaha
    Sandler, Richard H.
    Mansy, Hansen A.
    2018 IEEE SIGNAL PROCESSING IN MEDICINE AND BIOLOGY SYMPOSIUM (SPMB), 2018,
  • [40] Issues for the next generation of galaxy surveys
    Peebles, PJE
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 1999, 357 (1750): : 21 - 34