Clustering-Based Predictive Analytics to Improve Scientific Data Discovery

Authors
Devarakonda, Ranjeet [1 ]
Kumar, Jitendra [1 ]
Prakash, Giri [1 ]
Affiliations
[1] Oak Ridge National Laboratory, Environmental Sciences Division, Oak Ridge, TN 37830, USA
Source
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2020
Keywords
clustering; content-based filtering; collaborative filtering; data recommended system; data discovery;
DOI
10.1109/BigData50022.2020.9377797
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Given the sheer volume of scientific data archived within the data-intensive projects at the US Department of Energy's Oak Ridge National Laboratory, finding precisely what data we are looking for may not be a trivial task; conversely, we may also miss a more prominent data product. To address such issues, we propose improving the data discovery system and using data analytics methods to comprehend what specific users might be interested in based on their physiological state, search patterns, and past data usage history. This work's primary goal is to prune the complexity, increase the visibility of popular data products, and direct users toward the data that best meet their needs. The proposed algorithm constructs a user profile based on the user's explicit or implicit interactions with the system, such as items they are currently looking at on-site and the key metadata mappings related to the data set. The pattern is then used to build a training data set, which will help find relevant data to recommend to the user.
Pages: 5658-5661
Page count: 4
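
The abstract describes building a user profile from explicit or implicit interactions and from the metadata of the data sets involved, then using it to find data to recommend. The following is a minimal sketch of that idea as plain content-based filtering, not the authors' algorithm: the catalog, keyword sets, interaction weights, and function names are all hypothetical, and the clustering step named in the title is omitted.

```python
from collections import Counter
from math import sqrt

# Hypothetical catalog: data set id -> metadata keywords (illustrative only).
CATALOG = {
    "ds_precip": {"precipitation", "radar", "atmosphere"},
    "ds_cloud": {"cloud", "radar", "atmosphere"},
    "ds_soil": {"soil", "carbon", "terrestrial"},
    "ds_flux": {"carbon", "flux", "terrestrial"},
}

# Assumed weights: a download (explicit) counts more than a page view (implicit).
WEIGHTS = {"view": 1.0, "download": 3.0}


def build_profile(interactions):
    """Sum keyword weights over the data sets a user interacted with."""
    profile = Counter()
    for dataset_id, action in interactions:
        for keyword in CATALOG[dataset_id]:
            profile[keyword] += WEIGHTS[action]
    return profile


def cosine(profile, keywords):
    """Cosine similarity between a weighted profile and a binary keyword set."""
    dot = sum(profile[k] for k in keywords)
    norm = sqrt(sum(v * v for v in profile.values())) * sqrt(len(keywords))
    return dot / norm if norm else 0.0


def recommend(interactions, top_n=2):
    """Rank data sets the user has not touched by similarity to the profile."""
    profile = build_profile(interactions)
    seen = {dataset_id for dataset_id, _ in interactions}
    scored = [(cosine(profile, kw), ds) for ds, kw in CATALOG.items() if ds not in seen]
    # Highest score first; ties broken alphabetically for a stable order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [ds for _, ds in scored[:top_n]]
```

Calling `recommend([("ds_precip", "download")])` ranks `ds_cloud` first, because it shares two metadata keywords with the downloaded data set while the other candidates share none. A real system would likely draw the keyword sets from the archive's metadata records and tune the interaction weights empirically.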