Identification and characterization of information-networks in long-tail data collections

被引:3
|
作者
Elag, Mostafa M. [1 ]
Kumar, Praveen [1 ]
Marini, Luigi [2 ]
Myers, James D. [3 ]
Hedstrom, Margaret [3 ]
Plale, Beth A. [4 ,5 ]
机构
[1] Univ Illinois, Dept Civil & Environm Engn, Ven Te Chow Hydrosyst Lab, Urbana, IL 61801 USA
[2] Univ Illinois, Natl Ctr Supercomp Applicat, Urbana, IL USA
[3] Univ Michigan, Sch Informat, Ann Arbor, MI 48109 USA
[4] Indiana Univ, Sch Informat, Bloomington, IN USA
[5] Indiana Univ, Data Insight Ctr, Bloomington, IN USA
基金
美国国家科学基金会;
关键词
Long-tail data; Information-networks; Linked-data; Cyberinfrastructure; Environmental data; Data-intensive science; LINK-PREDICTION; KNOWLEDGE; SCIENCE; MODELS;
D O I
10.1016/j.envsoft.2017.03.032
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Scientists' ability to synthesize and reuse long-tail scientific data lags far behind their ability to collect and produce these data. Many Earth Science Cyberinfrastructures enable sharing and publishing their data over the web using metadata standards. While profiling data attributes advances the Linked Data approach, it has become clear that building information-networks among distributed data silos is essential to increase their integration and reusability. In this research, we developed a Long-Tail Information-Network (LTIN) model, which uses a metadata-driven approach to build semantic information-networks among datasets published over the web and aggregate them around environmental events. The model identifies and characterizes the spatial and temporal contextual association links and dependencies among datasets. This paper presents the design and application of the LTIN model, and an evaluation of its performance. The model capabilities were demonstrated by inferring the information-network of a stream discharge located at the downstream end of the Illinois River. (c) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:100 / 111
页数:12
相关论文
共 45 条
  • [1] Boosting Long-Tail Data Classification with Sparse Prototypical Networks
    Figueroa, Alexei
    Papaioannou, Jens-Michalis
    Fallon, Conor
    Bekiaridou, Alexandra
    Bressem, Keno
    Zanos, Stavros
    Gers, Felix
    Nejdl, Wolfgang
    Loeser, Alexander
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT VII, ECML PKDD 2024, 2024, 14947 : 434 - 449
  • [2] Evolution of the "long-tail" concept for scientific data
    Stahlman, Gretchen R.
    Kouper, Inna
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2024,
  • [3] Integrating long-tail data: How far are we?
    Vanderbilt, Kristin
    Gries, Corinna
    ECOLOGICAL INFORMATICS, 2021, 64
  • [4] Long-tail buffer-content distributions in broadband networks
    Choudhury, GL
    Whitt, W
    PERFORMANCE EVALUATION, 1997, 30 (03) : 177 - 190
  • [5] Long-tail buffer-content distributions in broadband networks
    AT&T Lab, Holmdel, United States
    Perform Eval, 3 (177-190):
  • [6] NeuroBridge: a prototype platform for discovery of the long-tail neuroimaging data
    Wang, Lei
    Ambite, Jose Luis
    Appaji, Abhishek
    Bijsterbosch, Janine
    Dockes, Jerome
    Herrick, Rick
    Kogan, Alex
    Lander, Howard
    Marcus, Daniel
    Moore, Stephen M.
    Poline, Jean-Baptiste
    Rajasekar, Arcot
    Sahoo, Satya S.
    Turner, Matthew D.
    Wang, Xiaochen
    Wang, Yue
    Turner, Jessica A.
    FRONTIERS IN NEUROINFORMATICS, 2023, 17
  • [7] A long-tail inspired measure to assess resource use in information services
    Kostagiolas, Petros A.
    Korfiatis, Nikolaos
    Poulos, Marios
    LIBRARY & INFORMATION SCIENCE RESEARCH, 2012, 34 (04) : 317 - 323
  • [8] ADDRESSING DATA ACCESS NEEDS OF THE LONG-TAIL DISTRIBUTION OF GEOSCIENTISTS
    Malik, Tanu
    Foster, Ian
    2012 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2012, : 5348 - 5351
  • [9] FLRF: Federated recommendation optimization for long-tail data distribution
    Gong, Zaigang
    Chen, Siyu
    Dai, Qiangsheng
    Feng, Ying
    Zhang, Jinghui
    ARRAY, 2024, 24
  • [10] Cutting Long-Tail Latency of Routing Response in Software Defined Networks
    Xie, Junjie
    Guo, Deke
    Li, Xiaozhou
    Shen, Yulong
    Jiang, Xiaohong
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2018, 36 (03) : 384 - 396