Boosting Climate Analysis With Semantically Uplifted Knowledge Graphs

被引:11
作者
Wu, Jiantao [1 ]
Orlandi, Fabrizio [2 ]
O'Sullivan, Declan [2 ]
Pisoni, Enrico [3 ]
Dev, Soumyabrata [1 ]
机构
[1] Univ Coll Dublin, ADAPT SFI Res Ctr, Sch Comp Sci, Dublin 4, Ireland
[2] Trinity Coll Dublin, ADAPT SFI Res Ctr, Sch Comp Sci & Stat, Dublin 2, Ireland
[3] European Commiss Joint Res Ctr, I-21027 Ispra, Italy
基金
欧盟地平线“2020”; 爱尔兰科学基金会;
关键词
Meteorology; Machine learning; Resource description framework; Ontologies; Task analysis; Semantics; Linked data; Climate data; knowledge graphs (KGs); linked data; machine learning; semantic webs; CLASSIFICATION; ENHANCEMENT;
D O I
10.1109/JSTARS.2022.3177463
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Nowadays, the fast expansion of heterogeneous climate data resources accessible on the Internet has led to substantial data fragmentation on the web. For example, station-based sensor data from different sources are likely to be interrelated but may be stored in disparate formats, such as CSV, JSON, and XML. To address the data isolation problem, several semantically uplifted knowledge graphs are proposed for climate data exchange. While these knowledge graphs improve data interoperability, the advancement in multisource data interchange is limited to data included inside knowledge graphs. As a result, the exclusive interoperability of current climatic knowledge graphs hampers the flow of data into typical climate analysis workflows in contexts, where analytical models often need data in nonknowledge graph formats. This article addresses this issue by focusing on enhancing climate analysis workflows within the context of the Python machine learning environment, with a preference for tabular data. We propose an analysis workflow able to automatically integrate remote climate knowledge graph data with local tabular data so as to enhance the data usability with respect to certain climate analysis tasks. To underscore the importance of our study, we illustrate how the workflow streamlines the access to multisource climatic variables in the Python environment from a semantic perspective. The additional knowledge graph data have the potential to augment local datasets in the climate domain, as evidenced by an improvement in accuracy of up to 10% for machine learning geared on rainfall detection.
引用
收藏
页码:4708 / 4718
页数:11
相关论文
共 24 条
  • [1] [Anonymous], 2018, P 2018 C N AM CHAPTE
  • [2] Knowledge Graph Semantic Enhancement of Input Data for Improving AI
    Bhatt, Shreyansh
    Sheth, Amit
    Shalin, Valerie
    Zhao, Jinjin
    [J]. IEEE INTERNET COMPUTING, 2020, 24 (02) : 66 - 72
  • [3] Linked Data - The Story So Far
    Bizer, Christian
    Heath, Tom
    Berners-Lee, Tim
    [J]. INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2009, 5 (03) : 1 - 22
  • [4] scikit-learn Pipelines Meet Knowledge Graphs The Python']Python kgextension Package
    Bucher, Tabea-Clara
    Jiang, Xuehui
    Meyer, Ole
    Waitz, Stephan
    Hertling, Sven
    Paulheim, Heiko
    [J]. SEMANTIC WEB: ESWC 2021 SATELLITE EVENTS, 2021, 12739 : 9 - 14
  • [5] Federating queries in SPARQL 1.1: Syntax, semantics and evaluation
    Buil-Aranda, Carlos
    Arenas, Marcelo
    Corcho, Oscar
    Polleres, Axel
    [J]. JOURNAL OF WEB SEMANTICS, 2013, 18 (01): : 1 - 17
  • [6] Remote Sensing Image Scene Classification Meets Deep Learning: Challenges, Methods, Benchmarks, and Opportunities
    Cheng, Gong
    Xie, Xingxing
    Han, Junwei
    Guo, Lei
    Xia, Gui-Song
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 3735 - 3756
  • [7] Evaluating the quality of the LOD cloud: An empirical investigation
    Debattista, Jeremy
    Lange, Christoph
    Auer, Soeren
    Cortis, Dominic
    [J]. SEMANTIC WEB, 2018, 9 (06) : 859 - 901
  • [8] Harnessing Deep Neural Networks with Logic Rules
    Hu, Zhiting
    Ma, Xuezhe
    Liu, Zhengzhong
    Hovy, Eduard
    Xing, Eric P.
    [J]. PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 2410 - 2420
  • [9] Jiantao Wu, 2021, 2021 12th International Conference on Computing Communication and Networking Technologies (ICCCNT), P104, DOI 10.23919/USNC-URSI51813.2021.9703648
  • [10] Jiantao Wu, 2021, 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, P5739, DOI 10.1109/IGARSS47720.2021.9553547