Change-Aware Scheduling for Effectively Updating Linked Open Data Caches

Cited by: 3
Authors
Akhtar, Usman [1]
Razzaq, Muhammad Asif [1]
Rehman, Ubaid Ur [1]
Amin, Muhammad Bilal [1]
Khan, Wajahat Ali [1]
Huh, Eui-Nam [1]
Lee, Sungyoung [1]
Affiliation
[1] Kyung Hee Univ, Dept Comp Sci & Engn, Yongin 446701, South Korea
Source
IEEE ACCESS | 2018 / Vol. 6
Funding
National Research Foundation of Singapore;
Keywords
Linked open data; change propagation; evolving web data; RDF crawling; cache storage; SEMANTIC WEB; MODEL;
DOI
10.1109/ACCESS.2018.2871511
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
The linked open data (LOD) cloud is a global information space with a wealth of structured facts that are useful in a wide range of usage scenarios. The LOD cloud handles a large number of requests from applications consuming the data. However, the performance of retrieving data from LOD repositories is one of the major challenges. To overcome this challenge, we argue that it is advantageous to maintain a local cache for efficient querying and processing. Due to the continuous evolution of the LOD cloud, local copies become outdated. To make the best use of available resources, improved scheduling is required to maintain the freshness of the local data cache. In this paper, we propose an approach to efficiently capture the changes and update the cache. Our proposed approach, called application-aware change prioritization (AACP), consists of a change metric that quantifies the changes in LOD and a weight function that assigns importance to recent changes. We also propose an update-policy mechanism, called preference-aware source update (PASU), which incorporates the previous estimation of changes and establishes when the local data cache needs to be updated. In the experimental evaluation, several state-of-the-art strategies are compared against the proposed approach. The performance of each policy is measured by computing the precision and recall between the local data cache updated using the policy under consideration and the data source, which serves as the ground truth. Both the single-update and iterative-update cases are evaluated in this study. The proposed approach is reported to outperform all the other policies, achieving an F1-score of 88% and an effectivity of 93.5%.
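The evaluation measure described in the abstract can be illustrated with a minimal sketch. It assumes that the local cache and the data source are each represented as a set of RDF triples, and that precision and recall are computed over exact triple matches; the function name `cache_quality` and the sample triples are hypothetical and are not taken from the paper.

```python
from typing import Set, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object); assumed representation


def cache_quality(cache: Set[Triple], source: Set[Triple]) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) of a local cache against the source.

    The source is treated as the ground truth. A cached triple counts as
    correct only if the identical triple is present in the source.
    """
    correct = len(cache & source)
    precision = correct / len(cache) if cache else 0.0
    recall = correct / len(source) if source else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical example: the cache holds one stale triple and misses two new ones.
source = {
    ("ex:a", "ex:p", "1"),
    ("ex:b", "ex:p", "2"),
    ("ex:c", "ex:p", "3"),
    ("ex:e", "ex:p", "5"),
    ("ex:f", "ex:p", "6"),
}
cache = {
    ("ex:a", "ex:p", "1"),
    ("ex:b", "ex:p", "2"),
    ("ex:c", "ex:p", "3"),
    ("ex:d", "ex:p", "4"),  # stale: no longer present in the source
}
precision, recall, f1 = cache_quality(cache, source)
```

With these sample sets, three of the four cached triples are still valid (precision 0.75) and three of the five source triples are cached (recall 0.6). How the paper matches triples and how it defines effectivity are not stated in the abstract, so neither is modeled here.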
Pages: 65862-65873
Number of pages: 12