High-performance, Distributed Dictionary Encoding of RDF Datasets

被引:1
作者
Morari, Alessandro [1 ]
Weaver, Jesse [1 ]
Villa, Oreste [2 ]
Haglin, David [1 ]
Tumeo, Antonino [1 ]
Castellana, Vito Giovanni [1 ]
Feo, John [1 ]
机构
[1] Pacific NW Natl Lab, Richland, WA 99354 USA
[2] NVIDIA Res, Santa Clara, CA 95051 USA
来源
2015 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING - CLUSTER 2015 | 2015年
关键词
D O I
10.1109/CLUSTER.2015.44
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this work we propose a novel approach for RDF (Resource Description Framework) dictionary encoding that employs a parallel RDF parser and a distributed dictionary data structure, exploiting RDF-specific optimizations. In contrast with previous solutions, this approach exploits the Partitioned Global Address Space (PGAS) programming model combined with active messages. We evaluate the performance of our dictionary encoder in our RDF database, GEMS (Graph Engine for Multithreaded Systems), and provide an empirical comparison against previous approaches. Our comparison shows that our dictionary encoder scales significantly better and achieves higher performance than the current state of the art, providing a key element for the realization of a more efficient RDF database.
引用
收藏
页码:250 / 253
页数:4
相关论文
共 12 条
  • [1] [Anonymous], W3C Recommendation
  • [2] DBpedia: A nucleus for a web of open data
    Auer, Soeren
    Bizer, Christian
    Kobilarov, Georgi
    Lehmann, Jens
    Cyganiak, Richard
    Ives, Zachary
    [J]. SEMANTIC WEB, PROCEEDINGS, 2007, 4825 : 722 - +
  • [3] The Universal Protein Resource (UniProt)
    Bairoch, Amos
    Bougueleret, Lydie
    Altairac, Severine
    Amendolia, Valeria
    Auchincloss, Andrea
    Puy, Ghislaine Argoud
    Axelsen, Kristian
    Baratin, Delphine
    Blatter, Marie-Claude
    Boeckmann, Brigitte
    Bollondi, Laurent
    Boutet, Emmanuel
    Quintaje, Silvia Braconi
    Breuza, Lionel
    Bridge, Alan
    Saux, Virginie Bulliard-Le
    decastro, Edouard
    Ciampina, Luciane
    Coral, Danielle
    Coudert, Elisabeth
    Cusin, Isabelle
    David, Fabrice
    Delbard, Gwennaelle
    Dornevil, Dolnide
    Duek-Roggli, Paula
    Duvaud, Severine
    Estreicher, Anne
    Famiglietti, Livia
    Farriol-Mathis, Nathalie
    Ferro, Serenella
    Feuermann, Marc
    Gasteiger, Elisabeth
    Gateau, Alain
    Gehant, Sebastian
    Gerritsen, Vivienne
    Gos, Arnaud
    Gruaz-Gumowski, Nadine
    Hinz, Ursula
    Hulo, Chantal
    Hulo, Nicolas
    Innocenti, Alessandro
    James, Janet
    Jain, Eric
    Jimenez, Silvia
    Jungo, Florence
    Junker, Vivien
    Keller, Guillaume
    Lachaize, Corinne
    Lane-Guermonprez, Lydie
    Langendijk-Genevaux, Petra
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : D190 - D195
  • [4] Beckett D., 2014, RDF 1 1 N TRIPLES
  • [5] Boncz P., 2013, BSBM V3 1
  • [6] Ding L, 2006, LECT NOTES COMPUT SC, V4273, P242
  • [7] LUBM: A benchmark for OWL knowledge base systems
    Guo, YB
    Pan, ZX
    Heflin, J
    [J]. JOURNAL OF WEB SEMANTICS, 2005, 3 (2-3): : 158 - 182
  • [8] Scaling Irregular Applications through Data Aggregation and Software Multithreading
    Morani, Alessandro
    Tumeo, Antonino
    Chavarria-Miranda, Daniel
    Villa, Oreste
    Valero, Mateo
    [J]. 2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,
  • [9] SCALING SEMANTIC GRAPH DATABASES IN SIZE AND PERFORMANCE
    Morari, Alessandro
    Castellana, Vito Giovanni
    Villa, Oreste
    Tumeo, Antonino
    Weaver, Jesse
    Haglin, David
    Choudhury, Sutanay
    Feo, John
    [J]. IEEE MICRO, 2014, 34 (04) : 16 - 26
  • [10] Scalable RDF data compression with MapReduce
    Urbani, Jacopo
    Maassen, Jason
    Drost, Niels
    Seinstra, Frank
    Bal, Henri
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2013, 25 (01) : 24 - 39