Processing SPARQL queries over distributed RDF graphs

Peng Peng, Lei Zou, M. Tamer Özsu*, Lei Chen, Dongyan Zhao

*Corresponding author for this work

Research output: Contribution to journal, Journal Article, peer-review

Abstract

We propose techniques for processing SPARQL queries over a large RDF graph in a distributed environment. We adopt a “partial evaluation and assembly” framework. Answering a SPARQL query Q is equivalent to finding subgraph matches of the query graph Q over the RDF graph G. Based on properties of subgraph matching over a distributed graph, we introduce local partial matches as the partial answers in each fragment of the RDF graph G. For assembly, we propose two methods: centralized and distributed assembly. We analyze our algorithms both theoretically and experimentally. Extensive experiments over both real and benchmark RDF repositories of billions of triples confirm that our method is superior to state-of-the-art methods in both performance and scalability.
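To make the subgraph-matching view concrete, the following is a minimal Python sketch, not the authors' algorithm. It assumes a toy representation in which triples are plain 3-tuples of strings and query variables start with `?`. The names `bgp_matches`, `match_pattern`, `fragment_1`, and `fragment_2` are hypothetical and chosen only for illustration. The sketch shows that a match can span two fragments, so evaluating the query on either fragment alone finds nothing, which is the situation the partial evaluation and assembly framework is meant to address.

```python
from typing import Dict, List, Optional, Tuple

Triple = Tuple[str, str, str]
Binding = Dict[str, str]


def is_var(term: str) -> bool:
    """Toy convention (an assumption): query variables start with '?'."""
    return term.startswith("?")


def match_pattern(pattern: Triple, triple: Triple, binding: Binding) -> Optional[Binding]:
    """Extend `binding` so that `pattern` maps onto `triple`, or return None on conflict."""
    extended = dict(binding)
    for p, t in zip(pattern, triple):
        if is_var(p):
            # setdefault returns the existing value if the variable is already bound.
            if extended.setdefault(p, t) != t:
                return None
        elif p != t:
            return None
    return extended


def bgp_matches(patterns: List[Triple], triples: List[Triple]) -> List[Binding]:
    """Naive nested-loop matching of a list of triple patterns against a triple set."""
    results: List[Binding] = [{}]
    for pattern in patterns:
        results = [
            extended
            for binding in results
            for triple in triples
            if (extended := match_pattern(pattern, triple, binding)) is not None
        ]
    return results


# Hypothetical data: one RDF graph split into two fragments.
fragment_1: List[Triple] = [("alice", "knows", "bob")]
fragment_2: List[Triple] = [("bob", "worksAt", "acme")]

# Query graph: ?x knows ?y, and ?y worksAt ?z.
query: List[Triple] = [("?x", "knows", "?y"), ("?y", "worksAt", "?z")]

print(bgp_matches(query, fragment_1))               # no complete match in fragment 1
print(bgp_matches(query, fragment_2))               # no complete match in fragment 2
print(bgp_matches(query, fragment_1 + fragment_2))  # one match over the whole graph
```

In this toy setting the only match crosses the fragment boundary. The sketch simply re-runs the matcher on the union of the fragments; it does not compute or assemble local partial matches as the paper does.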

Original language: English
Pages (from-to): 243-268
Number of pages: 26
Journal: VLDB Journal
Volume: 25
Issue number: 2
Publication status: Published - 1 Apr 2016

Bibliographical note

Publisher Copyright:
© 2016, Springer-Verlag Berlin Heidelberg.

Keywords

  • Distributed queries
  • RDF
  • RDF graph
  • SPARQL
