1st International ICST Conference on Scalable Information Systems

Research Article

Scalable p2p based RDF querying

  • @INPROCEEDINGS{10.1145/1146847.1146864,
        author={Felix  Heine},
        title={Scalable p2p based RDF querying},
        proceedings={1st International ICST Conference on Scalable Information Systems},
        publisher={ACM},
        proceedings_a={INFOSCALE},
        year={2006},
        month={6},
        keywords={},
        doi={10.1145/1146847.1146864}
    }
    
  • Felix Heine
    Year: 2006
    Scalable p2p based RDF querying
    INFOSCALE
    ACM
    DOI: 10.1145/1146847.1146864
Felix Heine1,*
  • 1: Paderborn Center for Parallel Computing, Paderborn University, Germany
*Contact email: fh@upb.de

Abstract

In large-scale distributed systems, information is typically generated decentralized. However, for many applications it is desirable to have a unified view on this knowledge, allowing to query it without regarding the heterogeneity of the underlying systems. In this context, two main requirements have to be fulfilled. On the one hand, we need a flexible knowledge representation, and on the other hand the underlying infrastructure and query evaluation algorithm has to be highly scalable.The combination of p2p networks as basic infrastructure with RDF as a knowledge representation is a promising approach to this problem. Within this paper, we focus on the evaluation of RDF queries with respect to RDF data stored in a DHT-based p2p network. We propose a query algorithm and research different optimizations based on a look-ahead technique and Bloom filters which aim at maximizing the throughput and scalability of the entire system.