Towards Efficient Query Processing on Massive Time-Evolving Graphs

Arash Fard; Amir Abdolrashidi; Lakshmish Ramaswamy; John Miller

International Workshop on Collaborative Big Data

Research Article

Towards Efficient Query Processing on Massive Time-Evolving Graphs

Download751 downloads

Cite: BibTeX Plain Text

@INPROCEEDINGS{10.4108/icst.collaboratecom.2012.250532,
    author={Arash Fard and Amir Abdolrashidi and Lakshmish Ramaswamy and John Miller},
    title={Towards Efficient Query Processing on Massive Time-Evolving Graphs},
    proceedings={International Workshop on Collaborative Big Data},
    publisher={IEEE},
    proceedings_a={C-BIG},
    year={2012},
    month={12},
    keywords={time-evolving graphs big-data partitioning reachability pattern matching},
    doi={10.4108/icst.collaboratecom.2012.250532}
}

Arash Fard
Amir Abdolrashidi
Lakshmish Ramaswamy
John Miller
Year: 2012
Towards Efficient Query Processing on Massive Time-Evolving Graphs
C-BIG
ICST
DOI: 10.4108/icst.collaboratecom.2012.250532

Arash Fard¹^,*, Amir Abdolrashidi¹, Lakshmish Ramaswamy¹, John Miller¹

1: UGA

*Contact email: ar@cs.uga.edu

Abstract

Time evolving graph (TEG) is increasingly being used as a paradigm for modeling and analyzing dynamic relationships in many emerging domains such as online social networks, World Wide Web and evolutionary genomics. A time-evolving graph consists of a sequence of snapshots of the graph as it evolves over time. The ability to scalably process various types of queries on massive TEGs is central to building powerful analytic applications for these domains. Unfortunately, indexing techniques and cluster computing schemes that have been designed for static graphs are not very effective for processing massive TEGs. Towards designing scalable mechanisms for answering TEG queries, this paper studies three important problems. The first is the distribution of TEG data on the nodes of a cluster computing framework such as Pregel or Giraph so that the computing and communication resources of the cluster are effectively harnessed. The second is the answering of reachability queries on any snapshot of a TEG and the third is that of processing pattern matching queries in TEGs. For each problem, we provide a brief literature survey and explain why trivial extensions of static graph techniques are not adequate for TEGs. We also present our preliminary ideas towards addressing these problems and discuss their benefits.

Keywords: time-evolving graphs big-data partitioning reachability pattern matching

Published: 2012-12-14
Publisher: IEEE

: http://dx.doi.org/10.4108/icst.collaboratecom.2012.250532