g-Sum: A Graph Summarization Approach for a Single Large Social Network

Saif Rehman; Asif Nawaz; Tariq Ali; Naveed Amin

sis 21(32): e2

Research Article

g-Sum: A Graph Summarization Approach for a Single Large Social Network

Download1452 downloads

Cite: BibTeX Plain Text

@ARTICLE{10.4108/eai.23-3-2021.169073,
    author={Saif ur Rehman and Asif Nawaz and Tariq Ali and Naveed Amin},
    title={g-Sum: A Graph Summarization Approach for a Single Large Social Network},
    journal={EAI Endorsed Transactions on Scalable Information Systems},
    volume={8},
    number={32},
    publisher={EAI},
    journal_a={SIS},
    year={2021},
    month={3},
    keywords={Social Network, Social Graph, Graph Data, Graph Mining, Graph Summarization, Graph Partition, Reconstruction Error, Big Graph},
    doi={10.4108/eai.23-3-2021.169073}
}

Saif ur Rehman
Asif Nawaz
Tariq Ali
Naveed Amin
Year: 2021
g-Sum: A Graph Summarization Approach for a Single Large Social Network
SIS
EAI
DOI: 10.4108/eai.23-3-2021.169073

Saif ur Rehman¹^,*, Asif Nawaz¹, Tariq Ali¹, Naveed Amin¹

1: University Institute of Information Technology, PMAS Arid Agriculture University, Rawalpindi, Pakistan

*Contact email: saifi.ur.rehman@gmail.com

Abstract

With the advances in computing resources, the processing of huge amount of data becomes possible. However, the human ability to identify patterns from such data has not scaled accordingly. Consequently, efficient computational approaches for condensing and simplifying data are becoming essential for uncovering actionable insights, especially the big social networks. Many large datasets analyzed today are represented with the help of graphs. In many real-world applications, summarizing large graphs is beneficial to achieve a number of advantages, including 1) significant speedup for graph algorithms, 2) graph storage space reduction, 3) faster network transmission, 4) improved data privacy, 5) more effective graph visualization, etc. All these benefits can be obtained from the summarized graph, if it is summarized accurately. The quality of the summary graph is mostly measured using the Reconstruction Error (RE). In the existing literature, RE is a very challenging task. Most of the approaches investigated so farending up with high value of RE. Hence, the summarized graph does not express the original graph completely due to the high value of the RE. Therefore, in this work we have examined how can we summarize a big graph structure into a compact summary by reducing its RE and ensuring its usefulness. For this task, we have proposed a novel graph summarization approach, called g-Sum, to calculate the graph structure summaries while minimizing RE and compare to the existing work in the domain. The functionality of the g-Sum is decomposed into three different, but interlinked phases; (1)- graph dataset pre-processing; (2)- graph partitioning; and (3)- graph summarization. We have performed an extensive series of experiments on different real-world social graph dataset, including Ego-Facebook, Enron Email and Web-Stanford graph datasets. The computed results show the effectiveness of the g-Sum and also compared with the state-of-the-art graph summarization approaches, S2L and GraSS.

Keywords: Social Network, Social Graph, Graph Data, Graph Mining, Graph Summarization, Graph Partition, Reconstruction Error, Big Graph

Received: 2020-11-12
Accepted: 2021-03-04
Published: 2021-03-23
Publisher: EAI

: http://dx.doi.org/10.4108/eai.23-3-2021.169073

Copyright © 2021 Saif ur Rehman et al., licensed to EAI. This is an open access article distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/3.0/), which permits unlimited use, distribution and reproduction in any medium so long as the original work is properly cited.

g-Sum: A Graph Summarization Approach for a Single Large Social Network

Abstract

About EAI

Community

Publish with EAI