About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
2nd International IEEE Conference on Communication System Software and Middleware

Research Article

EDRFS: An Effective Distributed Replication File System for Small-File and Data-Intensive Application

Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1109/COMSWA.2007.382422,
        author={Bin  Cai and Changsheng  Xie and Guangxi  Zhu},
        title={EDRFS: An Effective Distributed Replication File System for Small-File and Data-Intensive Application},
        proceedings={2nd International IEEE Conference on Communication System Software and Middleware},
        publisher={IEEE},
        proceedings_a={COMSWARE},
        year={2007},
        month={7},
        keywords={cluster storage systems  distributed systems  file systems  reliability  replication systemes  storage area network},
        doi={10.1109/COMSWA.2007.382422}
    }
    
  • Bin Cai
    Changsheng Xie
    Guangxi Zhu
    Year: 2007
    EDRFS: An Effective Distributed Replication File System for Small-File and Data-Intensive Application
    COMSWARE
    IEEE
    DOI: 10.1109/COMSWA.2007.382422
Bin Cai1,*, Changsheng Xie1,*, Guangxi Zhu2,*
  • 1: Department of Computer Science and Technology,Huazhong University of Science and Technology. Wuhan National Laboratory for Optoelectronics. Wuhan, P.R. China, 430074
  • 2: Department of Electronics and Information, Huazhong,University of Science and Technology. Wuhan National Laboratory for Optoelectronics.. Wuhan, P.R. China, 430074
*Contact email: hust_caibin@sohu._com, csxie@263.net, gxzhu@mail.hust.edu.cn

Abstract

With the system scale keeping grown, the key challenge is to mask the failures that arise among the system components and to improve the performance of data-intensive applications. This paper designs and implements a cluster-based distributed replication file system EDRFS to meet these critical demands. EDRFS works with a single metadata server and multiple storage nodes, deploys whole-file replication scheme at the file level, and tracks what storage node a file is replicated on. We use a linear hash algorithm to evenly distribute data and load across multiple storage nodes so as to achieve balancing workload and incremental scalability of throughput and storage capacity as the system scale grows. In addition, we employ metadata caches and file data caches in clients to enhance system performance. Furthermore, we deploy a concurrency lock scheme to avoid namespace operation bottleneck and a replicas consistency method to keep a consistent mutation order among replicas of a file. We provide the initial experimental evaluations of our prototypical system on a small-file and data-intensive workload.

Keywords
cluster storage systems distributed systems file systems reliability replication systemes storage area network
Published
2007-07-09
Publisher
IEEE
Modified
2011-07-26
http://dx.doi.org/10.1109/COMSWA.2007.382422
Copyright © 2007–2025 IEEE
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL