About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
Artificial Intelligence for Communications and Networks. Third EAI International Conference, AICON 2021, Xining, China, October 23–24, 2021, Proceedings, Part I

Research Article

Dynamic Data Storage and Management Strategies for Distributed File System

Download(Requires a free EAI acccount)
7 downloads
Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1007/978-3-030-90196-7_50,
        author={Feng Liu and Di Lin and Yao Qin and Yuan Gao and Jiang Cao},
        title={Dynamic Data Storage and Management Strategies for Distributed File System},
        proceedings={Artificial Intelligence for Communications and Networks. Third EAI International Conference, AICON 2021, Xining, China, October 23--24, 2021, Proceedings, Part I},
        proceedings_a={AICON},
        year={2021},
        month={11},
        keywords={HDFS File heat value ORMP LSTM},
        doi={10.1007/978-3-030-90196-7_50}
    }
    
  • Feng Liu
    Di Lin
    Yao Qin
    Yuan Gao
    Jiang Cao
    Year: 2021
    Dynamic Data Storage and Management Strategies for Distributed File System
    AICON
    Springer
    DOI: 10.1007/978-3-030-90196-7_50
Feng Liu1,*, Di Lin1, Yao Qin1, Yuan Gao, Jiang Cao
  • 1: University of Electronic Science and Technology of China
*Contact email: 201921090328@std.uestc.edu.cn

Abstract

HDFS has a very wide range of applications in the field of big data, but HDFS was designed for a homogeneous environment at the beginning. HDFS adopts a static replica management strategy, the storage location and number of file replicas will not change after determination. This strategy will low overall system performance. In this paper, we propose optimized replica management strategy, abbreviated as ORMP to fix this problem. ORMP is based on file heat value and LSTM. File heat value is proposed to evaluate the activity of files. LSTM is used to predict the access times of files. Based on LSTM, the file heat value can be updated regularly, so we can dynamically change the storage location and number of replicas. Experiments show that ORMP is 22.08% faster in reading speed compared with the default replicas management strategy.

Keywords
HDFS File heat value ORMP LSTM
Published
2021-11-03
Appears in
SpringerLink
http://dx.doi.org/10.1007/978-3-030-90196-7_50
Copyright © 2021–2025 ICST
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL