Improving the storage efficiency of small files in cloud storage

Research output: Contribution to journalArticlepeer-review

9 Scopus citations

Abstract

An approach based on SequenceFile is proposed to improve storage efficiency of small files in the cloud storage systems that are on the basis of Hadoop distributed file system(HDFS). The approach uses the multi-attribute decision theory and the indices such as reading time, combining time, and saved memory size to obtain an optimal file merging scheme, so that the balance between computing time and memory space is achieved. A system load forecast algorithm is designed based on the analytic hierarchy process to predict the load of the system. SequenceFile is used to combine small files. Experimental results show that, without degrading the performance of storage system, the storage efficiency of small files is improved.

Original languageEnglish
Pages (from-to)59-63
Number of pages5
JournalHsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University
Volume45
Issue number6
StatePublished - Jun 2011

Keywords

  • Cloud storage
  • Load forecasting
  • Small file
  • Storage efficiency

Fingerprint

Dive into the research topics of 'Improving the storage efficiency of small files in cloud storage'. Together they form a unique fingerprint.

Cite this