Directly to content
  1. Publishing |
  2. Search |
  3. Browse |
  4. Recent items rss |
  5. Open Access |
  6. Jur. Issues |
  7. DeutschClear Cookie - decide language by browser settings

Directory-Based Metadata Optimizations for Small Files in PVFS

Kuhn, Michael

[thumbnail of Thesis.pdf]
Preview
PDF, English
Download (825kB) | Terms of use

Citation of documents: Please do not cite the URL that is displayed in your browser location input, instead use the DOI, URN or the persistent URL below, as we can guarantee their long-time accessibility.

Abstract

In today's file systems each file is made up of data and metadata. The metadata contains some information about the associated data, like ownership and permissions of the file. While this usually is useful, there are situations when the additional overhead of such a design becomes a problem in terms of performance. This is especially true for cluster file systems, because due to their design every metadata operation is even more expensive. If a user creates several thousand temporary files that are going to be deleted soon anyway, it is not necessary to store detailed information about them. In this thesis several changes are made to the parallel cluster file system PVFS to better deal with such cases. To do this, PVFS is altered such that certain unnecessary metadata is discarded and therefore metadata performance is increased. Several tests with a large quantity of files are done to measure the benefits of these changes. The reduction of the metadata overhead halves the time needed for some common file system operations. The speedup of those operations is also analyzed in detail by visualizing the internal workflow of PVFS. Also, additional work that could be done to further increase the metadata performance as well as possible additions to the actual implementation are presented.

Document type: Bachelor thesis
Date Deposited: 25 Feb 2009 09:23
Date: 2007
Faculties / Institutes: The Faculty of Mathematics and Computer Science > Department of Computer Science
DDC-classification: 004 Data processing Computer science
Controlled Keywords: Dateisystem
Uncontrolled Keywords: PVFS , Paralleles Dateisystem , MetadatenPVFS , Parallel file system , Metadata
About | FAQ | Contact | Imprint |
OA-LogoDINI certificate 2013Logo der Open-Archives-Initiative