File Size Distribution on UNIX Systems - Then and Now

A.S. Tanenbaum, J.N. Herder, H.J. Bos

Research output: Contribution to Journal › Article › Academic › peer-review

Abstract

Knowledge of the file size distribution is needed to optimize file system design. In particular, if all the files are small, the disk block size should be small, too, to avoid wasting too large a fraction of the disk. On the other hand, if files are generally large, choosing a large block size is good since it leads to more efficient transfers. Only by knowing the file size distribution can reasonable choices be made. In 1984, we published the file size distribution for a university computer science department. We have now made the same measurements 20 years later to see how file sizes have changed. In short, the median file size has more than doubled (from 1080 bytes to 2475 bytes), but large files still dominate the storage requirements.
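The block-size trade-off described above can be made concrete with a small sketch. It is not the authors' measurement method. It only assumes that each file occupies a whole number of blocks and computes how much of the allocated space is left unused. The function names and the sample file sizes are hypothetical. The two medians from the abstract (1080 and 2475 bytes) appear only as convenient example sizes.

```python
import statistics

def allocated_bytes(size, block_size):
    """Bytes occupied on disk when a file of `size` bytes is stored in whole blocks."""
    blocks = -(-size // block_size)  # integer ceiling division
    return blocks * block_size

def wasted_fraction(sizes, block_size):
    """Fraction of allocated space that holds no file data (internal fragmentation)."""
    allocated = sum(allocated_bytes(s, block_size) for s in sizes)
    used = sum(sizes)
    return (allocated - used) / allocated if allocated else 0.0

# Hypothetical file sizes in bytes: mostly small files plus one large file.
sample_sizes = [120, 800, 1080, 2475, 4096, 9000, 1_500_000]

if __name__ == "__main__":
    print("median size:", statistics.median(sample_sizes))
    for block_size in (512, 1024, 4096, 16384):
        print(block_size, round(wasted_fraction(sample_sizes, block_size), 4))
```

Running this over a real list of file sizes, rather than the invented sample, shows how the wasted fraction grows with block size when small files are common.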
Original language: English
Pages (from-to): 100-108
Journal: ACM SIGOPS Operating Systems Review
Volume: 40
Issue number: 1
Publication status: Published - 2006

Bibliographical note

filesize_osr2006
