The Design and Implementation of a
Log-Structured File System
MENDEL ROSENBLUM and JOHN K. OUSTERHOUT
University of California at Berkeley
This paper presents a new technique for disk storage management called a log-structured file
system. A log-structured file system writes all modifications to disk sequentially in a log-like
structure, thereby speeding up both file writing and crash recovery. The log is the only structure
on disk; it contains indexing information so that files can be read back from the log efficiently.
In order to maintain large free areas on disk for fast writing, we divide the log into segments and
use a segment cleaner to compress the live information from heavily fragmented segments. We
present a series of simulations that demonstrate the efficiency of a simple cleaning policy based
on cost and benefit. We have implemented a prototype log-structured file system called Sprite
LFS; it outperforms current Unix file systems by an order of magnitude for small-file writes
while matching or exceeding Unix performance for reads and large writes. Even when the
overhead for cleaning is included, Sprite LFS can use 70% of the disk bandwidth for writing,
whereas Unix file systems typically can use only 5-10%.
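
As an illustration of what a cleaning policy "based on cost and benefit" can look like, the following is a minimal sketch and not the Sprite LFS implementation. It assumes the benefit-to-cost ratio (1 - u) * age / (1 + u) that the body of the paper develops, where u is the fraction of a segment that is still live. The `Segment` class, its field names, and the sample values are hypothetical and chosen only for the example.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """Hypothetical summary of one on-disk segment."""
    name: str
    utilization: float  # fraction of the segment still live, 0.0 to 1.0
    age: float          # age of the live data, in arbitrary time units


def cost_benefit(seg: Segment) -> float:
    """Benefit-to-cost ratio for cleaning one segment.

    Cost: read the whole segment (1) and write back its live data (u).
    Benefit: the space freed (1 - u), weighted by how old the data is.
    """
    u = seg.utilization
    return (1.0 - u) * seg.age / (1.0 + u)


def pick_segments_to_clean(segments, count):
    """Return the `count` segments with the highest benefit-to-cost ratio."""
    return sorted(segments, key=cost_benefit, reverse=True)[:count]


# Sample values are invented for illustration only.
segments = [
    Segment("hot", utilization=0.50, age=1.0),
    Segment("cold", utilization=0.75, age=20.0),
    Segment("full", utilization=1.00, age=50.0),
]
chosen = pick_segments_to_clean(segments, 1)
```

With these sample values the older segment is selected even though it holds more live data than the younger one, because the age weighting outweighs its higher copying cost. A fully live segment scores zero, since cleaning it would free no space.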
Categories and Subject Descriptors: D.4.2 [Operating Systems]: Storage Management–allocation/
deallocation strategies; secondary storage; D.4.3 [Operating Systems]: File Systems
Management–file organization, directory structures, access methods; D.4.5 [Operating Systems]:
Reliability–checkpoint/restart; D.4.8 [Operating Systems]: Performance–measurements,
simulation, operation analysis; H.2.2 [Database Management]: Physical Design–recovery and
restart; H.3.2 [Information Systems]: Information Storage–file organization
General Terms: Algorithms, Design, Measurement, Performance
Additional Key Words and Phrases: Disk storage management, fast crash recovery, file system
organization, file system performance, high write performance, logging,