Computing Reviews

Recovery in the Calypso file system
Devarakonda M., Kish B., Mohindra A. ACM Transactions on Computer Systems14(3):287-310,1996.Type:Article
Date Reviewed: 03/01/97

Calypso is a stateful, scalable distributed Unix file system that is implemented on IBM RISC clusters running AIX. This paper describes the recovery scheme used in Calypso, which guarantees data consistency of the file system across processor failures. The authors highlight Calypso recovery as nondisruptive and based on reconstructing the distributed state rather than on replicating the server state.

The state of a Calypso file system is defined in terms of a set of tokens, which represents client access to parts of files and client authorization to perform file and directory operations. The state of the tokens is distributed among the clients as well as maintained at the server. The recovery protocol reconstructs the state of the tokens by a series of remote procedure calls among the clients and the server. The recovery protocol depends on software services external to Calypso for failure detection and group membership. It also depends on hardware-level redundancy in the form of multiported disks for data availability and on the Journaled File System of AIX for server recovery.

In addition to a description of the recovery protocol, the authors compare the Calypso recovery scheme to that used in other distributed file systems, including Sprite and Spritely NFS. They also examine the bottlenecks of the recovery scheme by measuring its performance on two distributed file system benchmarks. They conclude that the recovery scheme provides relatively short recovery times with low overhead.

Reviewer:  S. K. Andrianoff Review #: CR120441 (9703-0198)

Reproduction in whole or in part without permission is prohibited.   Copyright 2024 ComputingReviews.com™
Terms of Use
| Privacy Policy