Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Rule-based curation and preservation of data: a data grid approach using iRODS
Hedges M., Blanke T., Hasan A. Future Generation Computer Systems25 (4):446-452,2009.Type:Article
Date Reviewed: Jun 29 2009

Why would anybody want to use iRODS? It’s just a glorified file transfer protocol (FTP) system with a database, isn’t it? This paper answers the question. The authors point out that “there is a pressing need for the digital curation and preservation communities to implement strategies for ensuring long-term access,” and the iRODS rule engine provides a framework for such strategies.

The iRODS replication rules can be used to ensure that data is actually stored at multiple sites. But there is also a need to ensure that data stored today can be read by whatever applications may be available ten years from now. One approach to accomplishing this has been to save, along with the data, the current application software and its associated execution environment--this is not always easy to do.

An alternative, presented in this paper, is to use iRODS rules to convert digital objects at inception time to formats that are preferred at that time. For instance, objects that use proprietary document formats can be converted to an open Extensible Markup Language (XML)-based format. To illustrate this, a set of rules is presented for conditionally invoking micro services that convert Microsoft Word and Excel formats to OpenDocument and comma separated values (CSV) formats. The iRODS rule syntax explanations given here are clearer than the ones I’ve seen elsewhere.

However, this, in itself, is not sufficient. A requirement for periodic actions to be taken throughout an object’s life cycle exists, to ensure that the format in which it has been saved has not become obsolete. An example is given for conversion of files in TIFF format to JPEG 2000 formats.

Using iRODS 1.0, the authors implement a system that explores the concepts presented in this paper, using data from the Arts and Humanities Data Service (AHDS); they are now also looking at a technology-neutral approach that will insulate designers from the iRODS syntax.

I recommend this paper to anyone who uses or may be able to use iRODS. I look forward to seeing further papers on the author’s work.

Reviewer:  G. K. Jenkins Review #: CR137026 (1002-0174)
Bookmark and Share
  Reviewer Selected
Featured Reviewer
 
 
Distributed Systems (D.4.7 ... )
 
 
Distributed Databases (H.2.4 ... )
 
 
Distributed Systems (C.2.4 )
 
Would you recommend this review?
yes
no
Other reviews under "Distributed Systems": Date
The design of the Saguaro distributed operating system
Andrews G., Schlichting R., Hayes R., Purdin T. IEEE Transactions on Software Engineering 13(1): 104-118, 1987. Type: Article
Sep 1 1987
Modern operating systems
Tanenbaum A., Prentice-Hall, Inc., Upper Saddle River, NJ, 1992. Type: Book (9780135881873)
Dec 1 1992
The drinking philosophers problem
Chandy K., Misra J. ACM Transactions on Programming Languages and Systems 6(4): 632-646, 1984. Type: Article
Jun 1 1985
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy