Toward Replication in Grids for Digital Libraries with Freshness and Correctness Guarantees

Authors
Fuat Akal, Heiko Schuldt and Hans-Jörg Schek
Type
Journal Article
Date
2008/1
Appears in
Concurrency and Computation Practice and Experience
Publisher
John Wiley & Sons
Abstract
Building digital libraries (DLs) on top of data grids while facilitating data access and minimizing access overheads is challenging. To achieve this, replication in a Grid has to provide dedicated features that are only partly supported by existing Grid environments. First, it must provide transparent and consistent access to distributed data. Second, it must dynamically control the creation and maintenance of replicas. Third, it should allow higher replication granularities, i.e. beyond individual files. Fourth, users should be able to specify their freshness demands, i.e. whether they need most recent data or are satisfied with slightly outdated data. Finally, all these tasks must be performed efficiently. This paper presents an approach that will finally allow one to build a fully integrated and self-managing replication subsystem for data grids that will provide all the above features. Our approach is to start with an accepted replication protocol for database clusters, namely PDBREP, and to adapt it to the grid.