Accession Number : ADA293966
Title : Recoverable Distributed Shared Memory Under Sequential and Relaxed Consistency.
Descriptive Note : Technical rept.
Corporate Author : ILLINOIS UNIV AT URBANA COORDINATED SCIENCE LAB
Personal Author(s) : Janssens, Bob ; Fuchs, W. K.
Report Date : MAY 1995
Pagination or Media Count : 31
Abstract : Distributed shared memory (DSM) implemented on a cluster of workstations is an increasingly attractive platform for executing parallel scientific applications. Checkpointing and rollback techniques can be used in such a system to allow the computation to progress despite the temporary failure of one or more processing nodes. The complexity and overhead inherent in traditional message-passing checkpointing techniques can be reduced by taking advantage of specific properties of DSM. In this paper we show that, if designed correctly, a DSM system needs to consider only a subset of message-passing dependencies for correct rollback. A passive server model of DSM computation is described that allows dependency restrictions to be loosened by treating the events involved in interactions between nodes as atomic. An ownership timestamp scheme is used to eliminate many of the dependencies related to keeping directories consistent. The schemes can be implemented in DSM hardware by simply redesigning the directory at the network interface. Finally, we show that by relaxing the memory consistency model and using lazy release consistency, it is possible to further relax dependency restrictions. (AN)
Descriptors : *DISTRIBUTED DATA PROCESSING, *BLOCK ORIENTED RANDOM ACCESS MEMORIES, ALGORITHMS, RECOVERY, OPTIMIZATION, DATA MANAGEMENT, INFORMATION TRANSFER, COMPUTER COMMUNICATIONS, PARALLEL PROCESSING, PARALLEL PROCESSORS, CONSISTENCY, FINITE DIFFERENCE THEORY, CLUSTERING, WORK STATIONS, MULTIPROCESSORS, SYSTEMS ANALYSIS, COMPUTER NETWORKS, MESSAGE PROCESSING, TIME SHARING, CONTROL SEQUENCES.
Subject Categories : Computer Hardware
Distribution Statement : APPROVED FOR PUBLIC RELEASE