150 likes | 168 Vues
Explore the complexities of synchronization and locking in distributed systems, the importance of replication for reliability and performance, and strategies for maintaining consistency across replicas.
 
                
                E N D
Replication and Consistency CS-4513D-Term 2007 (Slides include materials from Operating System Concepts, 7th ed., by Silbershatz, Galvin, & Gagne, Modern Operating Systems, 2nd ed., by Tanenbaum, and Distributed Systems: Principles & Paradigms, 2nd ed. By Tanenbaum and Van Steen) Replication and Consistency
Problem • Given that synchronization and locking is so difficult, do we really need it in a distributed system? • Is there a better way? Replication and Consistency
Example • Grapevine registry contains a list of items • Groups • Individuals • If we need to update one item, do we need to lock entire registry? • Where do we update that one item? • How does it get propagated to other replicas? Replication and Consistency
Replication • The art and science of keeping multiple copies of data in a distributed system • Two reasons:– • Reliability • Performance Replication and Consistency
Example 1 • Grapevine keeps replicas of each registry on at least three Registration Servers • At least one is located far away • Performance • Registries accessed frequently – spread the load • Updates happen often enough that locking a registry would be a real bottleneck • Reliability • Destination registry needed for sending a message • Even if a server is down or network is partitioned Replication and Consistency
Example 2 • DNS (Domain Name Service) allows owner of a domain to replicate name database • Same two reasons • Performance • Reliability • Also • Scaling technique Replication and Consistency
Replication • Data centric • I.e., try to keep data consistent across replicas • Client centric • Weaker condition • Only maintain consistency for each client separately Replication and Consistency
General Approach • Update a single item in local replica • Atomically • Time-stamp – e.g., logical clock • Replica propagates update to all of its other replicas • Periodically • Receiving replica merges update with its own copy • Conflicting updates resolved arbitrarily to latest time-stamp Replication and Consistency
Strict consistency (in time) Not strict consistency Consistency Replication and Consistency
All processes see updates in same order Processes see updates in opposite order! Consistency (continued) Replication and Consistency
Consider three processes • Three shared variables: x, y, z • Initialized to zero • Each updates one variable, and then reads and prints other two Replication and Consistency
Four Valid Execution Sequences Replication and Consistency
Causal Relationships • If A causes B to happen, then A should appear before B to all observers Replication and Consistency
Summary • Can be very complicated • Textbooks go into much greater depth • Needed for particular distributed systems • Can be ignored in others • In Grapevine, update may not be reflected to updater! • Simply retry later. Replication and Consistency
Questions? Replication and Consistency