
Resilient Overlay Networks (RON)





  1. Resilient Overlay Networks (RON) Work by: Andersen, Balakrishnan, Kaashoek, and Morris. Appeared in: SOSP, Oct 2001. Presented by: Matt Trower and Mark Overholt. Some images are taken from the original presentation.

  2. Background • Overlay network: a network built on top of another network • Examples: the Internet (over the telephone network), Gnutella (over the Internet), RON (over the Internet + Internet2) • AUP: Acceptable Use Policy (e.g., the educational Internet2 cannot carry commercial traffic)

  3. Motivation • BGP is slow to converge (~10 minutes) • TCP connections typically time out in under 512 seconds • End-to-end paths are unavailable 3.3% of the time • 5% of outages last more than 2 hours • Failures are caused by configuration errors, cut lines (e.g., by ships), and DoS attacks • What if we need more than four 9s (99.99%) of reliability?

  4. Sidenote • Andersen started an ISP in Utah before going to MIT • A mix of industry and research background

  5. Goals • Detect failures and recover in less than 20s • Integrate routing decisions with application needs • Expressive Policy Routing • Per-user rate controls OR packet-based AUPs

  6. Big Idea Use inherent diversity of paths to provide link redundancy Use failure detectors for small subset of nodes in network Select best path from multiple options

  7. Triangle Inequality Best Path ≠ Direct Path
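The selection rule behind this slide can be sketched in a few lines of Python: compare the direct path against every single-hop detour and keep the cheapest. The node names and latency numbers below are illustrative, not measurements from the paper.

```python
# Hypothetical per-link latencies in ms (illustrative values only).
latency = {
    ("A", "B"): 100, ("B", "A"): 100,
    ("A", "C"): 30,  ("C", "A"): 30,
    ("C", "B"): 40,  ("B", "C"): 40,
}

def best_path(src, dst, nodes):
    """Compare the direct path against every single-hop detour and
    return the cheapest (path, latency) pair."""
    candidates = {(src, dst): latency[(src, dst)]}
    for mid in nodes:
        if mid not in (src, dst):
            candidates[(src, mid, dst)] = latency[(src, mid)] + latency[(mid, dst)]
    return min(candidates.items(), key=lambda kv: kv[1])

path, cost = best_path("A", "B", ["A", "B", "C"])
# The A -> C -> B detour (70 ms) beats the direct 100 ms link.
```

Single-hop detours are all RON needs in practice: the paper's results (slide 16) found one level of indirection sufficient.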

  8. Design • Per-node components (from the architecture diagram): Conduit, Forwarder, Router, and Prober • C++ application-level library (send/recv interface) • Performance database • Application-specific routing tables • Policy routing module

  9. Failure Detection • Each node sends probes over its N−1 virtual links, so probe traffic across the overlay grows as O(N²) • Probe interval: 12 seconds • Probe timeout: 3 seconds • Routing update interval: 14 seconds • On a lost probe, send 3 fast retries
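The probing discipline above can be sketched as follows; `send_probe` is a stand-in for the real network call, and the constants come from the slide.

```python
PROBE_TIMEOUT_S = 3   # probe timeout from the slide
FAST_RETRIES = 3      # fast retries sent once a probe is lost

def link_is_up(send_probe):
    """Declare a virtual link up as soon as any probe is answered, and
    down only after the initial probe plus FAST_RETRIES all time out."""
    for _ in range(1 + FAST_RETRIES):
        if send_probe(timeout=PROBE_TIMEOUT_S):
            return True
    return False

# A link that drops the first probe but answers the first fast retry
# is still reported as up.
replies = iter([False, True])
up = link_is_up(lambda timeout: next(replies, False))
```

Requiring all four probes to fail before declaring an outage keeps transient loss from triggering a route change, while the 3-second timeout keeps detection well under RON's 20-second recovery goal.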

  10. Overhead Acceptable for small cliques

  11. Experimental Setup • Two configurations • RON1: 12 hosts in the US and Europe, a mix of academic and industry sites; 64 hours of data collected in March 2001 • RON2: the original 12 hosts plus 4 new hosts; 85 hours collected in May 2001

  12. Analysis – Packet Loss (RON1, UDP)

  13. Analysis – Latency (RON1, UDP)

  14. Analysis – Throughput (RON1, TCP)

  15. Resiliency to DoS Attacks • Demonstrated on a 3-node system • ACKs take a different path than the data • Run on Utah's Network Emulation Testbed

  16. Pros & Cons • Pros • Recovers from complete outages and severe congestion • Doubles throughput in 5% of samples • Single-hop redirection is sufficient • Cons • Valid paths are not always considered (e.g., Cisco → MIT → NC-Cable → CMU) • O(N²) probing growth limits scalability • ACKs must also be routed over the overlay

  17. Discussion How does geographic distribution affect RON? Why doesn’t BGP do this already? What if everyone was part of a RON? What if all latencies fall below 20ms?

  18. A Scalable Content-Addressable Network Work by: Ratnasamy, Francis, Handley, and Karp Appeared In: SIGCOMM ‘01

  19. Distributed Hash Table A decentralized system that provides a key-value lookup service across a distributed system. DHTs support insertion, lookup, and deletion of data. Image from Wikipedia

  20. Content Addressable Network CAN is a design for a distributed hash table. CAN was one of the original four DHT proposals, introduced concurrently with Chord, Pastry, and Tapestry.

  21. Motivation P2P applications were at the forefront when CAN was designed. P2P apps scaled well for file transfer, but not for indexing content, so CAN was originally designed as a scalable index for P2P content. Its use is not limited to P2P, however: it could also serve large-scale storage systems, content-distribution systems, or DNS.

  22. CAN: Basic Design The overlay network is a Cartesian coordinate space on a d-torus (the coordinate space wraps around in every dimension). Each CAN node is assigned a "zone" of the d-dimensional space to manage. Each node knows only the nodes in neighboring zones. Assume for now that each zone has exactly one node.

  23. Example

  24. Example Neighbor Lists: Zone 1 knows about Zones 4, 5, and 3 Zone 2 knows about Zones 4, 5, and 3 Zone 3 knows about Zones 5, 1, and 2 Zone 4 knows about Zones 2 and 1 Zone 5 knows about Zones 1, 2, and 3

  25. Inserting Data in a CAN Given a (key, value) pair, hash the key d different ways, where d is the number of dimensions. The resulting coordinate is mapped onto the overlay, and the node responsible for that coordinate stores the (key, value) pair.

  26. Example • Given (Key,Value): • HashX(Key) = Xcoord • HashY(Key) = Ycoord
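A minimal sketch of the HashX/HashY idea: derive d coordinates from one key. Salting a single SHA-256 hash with the dimension index is one simple way to get d independent hash functions; it is an assumption of this sketch, not the paper's construction.

```python
import hashlib

def can_coords(key, d=2, side=1.0):
    """Map a key to a point on the d-dimensional torus by hashing it
    d different ways (here: one salted hash per dimension)."""
    point = []
    for i in range(d):
        digest = hashlib.sha256(f"{i}:{key}".encode()).digest()
        # Use 8 bytes of the digest as an integer, scaled into [0, side).
        point.append(int.from_bytes(digest[:8], "big") / 2 ** 64 * side)
    return tuple(point)

x, y = can_coords("song.mp3")   # deterministic pseudo-random point
```

Because the mapping is deterministic, any node can recompute the same coordinate for a key and route a lookup toward it.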

  27. Routing in a CAN A message hops from node to node, getting closer to the destination at each step. A node knows only its immediate neighbors. The average routing path length is (d/4)(n^(1/d)) hops; if d is chosen to grow as log(n), the path length becomes O(log n).
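Greedy forwarding on the torus can be sketched like this; zones are simplified to representative points, an assumption of the sketch rather than CAN's zone-based rule.

```python
def torus_dist(a, b, side=1.0):
    """Euclidean distance on a d-torus: each axis wraps at `side`."""
    return sum(min(abs(x - y), side - abs(x - y)) ** 2
               for x, y in zip(a, b)) ** 0.5

def next_hop(neighbor_points, target):
    """Greedy CAN forwarding: hand the message to whichever neighbor
    lies closest (in torus distance) to the target point."""
    return min(neighbor_points, key=lambda p: torus_dist(p, target))

# A node at (0.1, 0.1) with three neighbors; thanks to wrap-around,
# (0.9, 0.1) is the closest hop toward the target (0.8, 0.1).
hop = next_hop([(0.3, 0.1), (0.1, 0.3), (0.9, 0.1)], (0.8, 0.1))
```

The wrap-around in `torus_dist` is what makes the space a torus: a message near one edge can reach the opposite edge in a single hop.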

  28. Adding Nodes in a CAN A new node N asks a bootstrap node for the IP address of some node S already in the system. N picks a random point P in the coordinate space; P lies in the zone managed by some node D. Using CAN routing, the join request is routed from S to D. Node D splits its zone and gives half to N to manage. The neighbor lists of all nodes adjacent to D and N, including D and N themselves, are updated.

  29. Example Node N routes to the zone containing point P. That zone is then split between the new node N and the old node D.
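The zone split at the heart of the join can be sketched as halving one dimension of a hyperrectangle. Which half goes to the old node and which to the joiner is a detail the slide leaves open, so the assignment below is an assumption.

```python
def split_zone(zone, dim=0):
    """Halve a zone (a list of (lo, hi) intervals, one per dimension)
    along `dim`: here the old node D keeps the lower half and the
    joining node N takes the upper half."""
    lo, hi = zone[dim]
    mid = (lo + hi) / 2
    keep, give = list(zone), list(zone)
    keep[dim] = (lo, mid)
    give[dim] = (mid, hi)
    return keep, give

# D owned the whole unit square; after N joins, it is split along x.
d_zone, n_zone = split_zone([(0.0, 1.0), (0.0, 1.0)], dim=0)
```

Alternating the split dimension on successive joins keeps zones roughly square, which in turn keeps neighbor counts small.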

  30. Node Removal Routing must be repaired when a node leaves or dies. If a neighboring zone can merge with the vacated zone while maintaining rectilinear integrity, it does so. Otherwise, the neighbor with the smallest zone attempts to take over the dead node's zone: each neighbor independently starts a timer and then sends a "takeover" message to its neighbors. A node that receives a takeover message cancels its own timer if the sender's zone is smaller than its own, or replies with a takeover message of its own if the sender's zone is bigger.

  31. Proposed Improvements • Multi-dimensional coordinate spaces • Multiple realities: multiple overlapping coordinate spaces • Better routing metrics • Multiple nodes per zone • Multiple hash functions (replication) • Geographically sensitive overlay construction

  32. Experimental Data

  33. Comparisons Comparing Basic CAN to “Knobs on Full” CAN

  34. Discussion - Pros Using some of the proposed improvements makes CAN a very robust routing and storage protocol. Using geographic location in overlay creation would yield smarter hops between nearby nodes. (But what about a geographically concentrated disaster?)

  35. Discussion - Cons • Not much work on load-balancing the keys • With all of the extra features running at once, CAN becomes quite complicated • Hard to guarantee a uniform distribution of keys with hash functions at large scale • Query correctness is a concern

  36. Pastry Work by: Rowstron and Druschel Appeared In: Middleware ‘01

  37. The Problem • Maintain the overlay network under both node arrivals and failures • Load balancing • Network-proximity-sensitive routing

  38. Pastry • Lookup/insert in O(log N) hops • Per-node state: O(log N) • Network-proximity-based routing

  39. Design [Figure: the circular id namespace from 0 to 2^128 − 1, with objIds and nodeIds mapped onto the ring]

  40. Design [Figure: the same ring, highlighting the owner of object O — the node whose nodeId is closest to the objId]

  41. Lookup Table • Prefix matching based on Plaxton Routing
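One prefix-matching step can be sketched as follows. The flat `(row, digit) -> nodeId` dictionary is a toy stand-in for Pastry's real routing table, and the ids shown are hypothetical.

```python
def shared_prefix_len(a, b):
    """Number of leading digits two hex ids have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n

def pastry_next_hop(my_id, key, routing_table):
    """One prefix-routing step: forward to a node whose id shares a
    strictly longer prefix with the key than my_id does."""
    row = shared_prefix_len(my_id, key)
    return routing_table.get((row, key[row]))

# Node a123 routes key a7c4: they share the prefix "a", so consult
# row 1, column "7" of the table.
table = {(1, "7"): "a7ff"}          # hypothetical entry
hop = pastry_next_hop("a123", "a7c4", table)
```

Each hop lengthens the matched prefix by at least one digit, which is why a lookup terminates in O(log N) hops.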

  42. Locality of Search Search widens as prefix match becomes longer!

  43. Parameters b: tradeoff between local storage and average hop count L: resiliency of routing
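The storage side of the b tradeoff is easy to put numbers on: with b bits per digit, the routing table has roughly ceil(log base 2^b of N) populated rows, each holding up to 2^b − 1 entries. This is a back-of-the-envelope sketch, not an exact count.

```python
import math

def pastry_table_size(n_nodes, b=4):
    """Approximate Pastry routing-table size: ceil(log_{2^b} N) rows,
    each with up to 2^b - 1 entries (b = 4 is a common default)."""
    rows = math.ceil(math.log(n_nodes, 2 ** b))
    return rows, rows * (2 ** b - 1)

rows, max_entries = pastry_table_size(10 ** 6)   # one million nodes
```

Raising b shrinks the hop count (fewer, taller rows) but multiplies per-row storage, which is exactly the tradeoff the slide names.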

  44. Security Choose next hop for routes randomly amongst choices Replicate data to nearby nodes

  45. Scalability

  46. Distance

  47. Discussion What does bigger leafset gain you? How do we decide proximity? What other features might we want to create a lookup table based upon?
