230 likes | 312 Vues
A Multi-Site Virtual Cluster System for Wide Area Networks. Takahiro Hirofuchi, Takeshi Yokoi, Tadashi Ebara, Yusuke Tanimura, Hirotaka Ogawa, Hidetomo Nakada, Yoshio Tanaka, and Satoshi Sekiguchi National Institute of Advanced Industrial Science and Technology (AIST). What is Virtual Cluster?.
E N D
A Multi-Site Virtual Cluster System for Wide Area Networks Takahiro Hirofuchi, Takeshi Yokoi, Tadashi Ebara, Yusuke Tanimura, Hirotaka Ogawa, Hidetomo Nakada, Yoshio Tanaka, and Satoshi Sekiguchi National Institute of Advanced Industrial Science and Technology (AIST) LASCO08
What is Virtual Cluster? • Virtualization • Virtual Machine, SAN, VLAN • Cloud Computing • Get computing powers on demand • Virtualized Datacenters, Amazon EC2 (1VM = $0.1/hour) • Virtual cluster • Large-scale computing environment composed of VMs • Allow quick and easy deployment of many virtual nodes • Share physical clusters for multiple applications Application Scientific application Web 2.0 services Emulation testbed Application LASCO08
Background • Our virtual cluster system • Virtualization toolkitfor physical clusters • Easy to install, low cost • Allow users to customize virtualization • Problem • Physical resource limitation at a single site • Scalability and flexibility • => Use multi-site, many-site resources • Exponential increase of management cost • How to manage widely-distributed virtual nodes? LASCO08
Research Focus • Multi-site virtual cluster • Solve large-scale deployment and configuration issues of distributed virtual nodes • Break resource limitations at a single site • Small management cost, easy-to-use UI • Quick deployment and easy customization • Contribution • Design and implementation • Evaluation in WAN environments • The minimum WAN traffic for management • Quick configuration time like physical clusters LASCO08
Scenario Comparison(Single-Site Virtualization) • Software deployment and configuration are very tiresome work! • Manage many virtual nodes distributed in many sites • Update and customize them day by day Application Application Site B Site A LASCO08
Scenario Comparison(Multi-Site Cluster Virtualization) • Software deployment and configuration are performed through the view of a single cluster • Many virtual nodes at many sites can be seen like one physical cluster. • Easy management like a local single cluster Application Application Deployment and configuration Scientific application Web 2.0 services Emulation testbed Site B Site A LASCO08
Requirement • Powerful management support for large-scale virtual nodes • Scalable and robust • Flexible customization of node by node • Support heterogeneous resources in many sites • Intuitive administration interface • Allow quick deployment, minimize administrative network traffic LASCO08
Design Criteria • Exploit an existing cluster management system for distributed virtual nodes • Designed for physical clusters • Virtual node management (add, delete, and configure) • Integrate distributed VMs by Ethernet VPN • Bridge internal networks of single-site virtual clusters • Transparency for admins and users • Transparent package caching at each site • Package-based software installer • Quick installation and reconfiguration • Flexible customizability LASCO08
Cluster Management System(NPACI Rocks) Installation Configuration Operation Frontend Node Compute Node Private Network (LAN) • Automatic node install by PXE network boot • Cluster-wide application package called “Roll” • Parallel command execution tools (e.g., cluster-fork, tentakel) • Node status monitoring (e.g., Ganglia) • Automatic recovery from node down Node Config. Database LASCO08
Virtual Cluster Management Installation Configuration Operation Virtual Frontend Virtual Node Private Network Node Config. Database Allocate a group of VMs Isolate them by VLAN Ethernet VPN LASCO08
Virtual Node (Re-)Configuration(1) Virtual Frontend Virtual Node Ethernet VPN Node Config. DB NodeA: App P&Conf X NodeB: App Q&Conf Y NodeC: App R&Conf Z … PXE Boot Installation Package &Configuration Download LASCO08
Virtual Node (Re-)Configuration(2) Virtual Frontend Virtual Node Ethernet VPN Huge Download Traffic Packages x #Nodes Node Config. DB NodeA: App P&Conf X NodeB: App Q&Conf Y NodeC: App R&Conf Z … PXE Boot Installation Package &Configuration Download LASCO08
Virtual Node (Re-)Configurationwith Package Caching Virtual Frontend Virtual Node Ethernet VPN Node Config. DB NodeA: App P&Conf X NodeB: App Q&Conf Y NodeC: App R&Conf Z … Package Cache Server Minimize admin. traffic for node configuration LASCO08
PrototypeImplementation • REST API • Create a virtual cluster • Add/Delete a VM • Start/Stop a VPN • Reservation portal • Site resource monitoring • Automatic reservation • NPACI Rocks 4.2 • VMware Server • VLAN, iSCSI • OpenVPN 2.0 • Squid 3.0 LASCO08
Evaluation • Current status • Distributed VM configuration • Parallel command line tools • Node status monitoring • Points of evaluation • Feasibility • WAN environment • Large number of virtual nodes • Reconfiguration time • Network traffic Work fine ! LASCO08
Experiment Setting Node Config. DB 10 or 134 nodes, reconfigured with 900MByte packages 16 nodes AMD Opteron 244, 3GB Mem, Gb Eth x2 AMD Opteron 246, 6GB Mem, Gb Eth x2 LASCO08
Experiment without Package Caching Installation Time over WAN (10 Nodes) Even for only 10 nodes,it is too slow due to WAN traffic. (s) RTT (ms) LASCO08
Experiment with Package Caching VPN Throughput • RTT 20ms 8MB/s The package cache server reduces download traffic to a unique set of installed packages (800MB). 22 min Cache Server Throughput 80MB/s LASCO08
Summary • Minimized WAN traffic • A unique set of installed packages • Pre-caching is possible • Rapid configuration of virtual nodes • 20 – 30 min / 134 nodes (RTT20ms) • Not so far from local physical nodes • 15 – 25 min / 134 nodes LASCO08
Related Work • Virtual cluster + VPN • Virtuoso [Northwestern Univ., 2003] • Remote VM access via VPN • VioCluster [Purdue Univ., 2005] • Automatic shrink/expand by VPN topologies • Virtual Workspace • Globus-based virtual cluster services • Amazon EC2 • Xen-image-based deployment • PlanetLab • Distributed network emulation testbed LASCO08
Conclusion • Large-scale application environment • Scalability beyond a single site • Smart management framework • Flexible configuration • Multi-site virtual cluster • Exploit a powerful cluster manager • Single cluster view by Ethernet VPN • Package caching at each site • Evaluation • Quick configuration of distributed VMs • Minimized WAN traffic LASCO08
Future Work • Ongoing Project • Xen support • VM migration over WAN • Diskless boot • Other OS support • Availability • http://code.google.com/p/grivon LASCO08
Node ID Time Node Installation Progress Generating node configuration scripts. Download an Installer image Download packages 22 min LASCO08