Dynamic Web Application Deployment

Dynamic Web Application Deployment Instructor: Dr. Zhang Presenter: Ningfang Mi Date: Nov. 3 2004

Outline • Motivation • Challenge of Dynamic Content • Approaches • ESI • CSI • ACDN • Discussion • Conclusion • Reference

Motivation • Caching • Important tool to deal with the rate of requests to Internet servers • Reduce network congestion • Reduce page display time • Client-centric: proxy caching • Server-centric: • reverse proxy caching • content delivery network (CDN) • Limitation: mostly oriented toward static content

Dynamic Web Pages • Static Web pages are not ENOUGH! • More and more pages contain dynamic content • News headlines, stock information, current temperature …… • Good news: • a more compelling experience for the end-user • an easier development model for the application designer • Bad news: • Bad for caching!

Dynamic component

Challenge of Dynamic Content • Web developers frequently use technologies like JavaServer Pages (JSP) and Active Server Pages (ASP) to design their applications • But when traffic on Web sites increases, the computing overhead can result in increasing delays and failures in data delivery

Challenge of Dynamic Content (2) • Dynamic content places significant strain on traditional Web site architectures • the same infrastructure used to generate the content is used to deliver the content www.esi.org

Challenge of Dynamic Content (3) • Generating dynamic content typically incurs: • network overhead as user requests are dispatched to appropriate software modules that service these requests • processing overhead as these modules determine which data to fetch and present • disk I/O as these modules query the back-end database • In short, building dynamic Web pages is computationally expensive

Challenge of Dynamic Content (4) • Two major issues: • Site Experience and Effectiveness • dynamic content, abandon rate, download speed • Site Cost Structure • investments to support scalability, reliability, performance, system management, etc. • One more problem: • How to facilitate caching for dynamic Web pages?

Approaches • Caching dynamic responses and on mechanisms of timely invalidation of the cached copies • Assembling a response at the edge from static and dynamic components • Edge Side Includes (ESI) assembling • Client Side Includes (CSI) assembling • Application distribution networks, running complete applications at the edge of the network • Application Content Delivery Network (ACDN)

Fragment-based Technologies • Dynamic pages are not all dynamic • Most bytes are in a static page template • Dynamic fragments are a small fraction of the entire page • Different portions have different properties • Template: slow-changing content • Fragment: fast-changing content

Fragment 1 Fragment 2 Full page: • 30731 bytes news headlines: • 927 bytes (3%) • refetched every few hours stock quotes: • 1231 bytes (4%) • refetched every minute

Reassembling Fragmented Page • Historically, page is assembled at origin sites using ASP, JSP, Server-Side Includes. • Edge Side Includes Language • An XML-based mark-up language • A mechanism to assemble page from different components at edge servers (reverse proxies) • Separate cache control for each component • Independently download changed fragments

GET /www.att.com GET /www.att.com Full page Full page GET /www.att.com GET /frag1.html Full page Frag1 Akamai’s Approach for ESI-encoded Contents Origin server Edge server Browser No ESI Edge server (template cached) Origin server Browser Page Assembly ESI with edge-side page assembly

ESI -- Benefits and Limitation • Key Benefits of ESI • extends the performance and cost-saving benefits of Web caching and content delivery services • Bottleneck of the Last Mile • A large majority of Web users still rely on dial-up connections. • Network traffic & revenue analysis: 79% of consumer subscribers as of March 2002. • Jupiter Media Metrix, Aug 2001: 59% of the predicated on-line households in the US in 2006. • ESI does NOT help dial-up customers! • The speed of the last mile dominates the page display time.

Client-Side Includes (CSI) • Key idea: • Assemble page in the browsers • Dramatically reduce user response time • Not need browser modifications or configurations. • Use existing technologies inside Internet Explorer • Page parsing and assembly: JavaScript • Retrieval of page components: ActiveX • Free to use or not use a CDN • Without: client  origin server directly • With: client  edge server  origin server • scalable delivery of page template and fragments • Traffic reduction between client and edge server

GET /frag1.html GET /frag1.html Frag1 Frag1 Page Assembly Alternatives Edge server Origin server Browser GET /www.att.com GET /www.att.com No ESI Full page Full page Page Assembly GET /www.att.com GET /frag1.html ESI with edge-side assembly Full page Frag1 (template cached) (template cached) ESI with client-side assembly Page Assembly

ESI vs. CSI • Same markup language (ESI) • ESI assembling: • Reduces bandwidth and server load • CSI assembling: • Reduces connectivity costs at origin server (less load and bandwidth). • Reduces CDN-related costs (less bandwidth from edge to clients). • Reduces browser download times (less bandwidth at last mile).

Implementation of CSI • Implement CSI for the prevalent browser only (Microsoft Internet Explorer MSIE) • Resort to edge-side or server-side page assembly for all other browsers

Javascript: assemble a Web page ActiveX: download page component Wrapper: invoke Javascript and pass it the URL of the requested page GET /www.att.com Wrapper Typically satisfied from client’s cache (cacheable, immutable for given page) GET CSI Javascript (cacheable, same for all pages) Obtain fragments using ActiveX Obtain fragments Using HTTP Implementation (with a CDN) Edge server Origin server Browser

Performance Evaluation • Synthetic pages: random generated contents • Sizes: 20K, 60K, 100K • Template (80%) + four fragments (5% each) • AT&T page: http://www.att.com • One template, two fragments • Wall Street Journal page: http://online.wsj.com • One template, three fragments

nothing cached ESI processing overhead CSI script cached Template cached Display Time of Synthetic Pages Over dial-up links Conclusion: Substantial reduction in display time across all page sizes

Bandwidth Reduction All numbers are in bytes Conclusion: CSI can achieve significant reduction in bandwidth when the templates are cached in the browser.

Limitation of CSI • The need to download the wrapper increases latency when first time access the page. • Sequentially and synchronously downloading fragments may slow down page assembly. • Javascript downloads the template and all fragments only from the same Web site. • Some pages that are well suited to ESI assembly may not be amenable to CSI. • Accessed by very many clients • Once per client over a long interval

Application Content Delivery Networks (ACDNs) • Currently CDN provide access to static and streaming content • Proxy caches can improve the delivery • Unique CDN value: • Delivering dynamic content • Proxy can’t cache the dynamic content • An Application CDN (ACDN) • Deploy the application on a single computer • Replicate or migrate the application as needed

Issues of ACDN • Application distribution framework • Dynamically deploy a replica • Keep consistency of replicas • Content placement algorithm • Decide which applications to deploy where and when • Request distribution algorithm • Decide how to distribute requests among replicas • System stability – reach a steady state • Bandwidth overhead – create replicas

Server CGI scripts Decision process Start-up Load reporter Repl target Repl source Local replicator Updater Server Application Application Local replicator Metafile Metafile Architecture Overview Standard Web server Keep track of application replicas Central Replicator Compute request distribution policy Load-balancing DNS Client Client Client

Server CGI scripts Decision process Start-up Load reporter Repl target Repl source Local replicator Updater Server Application Application Local replicator Metafile Metafile Architecture Overview Invoked by system administrator when a new ACDN server on-line Central Replicator Load-balancing DNS Client Client Client

Server CGI scripts Decision process Start-up Load reporter Repl target Repl source Local replicator Updater Server Application Application Local replicator Metafile Metafile Architecture Overview Invoked by central replicator and report load of the server Central Replicator Load-balancing DNS Client Client Client

Server CGI scripts Decision process Start-up Load reporter Repl target Repl source Local replicator Updater Server Application Application Local replicator Metafile Metafile Architecture Overview Periodically examine every application to decide replicate or delete Central Replicator Load-balancing DNS Client Client Client

ACDN Components • Application distributed framework • Dynamically create and delete application replicas based on demand • Maintain replica consistency • Content placement algorithms • Request distribution algorithms

Application Distributed Framework -- Metafile • Two parts in a metafile: • A list of all files comprising the application along with their last-modified dates • An initialization script (or a URL of the file with the script) ran by the recipient server before accepting any request Metafile Executable file FILE /home/applications/mapping/query_engine.cgi 1999.apr.14.08:46:12 FILE /home/applications/mapping/map_database 2000.oct.15.13:15:59 FILE /home/applications/mapping/user_preferences 2001.jan.30.18:00:05 Two Data files SCRIPT mkdir /home/applications/mapping/access_stats setenv ACCESS_DIRECTORY /home/applications/mapping/access_stats ENDSCRIPT Create a directory Set the environment variable

Application Metafile • A metafile is treated as a static Web page with its own URL. • Using a metafile, the application distribution framework can be implemented over standard HTTP. • Operations of framework: • Replica creation • Replica deletion • Replica consistency Migration = creation + deletion

Query for least-load server Return the least-load server Invoke the repl target CGI script URL of the application metafile Replica Creation • Initiated by the decision process on the source server Central Replicator overload Source Server Target Server

Invoke the repl source CGI script URL of the application metafile Replica Creation • Initiated by the decision process on the source server Central Replicator Query for least-load server Return the least-load server overload Source Server Target Server

Tar file of application Replica Creation • Initiated by the decision process on the source server Unpack Install Execute initialization script Central Replicator Query for least-load server Return the least-load server overload Source Server Target Server

Update DNS Server New replica Replica Creation compute request distribution policy • Initiated by the decision process on the source server Central Replicator Query for least-load server Return the least-load server overload Source Server Target Server

Central Replicator query to delete Deletion update DNS Server Permission to delete with DNS TTL Confirm with DNS TTL Replica Deletion • Initiated by the decision process on a server with the replica compute request distribution policy Mark the replica as “deleted” Delete it after the TTL Not the last replica Source Server TTL: delay for the application requests arriving due to earlier DNS responses

Consistency Maintenance • Only deal with the developer updates • Three issues: • Replica divergence: conflicting updates • Only update the primary application replica • Replica staleness and replica coherency • Missing updates and updates not to all files • If detect the cached metafile not valid, then download the new metafile and copy all modified objects from the primary server

ACDN Algorithms • Application distribution framework • Content placement algorithm • Decide which applications to deploy where and when • Request distribution algorithm

Content Placement Algorithm • Executed periodically by ACDN server • Make a local decision on deleting, replicating, migrating its applications For each application app: (1) If demand below Deletion threshold, delete app unless the only replica; (2) If demand from another server’s region exceeds Deletion threshold and replication benefits are likely to exceed transfer overhead, try to replicate there; (3) If demand from another server’s region exceeds 50% of total and migration benefits are likely to exceed transfer overhead, try to migrate there; Improve proximity of servers to client requests

Content Placement Algorithm (2) If server is overloaded: (1) Find the least-loaded server from central replicator; (2) Replicate some applications there if the load at the least-loaded server is above the deletion threshold (3) Otherwise, migrate some applications there if its projected load after receiving the application will remain acceptable (below LW) Achieve load balancing among servers

ACDN Algorithms • Application distribution framework • Content placement algorithm • Request distribution algorithm • Decide how to distribute requests among replicas

Request Distribution Algorithm • Goal: Never skip the nearest non-overloaded server and yet reduce oscillations in request distribution • iDNS: load-balancing DNS server • Request distribution policy • (R, Prob(1), …, Prob(N)) • Prob(i) is the probability of selecting server i for a request from the region R

Request Distribution Algorithm(2) • Three phases: • Assign the probability to each server based on its load • Examine all servers with a replica of the application in the order of the increasing distance from the region • Normalize the probabilities of these servers so that they sum up to one

Initial probabilities: Set prob(i) = 0 for all i Loop through the replicas in order of decreasing proximity if load(i) < LW prob(i) =1.0 exit else if LW <= load(i) < HW prob(i) = (HW – load(i)) / (HW – LW) • Adjustments to distance from region R: remainder = 1.0 Loop through the servers with a replica of the application in order of increasing distance from region R prob(i) = prob(i) * remainder remainder = remainder – prob(i) • Final probabilities: if sum of all > 0 prob(i) = prob(i) / sum of all else prob(i)=1/n, where n is the number of replicas

ACDN Performance -- Request Distribution • Three servers with decreasing proximity to all clients • Server 1 is the closest, server 2 is the next closest, server 3 is the farthest. • HW=1000 request/second • LW=200 request/second • Start with 10 clients, gradually increase to over server capacity, then decrease back to 10 clients • ACDN, pure random and CDN brokering • CDN brokering: select the closest one with load < 80% of its capacity

Random Prefect load balancing Unnecessary high latency ACDN Efficient using proximity information Avoid overloading the replicas CDN brokering Consider both load and proximity But not as well as ACDN

ACDN Performance -- Content Placement • 10% of servers are in “hot” regions with 90% of demand • 90% of servers are in “cold” regions with 10% of demand • The set of hot regions changed every 400 seconds, see how the system adapts. • Two other algorithms: • Static: a replica is created when the simulation starts and is fixed through the simulation • Ideal: can get instantaneous knowledge of hot region and replicates or deletes application.

static static ACDN ACDN Ideal Ideal Response Latency Network bandwidth consumption Conclusion: Quickly adapt to the set of hot regions and significantly reduce network bandwidth and response time

Dynamic Web Application Deployment