
Fault Tolerance and Performance Analysis in Wireless CORBA

Chen Xinyu. Supervisor: Prof. Michael R. Lyu. Markers: Prof. Jerome Yen, Prof. John C.S. Lui. 2002-12-09.



  1. Fault Tolerance and Performance Analysis in Wireless CORBA
     Chen Xinyu
     Supervisor: Prof. Michael R. Lyu
     Markers: Prof. Jerome Yen, Prof. John C.S. Lui
     2002-12-09

  2. Outline • Motivation • Wireless CORBA • Fault Tolerant Wireless CORBA • Performance and Availability Analysis • Conclusions and Future Work

  3. Motivation
     • Mobile Computing
       • Permanent failures
         • Physical damage
       • Transient failures
         • Mobile host
         • Wireless link
         • Environmental conditions
     • Fault Tolerant CORBA
       • Entity replication

  4. Wireless CORBA Architecture [Figure. Labels: Home Domain, Visited Domain, Terminal Domain, Access Bridges (ab1, ab2), Terminal Bridge, mobile host mh1, Static Hosts, GIOP Tunnel, GTP Messages]

  5. Wireless CORBA Architecture [Figure. Labels: Home Domain, Home Location Agent, Visited Domain, Access Bridges (ab1, ab2), Static Hosts; the Terminal Domain (Terminal Bridge, mh1, GIOP Tunnel) is drawn several times]

  6. Outline • Motivation • Wireless CORBA • Fault Tolerant Wireless CORBA • Performance and Availability Analysis • Conclusions and Future Work

  7. Basic Concepts
     • Checkpoint: the program's state, saved during failure-free execution
     • Repair: brings the failed device back to normal operation
     • Rollback: reloads the program state saved at the most recent checkpoint
     • Recovery: reprocessing of the program, starting from the most recent checkpoint and applying the logged messages up to the point just before the failure
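
The four concepts above can be illustrated with a minimal sketch. It is not the implementation from the presentation: the class `CheckpointedProcess`, its methods, and the toy "running total" state are hypothetical names chosen for illustration, and the checkpoint and log are simply assumed to live in stable storage that survives a crash.

```python
import copy


class CheckpointedProcess:
    """Toy process (hypothetical) whose state is a running total of messages."""

    def __init__(self):
        self.state = {"total": 0, "processed": 0}
        # Assumed to be kept in stable storage, so they survive a crash.
        self.checkpoint = copy.deepcopy(self.state)
        self.log = []  # messages received since the last checkpoint

    def _apply(self, value):
        self.state["total"] += value
        self.state["processed"] += 1

    def receive(self, value):
        # Log the message before processing it, then process it.
        self.log.append(value)
        self._apply(value)

    def take_checkpoint(self):
        # Save the current state; the log entries it covers are no longer needed.
        self.checkpoint = copy.deepcopy(self.state)
        self.log.clear()

    def crash(self):
        # A failure loses the volatile state only.
        self.state = None

    def rollback(self):
        # Reload the state saved at the most recent checkpoint.
        self.state = copy.deepcopy(self.checkpoint)

    def recover(self):
        # Roll back, then replay the logged messages up to the failure point.
        self.rollback()
        for value in self.log:
            self._apply(value)


p = CheckpointedProcess()
for v in (1, 2, 3):
    p.receive(v)
p.take_checkpoint()
for v in (4, 5):
    p.receive(v)
p.crash()
p.recover()
print(p.state)  # {'total': 15, 'processed': 5}
```

Repair is not modeled here; in this sketch it corresponds to whatever happens between `crash()` and `recover()`.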

  8. Device, Wireless & Mobile Issues
     • Device Issues
       • Slow processor
       • Small memory
       • Small disk space
       • Low power supply
       • Physical damage
     • Wireless Issues
       • High bit error rate
       • Little bandwidth
       • Long transfer delay
     • Mobile Issue
       • Handoff
     • Annotations on the slide
       • Applying Access Bridge as stable storage
       • Applying mobile host as stable storage
       • Uncoordinated checkpointing
       • Pessimistic message logging
       • A large number of system messages or a large size of information carried in one message
       • Checkpoints and logs collection

  9. Fault Tolerance Architecture [Figure. Labels: Mobile Side, Fixed Side; Mobile Host, Mobile Support Station, Static Server; Client Object, Object Replica; Terminal Bridge, Access Bridge, GIOP Tunnel; Multicast Messages; ORB, Recovery Mechanism, Logging Mechanism, and Platform layers]

  10. Mobile Host Handoff [Figure. Labels: Access Bridge 1, Access Bridge 2, Access Bridge 3, Home Location Agent; steps: Handoff, Location Update]

  11. Mobile Host Handoff [Figure. Labels: Access Bridge 1, Access Bridge 2, Access Bridge 3, Home Location Agent; steps: Location Update, Handoff]

  12. Mobile Host Crash [Figure. Labels: Access Bridge 1, Access Bridge 2, Access Bridge 3, Home Location Agent]

  13. Mobile Host Recovery [Figure. Labels: Access Bridge 1, Access Bridge 2, Access Bridge 3, Home Location Agent; steps: Reconnect; collect the last checkpoint and the subsequent message logs; sort by Ack. SN; replay messages]
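
The recovery steps named on this slide (collect the last checkpoint and the later message logs, sort by acknowledgement sequence number, replay) can be sketched as follows. This is an illustrative reading of the slide, not the presented system: `AccessBridgeStore`, `recover_mobile_host`, the `(sequence_number, value)` record layout, and the assumption that a checkpoint is tagged with the Ack. SN of the last message it covers are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class AccessBridgeStore:
    """Hypothetical stable storage held by one access bridge."""
    name: str
    checkpoints: list = field(default_factory=list)  # (ack_sn, state)
    logs: list = field(default_factory=list)         # (ack_sn, message)


def recover_mobile_host(bridges, apply_message, initial_state):
    # Collect every checkpoint left behind on the visited access bridges;
    # fall back to the initial state (SN 0) if none was ever taken.
    candidates = [(0, initial_state)]
    for bridge in bridges:
        candidates.extend(bridge.checkpoints)
    last_sn, state = max(candidates, key=lambda c: c[0])

    # Collect the logged messages that follow the last checkpoint.
    pending = [entry for bridge in bridges for entry in bridge.logs
               if entry[0] > last_sn]
    # Sort by acknowledgement sequence number, then replay in order.
    pending.sort(key=lambda entry: entry[0])
    for _, message in pending:
        state = apply_message(state, message)
    return state


# The mobile host visited three access bridges before crashing.
ab1 = AccessBridgeStore("ab1", checkpoints=[(2, 3)], logs=[(1, 1), (2, 2), (3, 3)])
ab2 = AccessBridgeStore("ab2", logs=[(4, 4), (6, 6)])
ab3 = AccessBridgeStore("ab3", logs=[(5, 5)])

recovered = recover_mobile_host([ab1, ab2, ab3], lambda s, m: s + m, initial_state=0)
print(recovered)  # 21
```

Sorting after collection matters in this sketch because handoffs scatter the log across bridges, so no single bridge holds the messages in order.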

  14. Outline • Motivation • Wireless CORBA • Fault Tolerant Wireless CORBA • Performance and Availability Analysis • Conclusions and Future Work

  15. Assumptions
     • Failure occurrences, message arrivals, and handoff events each follow a homogeneous Poisson process with its own rate parameter
     • Failures do not occur while the program is in the repair or rollback process
     • A failure is detected as soon as it occurs
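
As a rough illustration of the first assumption, a homogeneous Poisson process can be sampled by accumulating exponentially distributed inter-event times. The sketch below does this for three independent event types. The function names and the rate values are hypothetical and chosen only for the example; they do not come from the presentation.

```python
import random


def poisson_event_times(rate, horizon, rng):
    """Event times of a homogeneous Poisson process on [0, horizon)."""
    times = []
    t = rng.expovariate(rate)  # first inter-event time ~ Exp(rate)
    while t < horizon:
        times.append(t)
        t += rng.expovariate(rate)
    return times


def merged_timeline(rates, horizon, seed=0):
    """Merge independent Poisson processes into one time-ordered event list."""
    rng = random.Random(seed)
    events = []
    for kind, rate in rates.items():
        events.extend((t, kind) for t in poisson_event_times(rate, horizon, rng))
    return sorted(events)


# Hypothetical rates (events per time unit), for illustration only.
rates = {"failure": 0.01, "message": 1.0, "handoff": 0.1}
timeline = merged_timeline(rates, horizon=100.0)
for time, kind in timeline[:5]:
    print(f"{time:8.3f}  {kind}")
```

The sketch ignores the second assumption: it does not suppress failures during repair or rollback, which a fuller simulation of the model would need to do.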

  16. Execution without Checkpointing [Figure: execution timeline from 0 to t. Symbols: Z0, H1 ... Hk, F1 ... Fj, m0(N), m1(n1), mj(1), mj(N), Y0, X0, X(N). Legend: R = Repair, H = Handoff]

  17. Conditional Execution Time & LST

  18. LST and Expectation of Program Execution Time

  19. Execution with Equi-number Checkpointing [Figure: execution timeline from 0 to t between checkpoints Ci-1 and Ci. Symbols: Zi(0), Hi(1) ... Hi(k), Fi(1) ... Fi(j), mi0(a), mi1(ni1), mij(1), mij(a), Yi(0), Xi(0), Xi(N,a). Legend: Checkpointing, Repair + Rollback, Handoff]

  20. Conditional Execution Time & LST

  21. LST and Expectation of Program Execution Time

  22. Average Availability
     • Uptime interval: the program produces useful work towards its completion
     • Downtime interval:
       • Repair and rollback
       • Handoff
       • Checkpoint creation
       • Wasted computation
     • Average availability: how much of the time an MH is in an uptime interval during an execution
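
Read literally, the definition above is the fraction of the total execution time spent in uptime intervals. The sketch below computes that ratio from the four downtime components listed on the slide. The function name, parameter names, and the numbers in the example are hypothetical; the closed-form expression derived in the presentation is not reproduced here.

```python
def average_availability(useful_work, repair_rollback, handoff,
                         checkpointing, wasted_computation):
    """Fraction of the execution spent doing useful work (uptime / total)."""
    downtime = repair_rollback + handoff + checkpointing + wasted_computation
    total = useful_work + downtime
    if total <= 0:
        raise ValueError("total execution time must be positive")
    return useful_work / total


# Hypothetical durations in arbitrary time units.
a = average_availability(useful_work=80.0, repair_rollback=5.0, handoff=5.0,
                         checkpointing=4.0, wasted_computation=6.0)
print(a)  # 0.8
```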

  23. Optimal Checkpointing Interval

  24. Beneficial Condition

  25. Equi-number Checkpointing
     • Equi-number checkpointing with respect to message number
       • The number of messages in each checkpointing interval does not change
     • Equi-number checkpointing with respect to checkpoint number
       • The number of checkpoints does not change
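
One way to read the two variants: in the first, the interval length (messages per checkpoint) stays fixed as the total workload grows, so the number of checkpoints grows; in the second, the number of checkpoints stays fixed, so the interval length grows. The sketch below computes checkpoint positions under that reading. The function names are hypothetical, and placing a checkpoint after the last message of every interval is an assumption.

```python
def checkpoints_by_message_number(total_messages, messages_per_interval):
    """Checkpoint after every `messages_per_interval` processed messages."""
    if total_messages <= 0 or messages_per_interval <= 0:
        raise ValueError("arguments must be positive")
    return list(range(messages_per_interval, total_messages + 1,
                      messages_per_interval))


def checkpoints_by_checkpoint_number(total_messages, num_checkpoints):
    """Split the run into `num_checkpoints` intervals of equal message count."""
    if num_checkpoints <= 0 or total_messages % num_checkpoints != 0:
        raise ValueError("total_messages must be a positive multiple of num_checkpoints")
    interval = total_messages // num_checkpoints
    return checkpoints_by_message_number(total_messages, interval)


# Doubling the workload from 12 to 24 messages:
print(checkpoints_by_message_number(12, 3))     # [3, 6, 9, 12]
print(checkpoints_by_message_number(24, 3))     # [3, 6, 9, 12, 15, 18, 21, 24]
print(checkpoints_by_checkpoint_number(12, 4))  # [3, 6, 9, 12]
print(checkpoints_by_checkpoint_number(24, 4))  # [6, 12, 18, 24]
```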

  26. Equi-number Checkpointing with respect to Checkpoint Number

  27. Equi-number Checkpointing with respect to Message Number

  28. Comparison Between Checkpointing and Without Checkpointing

  29. Average Availability vs. Message Arrival Rate and Handoff Rate

  30. Conclusions
     • Fault tolerant wireless CORBA
     • Equi-number checkpointing strategy
     • LST and expectation of program execution time
     • Average availability
     • Optimal checkpointing interval
     • Beneficial condition

  31. Future Work
     • Analysis model
       • The message queuing effect during repair and recovery
     • Failure detector
       • Distributed consensus with link failures, process failures, and mobile disconnections
       • Leads to a faster solution
       • Reduces communication costs
     • Fault tolerance in ad hoc networks
       • Without infrastructure support
       • Self-organizing and adaptive

  32. Thank You
