1 / 11

Automating Network Diagnostics to Help End-Users

Automating Network Diagnostics to Help End-Users. Dave Thaler dthaler@microsoft.com. Motivation. User needs are becoming more dependent on the Internet Problems can occur anywhere in the Internet Users have no control over these problems, and often neither do their direct providers!

robert-ward
Télécharger la présentation

Automating Network Diagnostics to Help End-Users

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Automating Network Diagnostics to Help End-Users Dave Thaler dthaler@microsoft.com

  2. Motivation • User needs are becoming more dependent on the Internet • Problems can occur anywhere in the Internet • Users have no control over these problems, and often neither do their direct providers! • Increased support calls are NOT the answer

  3. Poor error messages today • User doesn’t understand the problem, only the symptom • Error message pop-ups often aren’t helpful to users • Event logs aren’t really any better • Technical information is only useful to technical experts who don’t have access to end-users’ machines

  4. Goals • Reduce number of support calls • Help the user/app help itself where possible • Locate the correct party to contact if not • Reduce the time spent on support calls that do occur

  5. Focus on what the user wants! • User doesn’t want to have to call anyone • User doesn’t want to get email about an outage • User doesn’t want to go to some web site to find out • User just wants the application to work • If it doesn’t work, user often wants to know when it will work • If application is non-interactive, application may want to retry as soon as it is fixed (e.g. search engine) • User also wants to know that someone is working on fixing the problem • If user policy decision needed, user wants choices

  6. Multiple Adminstrative Entities • Policy Principles • Freedom of information: Outage info should be available to all those affected by it. • Privacy: Outage info should be available only to those affected by it. • Freedom of speech: Any entity should be able to report a problem, whether or not it is trusted. • Conservation of effort: Perform the minimum work needed to troubleshoot the problem.

  7. Solution Framework • Self-diagnosing, self-healing: • Naming: identification of a problem instance • Message routing: getting problem instance to capable agent • Methodology: structured process for confirming, diagnosing, repairing, etc. • Domain-specific classes/agents • Self-improving: • Learning what possible causes are the most likely • Learning what is normal/abnormal within a class • Reporting on agent behavior for improvements

  8. Architecture Application or Monitoring Tool Client API Engine Protocol Engine Helper Class Helper Class Helper Class Helper Class Component Component Component Component

  9. Separation of Roles • Engine • Maintains cause-effect tree • Handles message routing • Implements core methodology • Helper Classes • Implement simple API • Embed knowledge about one component type • Generate causal hypotheses that are treated just like client reports

  10. Example TCPIP-related Helper Classes

  11. Summary • It’s not just about one network being self-managing • We need to improve the end-user experience • Handling multiple administrative domains is a core issue • Structured methodology aids in problem analysis • Today too many things are ad hoc • High-level methodology should be independent of component-specific knowledge • Provides extensibility • Facilitates appropriate learning at both levels

More Related