230 likes | 331 Vues
This paper delves into the impact of network failures and the importance of network survivability management systems in maintaining network performance and reliability. Key components of the proposed system, such as network survivability status monitoring and control, play a vital role in ensuring efficient resource utilization and timely restoration of network failures. The comparison between parallel and sequential activation mechanisms sheds light on resource utilization efficiency and restoration time factors. The use of agents for network task management and cooperation is explored as a promising approach to handle complex network operations effectively. Furthermore, the advantages and disadvantages of agent-based representations over conventional object-based systems are discussed, highlighting the benefits of reusability, increased throughput, and personalized information retrieval. The agent cooperation model outlines the essential functions for managing multi-agent tasks efficiently.
E N D
Optical Network Management Perspective for Quality Assurance Dr. Ardian Greca Yamacraw Assistant Professor Department of Computer Sciences Georgia Southern University Phone: (912) 681 0170 naidrag@ieee.org
Introduction (The impact of network failures) Conclusion: A single cable cut can lead to a dramatic amount of lost traffic
Proposed TMN – based Network Survivability Management System
Proposed System Components • NSMS should be able: • to evaluate the response procedures • replan the response procedure when the network topology changes, • update restoration techniques, • or if the required survivability requirement is not met yet • analyze the performance • failure the recovery can be static or dynamic. • The static technique performs the recovery by using the pool capacity strategy. • The dynamic recovery uses more intelligent techniques and leads to effective local reconfigurations. • NSMS has a spare resource allocation control function to ensure efficient resource utilization
Proposed System Components • Network survivability status monitor • Surveillance • detect the NE under failure • which part and layer are currently controlling the restoration process • how the restoration process deal with the failure • Control • control the number of connections that are rerouted and where are rerouted • which connection should not be rerouted at all • Test • initializes a survivability test (i.e., simulate failures • do not interrupt the network! • Evaluation • evaluates survivability process
Survivability Control System • The survivability control system (SCS) is a very important function of NSMS, because there are several failure types and restoration techniques for a network layer. • SCS will respond with the right restoration technique(s) to a failure, since the performance of one technique might not meet the overall survivability-cost requirements of the network.
Survivability Control System • Restoration technique function, • Recover the failure inside one layer without affecting other layers • Can use different techniques • Priority restoration function and • Choose appropriate technique • Cost effective technique • Alteration control. • Use parallel activation • Use sequential activation
Comparison between two activation mechanisms • Parallel activation: • Advantages • fast restoration, because there is no need to coordinate different techniques, • Disadvantage • could lead to inefficient resource utilization • causes resource waste after restoration. • NSMS should update spare resources in the network, but unfortunately this process is very complex and can lead to a disruption of new incoming calls. • Sequential activation • Advantages • does not have the above mentioned problems • can be easily controlled, • Disadvantages • might lead to a longer restoration time
Timing issues • time to detect the failure at layer i is TDi. This time, TDi, for the lowest layer determines the time to detect the failure, • elapsed time to generate an alarm indication message from layer i to the layer i+1 is TAi, • estimated restoration time for a given restoration technique in a given layer i is TRi,
Using Agents • Run computation- and communication- intensive applications in real time • End users interact with their applications when they are running • Applications • Telecommunications • Interactively steered high performance computations • Data mining • Distributed interactive simulations • Smart sensor and instruments • Math calculations • Etc…
Agents vs. conventional object-based representations • Why? • Re-use without Re-work • Increased throughput (provides parallel processing) • On-line reconfigured applications in response of user request or user behavior change • End users do not need to be aware of the current task representation • Information obtained can be personalized to the information seeker • Increases trust since information gathered from different agents • Know where to get the information • Better interaction … • Etc…
Agent Cooperation • the agent management platform will consist • on the functions of network task management, • agent task scheduling, • communication and agent management • agent generation, • dispatch, • clone, • registration and dismissal • Agents will be composed into sequent queue or parallel sub-tasks. • sub-tasks will be provided to agent scheduling function to perform the task schedule for multi-agents. • Agents will be produced and assembled with the service code and dispatched to perform a certain job. • free agents needed by the busy agent at the target node to help the busy agent to accomplish its task.
Agent Cooperation • From the network management requirements point of view, we design two kinds of intelligent agents. • One we call the common agent and • the other the scheduling agent. • Both agents will be equipped with knowledge database where they can store the information they got from the environment. • Agents communicate with each other or with the management platform through communication function. • Scheduling agents have a scheduling code to manage all agents in their groups. • Scheduling agents can give some new service code to free agents and make them to perform a new task.
Conclusions • Propose a network survivability management system as a new independent function structure, focusing on its architecture. • NSMS compound functions are established and described • NSMS will preplan and download to every NE the response procedure, by coordinating different restoration techniques in different layers. • uses the global survivability strategy • sequential activation mechanism to construct cost-effective survivable WDM networks. • The management function is based on the multi-agent framework, • and a coordination protocol is also proposed. • Future work • an integrated spare resources control that considers different restoration techniques, • and how to determine the best restoration techniques for a given failure. • strongly related with the network size, traffic pattern, and the network cost. • to analyze problems for communication between NSMS and agents when the network is heavily loaded.