Part IV: Network Survivability and Resilience
4.1 The Imperative for Optical Layer Protection
Telecommunication networks are engineered to meet exceptionally high availability requirements, often demanding 99.999% uptime, which translates to less than six minutes of downtime per year. Consequently, network survivability is a primary design consideration. Modern networks are built in layers, with client layers like IP, ATM, and SDH operating over a foundational Optical WDM layer. This raises a central question: Why is optical layer protection needed when higher client layers already have their own protection mechanisms?
The answer lies in a combination of efficiency, cost-effectiveness, and enhanced resilience. There are several powerful justifications for implementing protection directly at the optical layer:
- Client Layer Insufficiency: Some client layers have protection mechanisms that are slow or incomplete. IP routing, for example, can take seconds or longer to converge after a failure, which is too slow for real-time services. The optical layer can provide protection in tens of milliseconds.
- Legacy Equipment Support: Implementing optical layer protection can instantly upgrade the survivability of a network built with legacy client equipment that lacks its own protection capabilities.
- Enhanced Resilience: It provides an additional, independent layer of protection, which can be crucial for surviving multiple, simultaneous network failures.
- Operational Efficiency: When a single fiber carrying dozens of wavelengths is cut, it is far more efficient to restore the entire optical path in one action at the optical layer than for dozens of higher-level streams to each independently trigger alarms and initiate their own recovery procedures.
- Cost Savings: By sharing protection resources at the high-capacity optical layer, significant cost benefits can be realized compared to dedicating protection resources for every individual client-layer service.
Despite these powerful justifications, optical layer protection is not a panacea and has specific limitations that must be understood to create a comprehensive survivability strategy.
4.2 Limitations and Key Definitions
A comprehensive survivability strategy requires understanding the specific limitations of optical layer protection to avoid a false sense of security. Protection at the optical layer alone cannot solve every problem.
Key limitations include:
- It cannot handle client-side faults. For example, if the laser on a router connected to the optical network fails, that failure must be handled by the IP layer.
- It may not detect certain signal degradations. In a transparent light path, the optical layer might be unaware of a high bit error rate (BER) that would normally trigger a protection switch.
- It cannot provide granular protection. The optical layer protects an entire light path as a single unit and cannot differentiate between high-priority and low-priority traffic within that path.
- Link budget constraints can limit the length or complexity of protection routes.
- There is a risk of “race conditions” where the client layer and the optical layer both attempt to protect against the same failure simultaneously, potentially leading to instability.
To discuss survivability schemes accurately, it is essential to differentiate between two key terms: Protection and Restoration.
| Feature | Protection | Restoration |
| Speed | Very fast (typically < 60 ms) | Slower (seconds to minutes) |
| Planning | Pre-planned, pre-assigned backup resources | Can be computed on-demand after a failure occurs |
| Control Mechanism | Distributed, automated by network elements | Often centrally controlled by a management system |
| Bandwidth Usage | Uses 100% dedicated backup bandwidth | Can use algorithms to reduce excess bandwidth |
| Typical Topology | Simple, often linear or ring architectures | Can support complex mesh architectures |
Within the optical layer itself, protection schemes are broadly divided into two categories: those that operate at the line layer and those that operate at the path layer.
4.3 Comparative Analysis: Path Layer vs. Line Layer Protection
The optical layer can be divided into sublayers, which provides the basis for the distinction between path-layer and line-layer protection.
- Path Layer Protection: Operates on individual optical channels, or light paths. It protects a single wavelength from its source to its destination.
- Line Layer Protection: Operates on the aggregate WDM signal, known as the Optical Multiplex Section (OMS). It protects the entire group of wavelengths on a fiber link between two nodes.
The choice between these two approaches involves significant trade-offs in cost, flexibility, and bandwidth efficiency.
| Criterion | Line Protection | Path Protection |
| Protects Against | Interoffice facility (fiber) cuts, Site/node failures | Interoffice facilities, Site/node failures, Equipment failures |
| Number of Fibers | Typically four (working + protection pair) | Two (protection path can be routed over any available fiber) |
| Can handle single path failure? | No | Yes |
| Supports unprotected traffic? | No, all traffic is protected as a bundle. | Yes, protection can be provisioned on a per-channel basis. |
| Equipment Cost | Low (shared protection switching equipment) | High (requires duplicated equipment for each protected channel) |
| Bandwidth Efficiency | Good for fully protected traffic | Low if all traffic is protected, but efficient for mixed services |
The fundamental trade-off is between cost and flexibility. The choice is a strategic one for the service provider: Line protection optimizes for homogenous, high-reliability transport (like a highway for freight trucks), while path protection optimizes for a flexible, service-oriented network that can offer different tiers of survivability to different customers (like a city with local roads, bus lanes, and expressways).
4.4 Detailed Survey of Protection and Restoration Techniques
A variety of specific protection and restoration schemes have been developed, categorized by the layer at which they operate.
Path Layer Schemes
These schemes operate on individual light paths.
- 1+1 Path Protection (OUPSR): Also known as Optical Unidirectional Path Switched Ring. The signal is permanently bridged onto two diverse fiber paths (working and protection). The receiving node constantly monitors both and selects the better signal.
- Bidirectional Path Switched Ring (OBPSR): Also known as Optical Channel Shared Protection Ring (OCh/SPRing). A bandwidth-efficient ring scheme where multiple light paths can share a common pool of protection bandwidth.
- Mesh Path Protection/Restoration: In a mesh topology, a backup path can be pre-calculated and reserved for each working light path. In the event of a failure, signaling between nodes activates the backup path.
- 1:N Equipment Protection: This scheme protects against the failure of a transponder card. A single spare transponder is designated to protect a group of ‘N’ working transponders.
Line Layer Schemes
These schemes operate on the entire group of multiplexed wavelengths.
- 1+1 Linear Protection: The entire WDM signal is bridged onto two diverse fiber facilities. The receiving terminal selects the better of the two aggregate signals.
- 1:1 Linear Protection: The signal is only sent down either the working or the protection path at any given time. This allows low-priority traffic to be carried on the protection path when it is not in use.
- Optical Unidirectional Line Switched Ring (OULSR): The line-layer equivalent of OUPSR, where the entire WDM signal is duplicated and sent in both directions around a ring.
- Bidirectional Line Switched Ring (OBLSR): Also known as Optical Multiplex Section Shared Protection Ring (OMS/SPRing). This scheme uses four fibers and allows for the reuse of protection bandwidth around the ring.
- Mesh Line Protection/Restoration: This approach uses all-optical cross-connects to divert the entire WDM signal from a failed link onto an alternate route through the mesh network.
These protection schemes are essential for providing the high reliability demanded of modern networks. The next evolution in optical networking builds upon this foundation by adding dynamic flexibility and reconfigurability.