Supervision Alarms by Layer
The table below lists the SD-WAN Orchestrator alarms by layer and briefly describes the recovery procedure to use when an alarm condition occurs.
As a reminder, the layers are as follows:
• Underlay: physical connection between devices (LAN, WAN), VRRP or HA state change, ip|engine configuration
• Overlay: connection between ip|engines or external gateways through IPsec tunnels
• Services: services provided with the ip|engines such as application visibility, application control, WAN optimization, firewall
• AQS: site AQS for applications and site connectivity
• Resources: device local resources, i.e. hardware monitoring
• Management: connection to Azure components (Orchestrator, ZTP Server, etc.)
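For readers who post-process alarms outside the Orchestrator, the six layers listed above and the three severities used in the tables of this section map naturally onto a small lookup structure. The sketch below is illustrative only: the `Layer`, `Severity`, `Alarm`, and `triage` names are hypothetical, and the ordering Critical > Warning > Information is an assumption, not something the SD-WAN Orchestrator exposes through an API.

```python
from dataclasses import dataclass
from enum import Enum, IntEnum


class Layer(Enum):
    # Layer names as listed in this section, in document order.
    UNDERLAY = "Underlay"
    OVERLAY = "Overlay"
    SERVICES = "Services"
    AQS = "AQS"
    RESOURCES = "Resources"
    MANAGEMENT = "Management"


class Severity(IntEnum):
    # Assumed ordering: a higher value means more urgent.
    INFORMATION = 0
    WARNING = 1
    CRITICAL = 2


@dataclass(frozen=True)
class Alarm:
    name: str
    layer: Layer
    severity: Severity


def triage(alarms):
    """Return alarms sorted most severe first, then by layer order."""
    layer_order = {layer: index for index, layer in enumerate(Layer)}
    return sorted(alarms, key=lambda a: (-a.severity, layer_order[a.layer]))


# A few entries copied from the tables in this section.
SAMPLE = [
    Alarm("VRRP state change", Layer.UNDERLAY, Severity.INFORMATION),
    Alarm("Visibility down", Layer.SERVICES, Severity.WARNING),
    Alarm("Disconnected from Orchestrator", Layer.MANAGEMENT, Severity.CRITICAL),
    Alarm("Network interface down", Layer.UNDERLAY, Severity.CRITICAL),
]

for alarm in triage(SAMPLE):
    print(f"{alarm.severity.name:<11} {alarm.layer.value:<10} {alarm.name}")
```

Sorting by severity first and layer second is one possible triage order; working bottom-up from the Underlay may be preferable when several layers alarm at once, since upper layers depend on it.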
Underlay
Alarm | Severity | Troubleshooting |
---|---|---|
Network interface down [interface name] | Critical | Check physical connections with the device. |
Bad network interface configuration [interface name] | Critical | Check the interface configuration parameters. |
No IP address [Transport Network Identifier name] | Critical | Define a correct IP address for the configured WAN interface. |
No default gateway [Transport Network Identifier name] | Critical | Define a default gateway for the configured WAN interface. |
VRRP state change | Information | Status change alarm. |
HA state change | Information | Status change alarm. |
Configuration mismatch | Critical | There is a configuration version mismatch between the SD-WAN Orchestrator and the ip|engines. Contact Ipanema Support. |
HA Configuration mismatch | Critical | Refer to the Event History window to identify the issue. Check the Routing section of the Troubleshooting window (by clicking the icon on the Network -> Configuration window) and verify the status of the HA ip|engines. Fix your HA configuration. |
HA Peer unreachable | Critical | The HA connection may be broken due to an ip|engine reboot, an unplugged cable, a power failure or an incident on another client device (for example, port down on a switch). Contact Ipanema Support. |
Overlay
Alarm | Severity | Troubleshooting |
---|---|---|
Disconnected from the overlay | Warning | The specified site is fully isolated from the rest of the network (no overlay tunnels). Check your ip|engine configuration and define at least one IPsec tunnel. |
Tunnel failure (ip|engine) | Critical | Refer to the Event History window to identify the issue. Check the Tunnels section of the Troubleshooting window (by clicking the icon on the Network -> Configuration window) and verify the state of the GRE/IPsec tunnels. Fix your ip|engine configuration. |
External tunnel failure (External Gateway) | Critical | Check the configuration of the external gateway connection. |
CloudMesh failure | Critical | Contact Ipanema Support. |
EdgeSentry failure | Critical | Contact Ipanema Support. |
LAN BGP peering failure | Warning | Check the Local Peer IP address in the LAN and the Site AS number. |
Services
Alarm | Severity | Troubleshooting |
---|---|---|
Visibility down | Warning | Contact Ipanema Support. |
Control down | Warning | Contact Ipanema Support. |
WAN Optimization down | Warning | Contact Ipanema Support. |
Synchronization lost | Warning | Contact Ipanema Support. |
DTI traffic overload | Warning | The number of DTI connections exceeds 95% of the maximum threshold of authorized connections. The alarm is cleared when this value decreases. |
Connection to the SYSLOG server is lost | Warning | Check network connectivity between the SYSLOG server and the ip|engine. |
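The DTI traffic overload condition in the table above is a simple ratio test, sketched below for illustration. The function name and its parameters are hypothetical, and reading "cleared when this value decreases" as "cleared once the count is back at or below 95%" is an assumption; the product's actual clearing rule may differ.

```python
DTI_OVERLOAD_PERCENT = 95  # from the table: "exceeds 95% of the maximum"


def dti_overloaded(current_connections: int, max_connections: int) -> bool:
    """Return True when current connections exceed 95% of the maximum.

    Integer arithmetic avoids floating-point rounding at the boundary.
    """
    if max_connections <= 0:
        raise ValueError("max_connections must be positive")
    return current_connections * 100 > DTI_OVERLOAD_PERCENT * max_connections


# Hypothetical figures: a maximum of 1000 authorized DTI connections.
for count in (900, 950, 951):
    print(count, dti_overloaded(count, 1000))
```

With a maximum of 1000, the check stays false at exactly 950 connections and turns true at 951, matching the "exceeds" wording.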
AQS
Alarm | Severity | Troubleshooting |
---|---|---|
Site AQS for Top Applications dropped below 5 | Warning | The alarm is cleared when this value increases. |
Site AQS for High Applications dropped below 5 | Warning | The alarm is cleared when this value increases. |
End-to-end connectivity lost | Warning | Check end-to-end connectivity between Site A and Site B for the specified Transport Network (broken NAP). |
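The two Site AQS alarms above behave like a threshold alarm that raises when the score drops below 5 and clears later. A minimal sketch of that behaviour follows. The `AqsAlarm` class is hypothetical, and it assumes the alarm clears once the score is back at 5 or above; the table only says the alarm clears "when this value increases", so the exact clearing point is not confirmed.

```python
AQS_THRESHOLD = 5  # from the alarm text: "dropped below 5"


class AqsAlarm:
    """Tracks raise/clear transitions for one site and application group."""

    def __init__(self, threshold=AQS_THRESHOLD):
        self.threshold = threshold
        self.active = False

    def update(self, aqs):
        """Feed one AQS sample; return 'raised', 'cleared', or None."""
        if not self.active and aqs < self.threshold:
            self.active = True
            return "raised"
        # Assumption: the alarm clears at or above the threshold.
        if self.active and aqs >= self.threshold:
            self.active = False
            return "cleared"
        return None


# Hypothetical sequence of Site AQS samples.
alarm = AqsAlarm()
events = [alarm.update(score) for score in (7.2, 4.8, 4.1, 5.0, 6.3)]
print(events)
```

Reporting only transitions, rather than the state on every sample, avoids repeated notifications while the score stays below the threshold.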
Resources
Alarm | Severity | Troubleshooting |
---|---|---|
Disk is almost full (<5% left) on the volume [volume name] | Warning | For hardware resource alarms, contact Ipanema Support. |
Disk failure | Warning | |
Reboot | Information | |
Traffic overload | Warning | Throughput or the number of flows exceeds the capacity of the ip|engine, or packet loss occurs on Ethernet interfaces. Contact Ipanema Support. They will determine whether a more powerful ip|engine needs to be installed. |
Management
Alarm | Severity | Troubleshooting |
---|---|---|
Disconnected from Orchestrator | Critical | One or several SD-WAN platform components are disconnected (either never connected or not recently connected) from the Orchestrator. Contact Ipanema Support. |
Connectivity with Orchestrator impaired | Warning | One or several SD-WAN platform components are disconnected (either never connected or not recently connected) from the ZTP (Zero Touch Provisioning) server. Contact Ipanema Support. |