Troubleshooting
The troubleshooting was done with a Cisco IOS software release released before the Cisco bug ID CSCdr61598 (registered customers only) and Cisco bug ID CSCdu08678 (registered customers only) integration.
Single Area Network
This image is a representation of the single area network described in these steps.
- Issue the show proc cpu | include OSPF command. This allows you to see the OSPF processes that utilize the CPU.r4#show proc cpu | include OSPF3 4704 473 9945 1.38% 0.81% 0.68% 0 OSPF Hello71 9956 1012 9837 1.47% 1.62% 1.41% 0 OSPF RouterAs seen in the previous example, there is high CPU for OSPF. This shows that there must be something wrong with either the link stability or a duplicate router-id.
- Issue the show ip ospf statistics command. This allows you to see if the SPF algorithm is run more than ordinary.r4#show ip ospf statisticsArea 0: SPF algorithm executed 46 timesSPF calculation timeDelta T Intra D-Intra Summ D-Summ Ext D-Ext Total Reason00:01:36 0 0 0 0 0 0 0 N,00:01:16 0 0 0 0 0 0 0 R, N,00:01:26 0 0 0 0 0 0 0 R, N,00:00:56 0 0 0 0 0 0 0 R, N,00:01:06 0 0 0 0 0 0 0 R, N, 00:00:46 0 0 0 0 0 0 0 R, N,00:00:26 0 0 0 0 0 0 0 R, N,00:00:36 0 0 0 0 0 0 0 R, N, kmbgvc 00:00:16 0 0 0 0 0 0 0 R, N,00:00:06 0 0 0 0 0 0 0 R, N,The show ip ospf statistics command shows that recalculation of SPF is done every 10 seconds, as seen in the previous example. It is triggered by the router and network LSA. There is a problem in the same area as the current router.
- Issue the show ip ospf database command.r4#show ip ospf databaseOSPF Router with ID (50.0.0.4) (Process ID 1)Router Link States (Area 0)Link ID ADV Router Age Seq# Checksum Link count50.0.0.1 50.0.0.1 681 0x80000002 0x7E9D 350.0.0.4 50.0.0.4 705 0x80000003 0x83D 450.0.0.2 50.0.0.2 674 0x80000004 0x2414 5 50.0.0.5 50.0.0.5 706 0x80000003 0x5C24 6192.168.2.6 50.0.0.6 6 0x8000007A 0xABC7Link ID ADV Router Age Seq# ChecksumThe show ip ospf database command shows that one LSA is newer (age 16) and its sequence number is much higher then the other LSAs in the same OSPF database. You need to figure out which router sent this LSA. Since it is in the same area, the advertising router id is known (50.0.0.6). It is more probable that this router ID is duplicated. You need to find out which other router has the same router-id.
- This example shows several instances of the show ip ospf database command.r4#show ip ospf database router adv-router 50.0.0.6OSPF Router with ID (50.0.0.4) (Process ID 1)Router Link States (Area 0) LS age: 11Link State ID: 50.0.0.6Options: (No TOS-capability, DC) LS Type: Router Links Advertising Router: 50.0.0.6Link connected to: a Transit NetworkLS Seq Number: 800000C0 Checksum: 0x6498 Length: 72 Number of Links: 4Number of TOS metrics: 0(Link ID) Designated Router address: 192.168.2.6 (Link Data) Router Interface address: 192.168.2.6 TOS 0 Metrics: 10(Link ID) Neighboring Router ID: 50.0.0.7Link connected to: another Router (point-to-point)(Link ID) Network/subnet number: 192.168.0.20Link connected to: a Stub Network (Link Data) Network Mask: 255.255.255.252Link connected to: a Stub NetworkNumber of TOS metrics: 0 TOS 0 Metrics: 64 (Link ID) Network/subnet number: 50.0.0.6r4#show ip ospf database router adv-router 50.0.0.6(Link Data) Network Mask: 255.255.255.255 Number of TOS metrics: 0 TOS 0 Metrics: 1OSPF Router with ID (50.0.0.4) (Process ID 1)Router Link States (Area 0) LS age: 7Link State ID: 50.0.0.6Options: (No TOS-capability, DC) LS Type: Router Links Advertising Router: 50.0.0.6LS Seq Number: 800000C7!--- The sequence number has increased.Checksum: 0x4B95Length: 96!--- The number of links has increased although the network has been stable.Link connected to: a Stub Network(Link ID) Network/subnet number: 192.168.3.0(Link Data) Network Mask: 255.255.255.0TOS 0 Metrics: 10Number of TOS metrics: 0(Link ID) Neighboring Router ID: 50.0.0.5Link connected to: another Router (point-to-point)TOS 0 Metrics: 64(Link Data) Router Interface address: 192.168.0.9 Number of TOS metrics: 0(Link Data) Network Mask: 255.255.255.252Link connected to: a Stub Network (Link ID) Network/subnet number: 192.168.0.8 Number of TOS metrics: 0(Link ID) Neighboring Router ID: 50.0.0.2TOS 0 Metrics: 64 Link connected to: another Router (point-to-point) (Link Data) Router Interface address: 192.168.0.2(Link ID) Network/subnet number: 192.168.0.0Number of TOS metrics: 0 TOS 0 Metrics: 64 Link connected to: a Stub Network (Link Data) Network Mask: 255.255.255.252(Link Data) Network Mask: 255.255.255.255Number of TOS metrics: 0 TOS 0 Metrics: 64 Link connected to: a Stub Network (Link ID) Network/subnet number: 50.0.0.6 Number of TOS metrics: 0TOS 0 Metrics: 1
- If you know your network, you can find which router advertises those links. The first previous output shows that the LSAs are sent by a router with OSPF neighbors 50.0.0.7, whereas the second output shows neighbors 50.0.0.5 and 50.0.0.6. Issue the show ip ospf command in order to find those routers and access them in order to verify their OSPF router-id. In this example setup, they are R6 and R3.3>show ip ospfRouting Process "ospf 1" with ID 50.0.0.6Supports only single TOS(TOS0) routesr6#show ip ospfSupports opaque LSASupports opaque LSASupports only single TOS(TOS0) routes
- Issue the show run | beg router ospf command in order to check the configuration that starts at the OSPF configuration.R6#show run | include router ospfrouter ospf 1router-id 50.0.0.6log-adjacency-changesnetwork 50.0.0.0 0.0.0.255 area 0network 192.168.0.0 0.0.0.255 area 0network 192.168.2.0 0.0.0.255 area 0r3#show run | begin router ospflog-adjacency-changesnetwork 50.0.0.0 0.0.0.255 area 0network 192.168.0.0 0.0.0.255 area 0network 192.168.3.0 0.0.0.255 area 0In the previous example, the router-id command was removed and the OSPF process was not restarted. The same problem can also result from a loopback interface that is removed and configured somewhere else.
- Issue the clear ip ospf 1 process command and the show ip ospf command in order to clear the process.r3#clear ip ospf 1 processReset OSPF process? [no]: yr3#show ip ospfRouting Process "ospf 1" with ID 50.0.0.6Supports only single TOS(TOS0) routesSupports opaque LSAAs shown in the previous example, the wrong IP address still appears.
- Issue the show ip int brie command in order to check the interface.r3#show ip int brieInterface IP-Address OK? Method Status ProtocolEthernet0/0 192.168.3.1 YES NVRAM up upSerial2/0 192.168.0.9 YES NVRAM up upSerial1/0 192.168.0.2 YES NVRAM up up Loopback0 unassigned YES NVRAM up upIn order to correct the problem, make sure that either the highest loopback configured on the router is unique in your OSPF network, or configure statically the router-id with the router-id <ip address> command under the OSPF router configuration mode.
Multiple Areas with ASBR
The symptoms of these problems are that the external route, which is learned through the redistribution from static into OSPF process by R6, ASBR router flaps from the routing table on all routers within OSPF Area 0. The external route is 120.0.0.0/16 and the problem is noticed on Router 5 in Area 0. Start to troubleshoot from there.
- Issue the show ip route command a few times consecutively in order to see the symptom.r5#show ip route 120.0.0.0Routing entry for 120.0.0.0/16, 1 known subnetsO E2 120.0.0.0 [110/20] via 192.168.0.9, 00:00:03, Serial2/0r5#show ip route 120.0.0.0% Network not in tabler5#
- Take a look at the OSPF database in order to check whether the LSA is received. If you issue the show ip ospf databasecommand several times in a row, you notice that the LSA is received by two routers, 50.0.0.6 and 50.0.0.7. If you look at the age of the second entry, if present, you notice that its value changes dramatically.r5#show ip ospf database | begin Type-5Type-5 AS External Link StatesLink ID ADV Router Age Seq# Checksum Tag120.0.0.0 50.0.0.6 2598 0x80000001 0xE10E 0r5#show ip ospf database | begin Type-5120.0.0.0 50.0.0.7 13 0x80000105 0xD019 0Link ID ADV Router Age Seq# Checksum Tag120.0.0.0 50.0.0.6 2599 0x80000001 0xE10E 0120.0.0.0 50.0.0.7 14 0x80000105 0xD019 0r5#show ip ospf database | begin Type-5Type-5 AS External Link StatesLink ID ADV Router Age Seq# Checksum Tag120.0.0.0 50.0.0.6 2600 0x80000001 0xE10E 0r5#show ip ospf database | begin Type-5120.0.0.0 50.0.0.7 3601 0x80000106 0x6F6 0Link ID ADV Router Age Seq# Checksum Tag120.0.0.0 50.0.0.6 2602 0x80000001 0xE10E 0r5#show ip ospf database | begin Type-5Type-5 AS External Link StatesLink ID ADV Router Age Seq# Checksum Tag120.0.0.0 50.0.0.6 2603 0x80000001 0xE10E 0r5#
- You also notice strange behavior if you look at the sequence number for the LSAs that are received from 50.0.07, which is the advertising router. Review what other LSAs are received from 50.0.0.7. If you issue the show ip ospf database adv-router 50.0.0.7 command several times in a row, the entries vary quickly, as shown in this example.r5#show ip ospf database adv-router 50.0.0.7OSPF Router with ID (50.0.0.5) (Process ID 1)Router Link States (Area 0)Link ID ADV Router Age Seq# Checksum Link count50.0.0.7 50.0.0.7 307 0x8000000D 0xDF45 6 Type-5 AS External Link States120.0.0.0 50.0.0.7 9 0x8000011B 0xA42F 0Link ID ADV Router Age Seq# Checksum Tagr5#show ip ospf database network adv-router 50.0.0.7OSPF Router with ID (50.0.0.5) (Process ID 1)r5#show ip ospf database network adv-router 50.0.0.7OSPF Router with ID (50.0.0.5) (Process ID 1)This last output does not show anything. Either the route is flapping or there is a problem of another kind, most probably a duplicate router ID within the OSPF domain.
- Issue the show ip ospf database command in order to view the external LSAs advertised by 50.0.0.7.r5#show ip ospf database external adv-router 50.0.0.7OSPF Router with ID (50.0.0.5) (Process ID 1)Type-5 AS External Link StatesOptions: (No TOS-capability, DC)Delete flag is set for this LSA LS age: MAXAGE(3600)Advertising Router: 50.0.0.7LS Type: AS External Link Link State ID: 120.0.0.0 (External Network Number )Metric Type: 2 (Larger than any link state path)LS Seq Number: 80000136 Checksum: 0xA527 Length: 36 Network Mask: /16 TOS: 0r5#show ip ospf database external adv-router 50.0.0.7Metric: 16777215 Forward Address: 0.0.0.0 External Route Tag: 0OSPF Router with ID (50.0.0.5) (Process ID 1)r5#
- Look at the SPF calculation reasons in order to verify this. X means that SPF runs every 10 seconds because of an External LSA (type 5) flap and indeed, you see that SPF runs.r5#show ip ospf statisticArea 0: SPF algorithm executed 2 timesSPF calculation timeDelta T Intra D-Intra Summ D-Summ Ext D-Ext Total Reason00:47:23 0 0 0 0 0 0 0 X00:33:21 0 0 0 0 0 0 0 X00:46:33 0 0 0 0 0 0 0 X 00:32:05 0 0 0 0 0 0 0 X00:10:03 0 0 0 0 0 0 0 R, SN, X00:10:13 0 0 0 0 0 0 0 R, SN, X 00:09:53 0 0 0 0 0 0 0 R,00:09:23 0 0 0 0 0 0 0 X00:09:43 0 0 0 0 0 0 0 R, SN, X00:09:33 0 0 0 0 0 0 0 X
- It is known that the problem is outside the current area. Turn your focus on the ABR. Telnet to the ABR Router 2 in order to have more visibility on other areas than OSPF area 0. Issue the show ip ospf border-routers and show ip ospf database network adv-router commands.r2#show ip ospf border-routersOSPF Process 1 internal Routing TableCodes: i - Intra-area route, I - Inter-area routei 50.0.0.7 [20] via 192.168.2.1, Ethernet0/0, ASBR, Area 1, SPF 25r2#show ip ospf database network adv-router 50.0.0.7OSPF Router with ID (50.0.0.2) (Process ID 1)Net Link States (Area 1)Options: (No TOS-capability, DC)Routing Bit Set on this LSA LS age: 701 LS Type: Network LinksAdvertising Router: 50.0.0.7Link State ID: 192.168.1.2 (address of Designated Router) LS Seq Number: 80000001 Checksum: 0xBC6B Length: 32Attached Router: 50.0.0.1Network Mask: /24Attached Router: 50.0.0.7
- The faulty router is on the same LAN as 50.0.0.1. It must be Router 6. Issue the show ip ospf command.r6#show ip ospfRouting Process "ospf 1" with ID 50.0.0.7Supports only single TOS(TOS0) routesIt is an autonomous system boundary router.Supports opaque LSA
- Once the faulty router is found, refer to the Single Area Network section of this document to correct the problem.
Error Message: %OSPF-4-FLOOD_WAR: Process 60500 flushes LSA ID 10.x.x.0 type-5 adv-rtr 10.40.x.x in area 10.40.0.0
The %OSPF-4-FLOOD_WAR: Process 60500 flushes LSA ID 10.35.70.4 type-5 adv-rtr 10.40.0.105 in area 10.40.0.0error message is received.
This error message states that the router originates or flushes LSA at a high rate. A typical scenario in a network may be where one router in the network originates LSA and the second router flushes that LSA. A detailed description of this error message is provided here:
- Process 60500 - The OSPF process that reports the error. In this example, the process ID is 60500.
- re-originates or flushes (keyword) - Indicates if the router originates LSA or flushes. In this error message, the routerflushes LSA.
- LSA ID 10.35.70.4 - Link state ID for which a flood war is detected. In this example, it is 10.35.70.4.
- type -5 - LSA type. This example has a Type 5 LSA.Note: A flood war has a different root cause for every LSA.
- adv-rtr - Router which originates LSA (that is, 10.40.0.105).
- Area - Area to which the LSA belongs. In this example, the LSA belongs to 10.40.0.0.
No comments:
Post a Comment