Loop-Free Alternate Routes

Whilst at Cisco Live recently I was intrigued to learn about an apparently simple yet effective fast routing convergence technique called Loop-Free Alternate Fast ReRoute (LFA-FRR), a.k.a. IP Fast ReRoute (IP-FRR).

Many service providers and enterprises these days tune their interior gateway protocol (IGP) timers, aiming to achieve sub-200ms convergence. This can be quite an intensive and design impacting challenge.

LFA-FRR claims to be able to automatically provide local sub-50ms convergence times and complement any other fast convergence tuning techniques that have been employed. Some literature even claims it to achieve sub-25ms convergence.

It works with link-state routing protocols, ie. OSPF and IS-IS. It can be configured on a single router and works by calculating one loop-free alternate backup path for every prefix. It then installs this path into the Routing Information Base (RIB) to provide local restoration in the event of a failure in the primary path. As it is prefix-independent, the convergence time is deterministic and doesn’t vary according to parameters such as the size of the routing table or link-state database.

The diagram below shows a small network with LFA-FRR configured on Router B. Router B is using its primary path to reach the destination prefix via Router D, but has also calculated a loop-free alternate path via Router C which it has pre-installed into its RIB.

lfa-frr-1

There is now a link failure between Routers B and D. As soon as Router B detects the failure, which could be due to loss of carrier or a keepalive mechanism such as Bi-directional Forwarding Detection (BFD), Router B begins using its loop-free alternate path via Router C to quickly restore traffic flow. If Router D is also configured for LFA-FRR, it will do likewise to restore traffic flow in the reverse direction.

LFA-FRR

For those of you familiar with EIGRP, you’ve probably already spotted that this is the same principle as Successors and Feasible Successors. Feasible Successors are loop-free alternates to the primary path which is the Successor. EIGRP also uses this technique to perform unequal cost load balancing by sending a percentage of the traffic to the Feasible Successor, so I wonder whether this could potentially be a future enhancement to LFA-FRR.

However, neither EIGRP nor LFA-FRR can always provide a loop-free alternate path; it depends on the network topology and metrics. Indications are that LFA-FRR may be able to provide protection of 70-85% of prefixes at the edge of a typical network.

In summary, LFA-FRR is easily configured on a router by a single command, calculates everything automatically, complements other fast routing convergence techniques, works on 70-85% of prefixes and doesn’t break anything, so why wouldn’t you want to use it?

Well, firstly it’s only currently available in IOS-XR, so it will be predominantly service providers using it for the time being. Due to the larger routing tables the routers may require more memory. There will also be some additional load on the router CPU, but this should be minimal as the LFA paths are calculated after the primary path.

In general, this technique appears to have good merit and be one for both service providers and enterprises to keep an eye on. However, it should be considered a ‘bonus’, as it is not a replacement for proper design and tuning to achieve fast routing convergence.

  • http://reaper81.wordpress.com Daniel

    Nice post, hopefully this will find its way to regular IOS also.Surprisingly there are few alternatives in some areas, we have a layer 2 network at a customer that runs EAPS. Convergence is in less than 50 ms range, doing this with STP is just not possible and it’s only now that REP is gaining some ground but only on metro type switches.

  • Brannen

    IOS-XR huh … I guess it’s going to be a while then – we’re still migrating to 12.4 from 12.2. Any IOS-XR running in the enterprise space?

  • Anonymous

    Anyone ever heard of BFD?

    • Tony Brown

      Good point. BFD complements LFA-FRR. However, BFD will only help detect the failure and withdraw the primary route. LFA-FRR pre-computes the backup path, so that it’s available as soon as that failure is detected.

  • http://blog.michaelfmcnamara.com Michael McNamara

    Hi Tony,

    Thanks for taking the time to introduce me to Loop-Free Alternate Routes.

    The problem for most mid-sized organizations isn’t the re-convergence time but the time it takes the network to detect the failure, “As soon as Router B detects the failure”, and start routing packets over the remaining paths. If you’re using a Metro Ethernet transport you’ll never get a link down so you need to rely on OSPF hello and dead router timers along with protocols such as BFD and VLACP (Avaya proprietary) to detect the communications breakdown with the remote peer. It becomes especially critical for healthcare organizations who are transmitting real-time patient telemetry which are usually UDP packets and will never been retransmitted if lost.

    On a different note I see our 30 days demo of full RSS feeds has expired. Any chance Greg will extend our evaluation another 30 days? I thought he had a pretty good response from his original post on the subject and I might have missed the subsequent follow-up (because I can’t see it in my RSS reader) Link – http://etherealmind.com/rss-feed-full-content/

    Thanks!

    • Tony Brown

      Hi Michael,

      Thanks for the comments, much appreciated! Youíre absolutely spot on; the recovery time is how long it takes to detect the failure, determine the next best path and update the forwarding tables. LFA-FRR will only help with the latter two aspects.

      Ultimately you want to detect link loss at layer 1 rather than relying on any keep-alive mechanism such as BFD. However, if youíre using Metro Ethernet you probably donít have any alternative, especially if it is multipoint. That topic could make a good article all by itself!

      Iíll forward your second comment to Greg.

      Regards,
      Tony

    • http://etherealmind.com Greg Ferro

      Hey Michael

      Yeah, it didn’t work out. I dropped a several hundred hits a day and the spam bloggers (see Jaluri.com) got the bulk of the views. Because I use feedburner I can’t block them and can’t afford to upgrade the hosting to handle the RSS feeds.

      So, partial feeds until I can find a better way. I need/want the views and the ad revenue to pay for hosting and software costs (small though the revenue is – feel free to click on them more often…..).

      I’m considering a ‘for fee’ feed. But it will take a while to sort something out and may be too hard to maintain anyway.

      greg

      • http://blog.michaelfmcnamara.com Michael McNamara

        I understand Greg… I fight those same battles probably just on a smaller scale than yourself. I will say that it doesn’t help your argument though when Jim Duffy of Network World (http://www.networkworld.com/slideshows/2011/012611-cisco.html#slide11) is promoting the one site you referenced above, but I understand the issue nonetheless.

        I’m about to learn about (click on) “Load Balancing 101″ from F5 even though I learned about it years ago from Alteon and have Cisco ACEs in place today. Hopefully it’ll benefit at least one of us.

        Cheers!

  • Pingback: Welcome John McManus and Tony Brown as Authors