Download IP: Addresses and Forwarding - RPI ECSE

Routing II: Protocols (RIP, EIGRP, OSPF, PNNI, IS-IS) Shivkumar Kalyanaraman Rensselaer Polytechnic Institute [email protected] Based in part upon slides of Prof. Raj Jain (OSU), S. Keshav (Cornell), J. Kurose (U Mass) Shivkumar Kalyanaraman 1 Overview         RIP, RIPv2, EIGRP OSPF, PNNI, IS-IS: LS efficiency & robustness  Link state distribution, DB synchronization, NBMAs etc Refs: Chap 16,14 Books: “Interconnections” by Perlman, “OSPF” by John Moy, “Routing in Internet” by Huitema. Reference: RFC 2328: OSPF Version 2: In HTML Reading: Notes for Protocol Design, E2e Principle, IP and Routing: In PDF Reading: Routing 101: Notes on Routing: In PDF | In MS Word Reference: Tsuchiya, "The Landmark Hierarchy: A New Hierarchy for Routing in Very Large Networks" Shivkumar Kalyanaraman 2 RIP: Routing Information Protocol    Uses hop count as metric (max: 16 is infinity) Tables (vectors) “advertised” to neighbors every 30 s.  Each advertisement: upto 25 entries No advertisement for 180 sec: neighbor/link declared dead  routes via neighbor invalidated  new advertisements sent to neighbors (Triggered updates)  neighbors in turn send out new advertisements (if tables changed)  link failure info quickly propagates to entire net  poison reverse used to prevent ping-pong loops (infinite distance = 16 hops) Shivkumar Kalyanaraman 3 RIPv1 Problems (Continued) Split horizon/poison reverse does not guarantee to solve count-to-infinity problem  16 = infinity => RIP for small networks only!  Slow convergence  Broadcasts consume non-router resources  RIPv1 does not support subnet masks (VLSMs)  No authentication  Shivkumar Kalyanaraman 4 RIPv2 Why ? Installed base of RIP routers  Provides:  VLSM support  Authentication  Multicasting  “Wire-sharing” by multiple routing domains,  Tags to support EGP/BGP routes.  Uses reserved fields in RIPv1 header.  First route entry replaced by authentication info.  Shivkumar Kalyanaraman 5 E-IGRP (Interior Gateway Routing Protocol)     CISCO proprietary; successor of RIP (late 80s) Several metrics (delay, bandwidth, reliability, load etc) Uses TCP to exchange routing updates Loop-free routing via Distributed Updating Alg. (DUAL) based on diffused computation  Freeze entry to particular destination  Diffuse a request for updates  Other nodes may freeze/propagate the diffusing computation (tree formation)  Unfreeze when updates received.  Tradeoff: temporary un-reachability for some destinations Shivkumar Kalyanaraman 6 Link State vs. Distance Vector  Link State (LS) advantages:  More stable (aka fewer routing loops)  Faster convergence than distance vector  Easier to discover network topology, troubleshoot network.  Can do better source-routing with link-state  Type & Quality-of-service routing (multiple route tables) possible  Caveat: With path-vector-type (paths instead of distances) DV routing, these differences blur… Shivkumar Kalyanaraman 7 Link State Protocols  Key: Create a network “map” at each node.  1. Node collects the state of its connected links and forms a “Link State Packet” (LSP) 2. Flood LSP => reaches every other node in the network and everyone now has a network map. 3. Given map, run Dijkstra’s shortest path algorithm (SPF) => get paths to all destinations 4. Routing table = next-hops of these paths. 5. Hierarchical routing: organization of areas, and filtered control plane information flooded.     Shivkumar Kalyanaraman 8 Link State Issues Reliable Flooding: sequence #s, age  LSA types, Neighbor discovery and maintainence (hello)  Efficiency in Broadcast LANs, NBMA, Pt-Mpt subnets: designated router (DR) concept  Areas and Hierarchy  Area types: Normal, Stub, NSSA: filtering  External Routes (from other ASs), interaction with inter-domain routing.  Advanced topics: incremental SPF algorithms  Shivkumar Kalyanaraman 9 Reliable Flooding… Shivkumar Kalyanaraman 10 Topology Dissemination A.k.a LSP distribution  1. Flood LSPs on links except incoming link  Require at most 2E transfers for n/w with E edges  2. Sequence numbers to detect duplicates  Why? Routers/links may go down/up  Issue: wrap-around, larger sequence number is not the most recent!  Shivkumar Kalyanaraman 11 Sequence Number Space Organization    Circular space: S1 > S2 > S3 > S1  Accidental bit errors in switch memory caused this problem in ARPANET Lollipop sequence: Start with S0, increment till you reach circle and then view it as a circular space  No ambiguity in lollipop handle Linear space: OSPFv2.  If Smax reached, expicitly delete Smax LSA before wrapping around Shivkumar Kalyanaraman 12 Topology Dissemination (Continued) Checksum field:  Drop packet if in error, get retransmission from neighbor  Age field (similar to TTL)  Number of seconds since LSA originated  Periodically incremented after acceptance  Originating router refreshes LSA after 30 min  Delete if Age = MaxAge  Low age field + large seq # => that LSA is flapping or frequently changing …  Shivkumar Kalyanaraman 13 Recovering from a partition  On partition, LSP databases can get out of synch  Databases described by database descriptor records Routers on each side of a newly restored link talk to each other to update databases (determine missing and out-ofdate LSPs) => selective synchronization  Shivkumar Kalyanaraman 14 LSA-types, Neighbor & flooding Adjacencies in Different Subnets Shivkumar Kalyanaraman 15 OSPF Router-LSA: Scenario Shivkumar Kalyanaraman 16 Neighbor Discovery & Relationship    Every OSPF router sends out 'hello' packets Hello packets used to determine if neighbor is up Hello packets sent periodically (short intervals) HelloInterval = 10s (in example) Assumes neighbor dead if no response within  RouterDeadInterval = 40s (in example)    This is also called an “adjacency”  Note that adjacency is a logical routing relationship and is more than physical connection.  It consumes bandwidth and computation resources  Becomes an issue if large number of adj need to be maintained Shivkumar Kalyanaraman 17 Neighbor …   Once an adjacency is established, trade information  Neighbor relationship is bi-directional as a result of OSPF hello packets Local topology information is packaged in a "link state announcement“ (LSA)  Multiple types of LSAs: (detail later) Initial DB synchronization  New announcements are sent ONCE, and only updated if there's a change   Or every 45mins... Shivkumar Kalyanaraman 18 Hello: Packet Format Shivkumar Kalyanaraman 19 Router-LSA: Shivkumar Kalyanaraman 20 Database Synchronization    LS Database (LSDB): collection of the Link State Advertisements (LSAs) accepted at a node.  This is the “map” for Dijkstra algorithm When the connection between two neighbors comes up, the routers must wait for their LS DBs to be synchronized.  Else routing loops and black holes due to inconsistency OSPF technique:  Source sends only LSA headers, then  Neighbor requests LSAs that are more recent.  Those LSAs are sent over  After sync, the neighbors are said to be “fully adjacent” Shivkumar Kalyanaraman 21 Recap: IP Subnet Model     Each subnet assigned one or more address prefixes. Each address prefix is called an IP subnet IP routes to subnets, not to individual hosts Two hosts on different IP subnets have to go through one or more routers.  Even if they are on the same “physical” network Shivkumar Kalyanaraman 22 IP Subnet Model (Contd)    Two hosts or routers on a common subnet can send packets “directly” to one another Two routers cannot exchange routing information directly unless they have one or more IP subnets in common All these issues will be strained as we study OSPF adjacency operation over different subnets Shivkumar Kalyanaraman 23 Broadcast Media Issues  Multiple (N) OSPF routers attached to a common subnet  Problems:  One “physical link” vs N*(N-1) “adjacencies”  How many “links” to be counted for Dijkstra algo? Shivkumar Kalyanaraman 24 Broadcast net: # links for DIjkstra    Each router is assumed to be “linked” to every other router for the purposes of Dijkstra. Hello protocol optimization:  Each node multicasts Hello to 224.0.0.5 (multicast address “AllSPFRouters”)  The Hello multicast message also indicates acks for other routers’ Hellos by listing their RouterIDs  “Link” relationship for purposes of Dijkstra maintained by each node sending a single Hello packet, instead of N packets. What about “flooding adjacencies”, I.e.,  Whom to send (flood) LSAs when a router generates or learns a new LSA?  Does it need to synchronize DBs with all nodes ? Shivkumar Kalyanaraman 25 Flooding Adjacencies : option 1  Using Router-LSAs …  O(N) Router-LSAs, with O(N2) adjacency info  Multicast of Router-LSAs does not solve O(N2) DB synchronization issue Shivkumar Kalyanaraman 26 Flooding Adjacencies: option 2  New LSA-type: Network-LSA …  O(N) Router-LSAs + 1 network-LSA+ O(N) adjacencies  Converted O(N2) adjacency problem into O(N) problem Shivkumar Kalyanaraman 27 Recap: O(N2) model  O(N) model  Question: Who creates the network-LSA? Shivkumar Kalyanaraman 28 Designated Router (DR)  One router elected as a designated router (DR)  Each router maintains flooding adjacency with the DR, I.e., sends acks of LSAs to DR  DR informs each router of other routers on LAN  DR generates the network-LSA on subnet’s behalf after synchronizing with all routers Shivkumar Kalyanaraman 29 DR, BDR … continued Backup DR (BDR) also syncs with all routers, and takes over if DR dies (typically 5 s wait)  Total: 2N – 1 adjacencies  Multicast-based optimization:  New LSAs, Hellos sent to AllSPFRouters avoids DR re-advertising new information  LSA acks sent to AllDRRouters avoids separate copies to be sent to DR and BDR  DR election:  First router on net = DR, second = BDR  RouterPriority: [0, 127] indicated in Hello packet=> highest priority router becomes DR  If network is partitioned and healed, the two DRs are reduced to one by looking at RouterPriority  Shivkumar Kalyanaraman 30 Network-LSA Example: Summary DR Shivkumar Kalyanaraman 31 What if subnet does not support broadcast?   Non-Broadcast Multiple Access (NBMA) media NBMA segments may support more than 2 routers, and allow any two routers to communicate directly, but do not support data-link broadcast/mcast capability  Eg:X.25, SMDS, Frame-Relay, ATM etc  Connection-oriented (VC-based) communication  Each VC is costly => setting up full mesh for Hellos is prohibitively expensive  Two flooding adjacency models in OSPF:  Non-Broadcast Multiple Access (NBMA) model  Point-to-Multipoint (pt-mpt) Model  Different tradeoffs… Shivkumar Kalyanaraman 32 NBMA Subnet Model       Neighbor discovery: manually configured Dijkstra SPF views NBMA as a full mesh! Most routers assigned a RouterPriority = 0 Other routers: eligible to become DRs =>  ID of all routers in the NBMA configured  Maintains VCs and Hellos with all routers eligible to become DRs (RouterPriority > 0)  Enables election of new DR if current one fails DR and BDR only maintain VCs and Hellos with all routers on NBMA  DB synchronization works same as broadcast subnet  Flooding in NBMA always goes through DR  Multicast not available to optimize LSA flooding. DR generates network-LSA just like broadcast subnet Shivkumar Kalyanaraman 33 NBMA vs Pt-Mpt Subnet Model   Key assumption in NBMA model:  Each router on the subnet can communicate with every other (same as IP model)  But this requires a “full mesh” of expensive PVCs at the lower layer!  Many organizations have a hub-and-spoke PVC setup, a.k.a. “partial mesh”  Conversion into NBMA model requires multiple IP subnets, and complex configuration (see fig on next slide) OSPF’s pt-mpt subnet model breaks the rule that two routers on the same network must be able to talk directly  Can turn partial PVC mesh into a single IP subnet Shivkumar Kalyanaraman 34 Partial Mesh F-Relay: NBMA model Shivkumar Kalyanaraman 35 Partial Mesh F-Relay: pt-mpt model Shivkumar Kalyanaraman 36 Pt-Mpt Subnet Model       Each router: single OSPF interface, but multiple neighbor relationships Note that neighbor relationships not formed to nodes to which direct PVC does not exist. Key differences:  No DRs or BDRs! Just hellos over the PVCs. Make sure that the communication is bi-directional.  I.e. Partial mesh is viewed in Dijkstra as a partial mesh. Full mesh view not forced like in NBMA model. Sometimes auto-configuration is possible. Loss in efficiency because the DB synchronization has to be done between every peer. O(n^2) if full mesh. So, in true full PVC mesh situations, it is better to operate subnet as an NBMAShivkumar Kalyanaraman 37 Hierarchical Routing Shivkumar Kalyanaraman 38 Why Hierarchy?   Information hiding (filtered) => computation, bandwidth, storage saved => efficiency => scalability Address abstraction vs Topology Abstraction  Multiple paths possible between two areas  Shivkumar Kalyanaraman 39 Hierarchical OSPF Shivkumar Kalyanaraman 40 Area Configured area ID  A set of address prefixes  Do not have to be contiguous  So a prefix can be in only one area  A set of router IDs  Router functions may be interior, inter-area, or external  Shivkumar Kalyanaraman 41 Hierarchical OSPF     Two-level hierarchy: local area, backbone.  Link-state advertisements only in area  each nodes has detailed area topology; only know direction (shortest path) to nets in other areas.  Two-level restriction avoids count-to-infinity issues in backbone routing. Area border routers (ABR): “summarize” distances to nets in own area, advertise to other Area Border routers. Backbone routers: uses a DV-style routing between backbone routers Boundary routers (AS-BRs): connect to other ASs (generate “external” records) Shivkumar Kalyanaraman 42 Sample Area Configuration 10.2.0.0/24 Shivkumar Kalyanaraman 43 Summary-LSA Example Shivkumar Kalyanaraman 44 Externals and Aggregation 1 A full ISP routing table has approximately 100K routes!  But will you do anything differently if you know all of them and have a single ISP?  Multiple ISP situations call for complex OSPF and BGP design  Never redistribute IGPs into BGP! (later…)  Redistribute BGP into IGPs with extreme care  Shivkumar Kalyanaraman 45 Externals & Aggregation 2  In an enterprise  Limit externals from subordinate domains (e.g., RIP) to be within area (area-scope)  Flood only in area 0 and in area with ASBR  Allow externals from Internet, peer domains to go outside Area 0…  Only when there will be significant path differences  Do things with defaults where possible Shivkumar Kalyanaraman 46 Type 1 and Type 2 externals  Type 2:  Default type for routes distributed into OSPF  EGP costs very different from IGP costs  Exit based on external (EGP) cost only  Type 1  Needs to be set explicitly: not default  IGP costs can be compared and summed  Selects exit based on internal + external costs Shivkumar Kalyanaraman 47 Stubbiness: A Means of Controlling Externals Shivkumar Kalyanaraman 48 Normal Areas Flood AS-external-LSAs (type 5) across areaboundaries (AS flooding scope)  ASBR-summary-LSAs (type 4) advertises location of ASBR (area flooding scope) Shivkumar Kalyanaraman  49 Stub Areas AS-external-LSAs (type 5) not flooded into stub areas  Summary-LSA flooded only optionally  Default route to ABR for all non-area prefixes  Paths may be inefficient, cannot place an ASBR in Shivkumar Kalyanaraman stub areas  50 Not-So-Stubby-Areas (NSSA) A subset of external LSAs may be flooded  Use Type-7 LSAs for such external routes  Used to import RIP domain routes and flood it externally, but keep default route for BGP routes  Shivkumar Kalyanaraman 51 IS-IS Overview  The Intermediate Systems to Intermediate System Routing Protocol (IS-IS) was originally designed to route the ISO Connectionless Network Protocol (CLNP) . (ISO10589 or RFC 1142)  Adapted for routing IP in addition to CLNP (RFC1195) as Integrated or Dual IS-IS (1990)  IS-IS is a Link State Protocol similar to the Open Shortest Path First (OSPF). OSPF supports only IP  IS-IS competed neck-to-neck with OSPF.  OSPF deployed in large enterprise networks  IS-IS deployed in several large ISPs Shivkumar Kalyanaraman 52 IS-IS Terminology Intermediate system (IS) - Router Designated Intermediate System (DIS) - Designated Router Pseudonode - Broadcast link emulated as virtual node by DIS End System (ES) - Network Host or workstation Network Service Access Point (NSAP) - Network Layer Address Subnetwork Point of attachment (SNPA) - Datalink interface Packet data Unit (PDU) - Analogous to IP Packet Link State PDU (LSP) - Routing information packet Level 1 and Level 2 – Area 0 and lower areas Shivkumar Kalyanaraman 53 Functional Comparison         Protocols are recognizably similar in function and mechanism (common heritage) Link state algorithms Two level hierarchies Designated Router on LANs Widely deployed (ISPs vs enterprises) Multiple interoperable implementations OSPF more “optimized” by design (and therefore significantly more complex) IS-IS not designed from the start as an IP routing protocol (and is therefore a bit clunky in places) Shivkumar Kalyanaraman 54 Sample comparison points   Encapsulation  OSPF runs on top of IP=> Relies on IP fragmentation for large LSAs  IS-IS runs directly over L2 (next to IP) => fragmentation done by IS-IS Media support  Both protocols support LANs and point-to-point links in similar ways  IS-IS supports NBMA in a manner similar to OSPF ptmpt model: as a set of point-to-point links  OSPF NBMA mode is configuration-heavy and risky (all routers must be able to reach DR; bad news if VC fails) Shivkumar Kalyanaraman 55 Packet Encoding  OSPF is “efficiently” encoded  Positional fields, 32-bit alignment  Only LSAs are extensible (not Hellos, etc.)  Unrecognized types not flooded. Opaque-LSAs recently introduced.  IS-IS is mostly Type-Length-Value (TLV) encoded  No particular alignment  Extensible from the start (unknown types ignored but still flooded)  All packet types are extensible  Nested TLVs provide structure for more granular extension Shivkumar Kalyanaraman 56 IS-IS LS Database: Generic Packet Format No. of Octets R Intra-domain Routing Protocol Discriminator 1 Length Indicator 1 Version/Protocol ID Extension 1 ID Length 1 R R PDU Type 1 Version 1 Reserved 1 Maximum Area Addresses 1 Packet-Specific Header Fields TLV Fields Shivkumar Kalyanaraman 57 More detailed comparison provided as a reference (not covered in class)… Shivkumar Kalyanaraman 58 Private Network to Node Interface (PNNI)  Link State Routing Protocol for ATM Networks  “A hierarchy mechanism ensures that this protocol scales well for large world-wide ATM networks. A key feature of the PNNI hierarchy mechanism is its ability to automatically configure itself in networks in which the address structure reflects the topology…” Shivkumar Kalyanaraman 59 PNNI Features           Scales to very large networks. Supports hierarchical routing. Supports QoS. Supports multiple routing metrics and attributes. Uses source routed connection setup. Operates in the presence of partitioned areas. Provides dynamic routing, responsive to changes in resource availability. Separates the routing protocol used within a peer group from that used among peer groups. Interoperates with external routing domains, not necessarily using PNNI. Supports both physical links and tunneling over VPCs. Shivkumar Kalyanaraman 60 PNNI Terminology (partial)         Peer group: A group of nodes at the same hierarchy Border node: one link crosses the boundary Logical group node: Representation of a group as a single point Child node: Any node at the next lower hierarchy level Parent node: LGN at the next higher hierarchy level Logical links: links between logical nodes Peer group leader (PGL): Represents a group at the next higher level.  Node with the highest "leadership priority" and highest ATM address is elected as a leader.  PGL acts as a logical group node.  Uses same ATM address with a different selector value. Peer group ID: Address prefixes up to 13 bytes Shivkumar Kalyanaraman 61 PNNI Terminology Shivkumar Kalyanaraman 62 Hierarchical Routing: PNNI Shivkumar Kalyanaraman 63 Source Routing Source specifies route as a list of all intermediate systems in the route. Abstracts out area hops.  Designated Transit List (DTL) Source route across each level of hierarchy  Entry switch of each peer group specifies complete route through that group  Set of DTLs and manipulations implemented as a stack  DTL example: next slide  Shivkumar Kalyanaraman 64 DTL Example Shivkumar Kalyanaraman 65 Crank back and Alternate Path Routing  If a call fails along a particular route:  It is cranked back to the originator of the top DTL  The originator finds another route or  Cranks back to the generator of the higher level source route Shivkumar Kalyanaraman 66 Summary DV Protocols: RIP, EIGRP  LS Protocols: OSPF, IS-IS, PNNI  Shivkumar Kalyanaraman 67

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Download IP: Addresses and Forwarding - RPI ECSE