Exterior Gateway Protocols: EGP and BGPv4

Between autonomous systems, exterior gateway protocols (EGPs) distribute interdomain routing information, or (to be more precise) network layer reachability information (NLRI). The purpose of this approach is to create a loop-free view of the Internet in terms of AS paths and related path attributes. The term EGP refers to both the generic family of exterior routing protocols as well as a particular archaic protocol also called EGP, the ancestor of today's predominant signaling protocol, the Border Gateway Protocol version 4 (BGPv4).

The following subsections introduce general aspects of interdomain EGP routing and gradually concentrate on BGPv4 signaling and operation.

BGPv4: Introductory Thoughts

BGP prefix routes carry multiple attributes, in particular one AS_Path itself, for both loop prevention and administrative granularity. Because of this rich set of attributes, BGP offers extended capabilities for policy-based routing, which is of paramount importance to represent complex policies of interprovider communication. Therefore, BGP is the glue that holds the Internet together. The Internet itself essentially consists of transit autonomous systems and stub autonomous systems (as shown in Figure 10-1).

Figure 10-1. The Architecture of the Internet

graphics/10fig01.jpg

Carriers form the heart of the Internet and are classified into tier 1 (no further upstream) and tier 2 carriers that usually interconnnect at commercial exchange points (MAEs, or metropolitan-area exchanges), IXs (Internet exchanges), or NAPs (network access points). Today, these interconnection points are switched Ethernet colocation centers with frequent deployments of route servers, looking-glass access, and connectivity to the Internet Route Registry (IRR).

Neighboring Relations

Peering, upstream, and subscriber agreements govern neighborship relations. A tier 1 carrier is a telco or Internet service provider (ISP) that is at the top of the Internet telecommunication hierarchy and owns its own network cable infrastructure. These are global players such as Cable & Wireless, AT&T, Sprint, and British Telecom, just to mention a few. Tier 1s do not pay anyone for transit; they are paid to provide transit and peer with other tier 1s. Tier 2s typically buy transit from at least one tier 1, while peering with as many tier 2s as they can technically realize and afford. Tier 2s also own their network infrastructure, but they are not big enough to peer with all tier 1s.

In contrast to the IGPs we investigated, which use unicast, multicast, broadcast, and even data-link addresses (Intermediate System-to-Intermediate System, IS-IS) for communication, BGP facilitates the transport protocol TCP port 179 for reliable sessions between neighbors or peers. It is established practice to secure these TCP-connections with MD5 hashes. On UNIX systems, providing MD5 capabilities for TCP connections is a responsibility of the kernel, but such provision is still missing or in experimental stages with regard to the BGP implementations used in this book. Other approaches are the use of firewall chains on Linux or divert sockets/netgraph hooks on BSD operating systems. This communication is intrinsically connection-oriented and monitored via keepalive packets. Two BGP peers run through several steps of a finite state engine until a neighborship becomes established and messages or notifications can be passed back and forth. Then NLRI can be exchanged and ultimately a BGP table (Routing Information Base, RIB) derived.

BGP always places a single best path in the actual routing (forwarding) table. Initially, after peering establishment, the two peering routers exchange their full BGP table (flash update). Later on, only incremental updates are sent, and the related BGP table version number is incremented. The table number is an indicator of topological stability or volatility.

Limitations of IGPs

Why can't we use IGPs throughout the Internet? EGPs serve entirely different purposes than IGPs, both technically and from an administrative point of view (policy enforcement). The global routing table is approaching 130,000 prefixes and consists of myriad nodes (network elements). The increase rate of new prefixes appears to have slowed down, however, most likely due to aggregation improvements, stricter policies, Network Address Translation (NAT) deployments, and improved management. This number cannot be handled with the specialized approaches of IGPs.

IGP strengths turn into limits and weaknesses in the case of managing the vast Internet "playground"; just imagine the Shortest Path First (SPF) flooding, database maintenance and calculation burden, and complicated area topologies with Open Shortest Path First (OSPF); the Routing Information Protocol (RIP) hop-count limit would not get us very far either. However, RIP and BGP share a common approach: They are both distance-vector protocols. BGP is referred to as a path-vector routing protocol because it transports a sequence of AS numbers (ASNs) that identifies the path that the network prefix has traversed, sometimes referred to as an AS tree or path.

The essential idea of the BGP designers was that it is practically impossible to coordinate interconnected realms without a protocol that has rich capabilities to reflect and transport policies and control ingress and egress flows in terms of transit. This is the reason why BGP strongly depends on regular expressions and powerful filtering and tagging capabilities. BGP explicitly does not propagate information about the internal structure of autonomous systems. Remember that the primary design goal of the Internet and its predecessors NFSNET, ARPANET, and MILNET was dynamic recovery from link or node failure. BGP has hooks to accommodate this requirement.

BGP itself intrinsically does not load balance. However, one can tune the egress and ingress behavior to some extent to achieve what is referred to as "pseudo" load/flow balancing later in this chapter. This usually includes cooperation of your peering AS, upstream or downstream provider, or carrier. This is the art of attracting certain traffic at a certain ingress point and directing traffic to certain egress gateways.

Flavors of BGPv4

BGPv4 supports two different types of peering sessions: IBGP (Internal BGP) is used within one and the same AS, and EBGP (External BGP) is used between neighboring autonomous systems.

IBGP is used widely to configure transit autonomous systems and BGP-based Multiprotocol Label Switching (MPLS) virtual private network (VPN) architectures. In the MPLS VPN context, IBGP is referred to as Multiprotocol BGP. BGP is entirely a signaling protocol, even more than OSPF or IS-IS are; in a strict sense, it is incapable of delivering traffic within an AS solely by its own means. For this purpose, it relies on an underlying IGP and static or connected routes to actually forward traffic and resolve next hops.

EBGP is just the formal protocol used between neighboring (directly connected) autonomous systems to exchange aggregated routing information and to reflect macroscopic routing policies on an AS scale.

BGPv4 is a powerful and feature-rich protocol, but not necessarily complicated. To use it fully, you must understand regular expressions, classless interdomain routing (CIDR), and aggregation. Therefore, a complete discussion goes beyond the scope of almost any book. For this reason, the lab section of this chapter predominantly uses Zebra/Quagga and occasionally GateD for demonstration purposes. The BGP configuration of MRTd is almost equivalent, similar to the Cisco IOS architecture, and supports multiple BGP views; it also has the added benefit of being multithreaded. You will read more about BGP later in this chapter.

BGP Message Types

BGP systems use four different types of messages (see Table 10-1). During normal operation, only UPDATE and KEEPALIVE messages are exchanged. OPEN messages govern connection establishment with optional capabilities negotiation. NOTIFICATIONs gracefully terminate the BGP/TCP session in case of malformed information, errors, or manual-session resets.

Table 10-1. BGP Message Types
Message	Explanation
OPEN	Exchange connection parameters, session establishment, optional capabilities negotiation
UPDATE	Routing updates/withdrawals/replacement routes
NOTIFICATION	Handling error conditions and closing the BGP/TCP session
KEEPALIVE	BGP speaker monitoring/heartbeat

Capabilities Negotiation

As described in RFC 3392, "Capabilities Advertisement with BGP-4," capability negotiation was added to the BGPv4 protocol behavior to enable peers to negotiate certain additional capabilities, especially with the success of Multiprotocol BGP extensions. This is done via OPEN/NOTIFICATION messages, as demonstrated in Example 10-1 (highlighted text). When a BGP speaker that supports capability negotiation does not support a particular capability, it should respond with a notification error and a corresponding error subcode. This scheme was introduced to leave the UPDATE message mechanism untouched.

Example 10-1. Packet Capture to Demonstrate Capabilities Negotiation


[root@callisto:~#] tethereal -i eth0 ?V



Frame 5 (111 bytes on wire, 111 bytes captured)

    Arrival Time: May 17, 2003 10:37:28.533785000

    Time delta from previous packet: 0.000059000 seconds

    Time relative to first packet: 0.000442000 seconds

    Frame Number: 5

    Packet Length: 111 bytes

    Capture Length: 111 bytes

Ethernet II, Src: 00:60:08:6a:18:45, Dst: 00:10:5a:d7:93:60

    Destination: 00:10:5a:d7:93:60 (3com_d7:93:60)

    Source: 00:60:08:6a:18:45 (3Com_6a:18:45)

    Type: IP (0x0800)

Internet Protocol, Src Addr: 192.168.14.3 (192.168.14.3), Dst Addr: 192.168.14.1 (192.168

.14.1)

    Version: 4

    Header length: 20 bytes

    Differentiated Services Field: 0x00 (DSCP 0x00: Default; ECN: 0x00)

        0000 00.. = Differentiated Services Codepoint: Default (0x00)

        .... ..0. = ECN-Capable Transport (ECT): 0

        .... ...0 = ECN-CE: 0

    Total Length: 97

    Identification: 0x064f

    Flags: 0x04

        .1.. = Don't fragment: Set

        ..0. = More fragments: Not set

    Fragment offset: 0

    Time to live: 1

    Protocol: TCP (0x06)

    Header checksum: 0xd5f3 (correct)

    Source: 192.168.14.3 (192.168.14.3)

    Destination: 192.168.14.1 (192.168.14.1)

Transmission Control Protocol, Src Port: 34665 (34665), Dst Port: bgp (179), Seq:



    

    
    

    
    

Command Syntax Conventions
Chapter 1. Operating System Issues and Features-The Big Picture
Why UNIX Is Viable
Routing, Forwarding, and Switching Approaches
The Evolution of AT&amp;T System V (SVR4) UNIX and 4.4-Lite BSD Derivatives
Operating Systems Design Considerations
Kernel-Space Modules Versus User-Space Applications
Cisco IOS Software
OpenBSD
FreeBSD
NetBSD
Linux
GNU Hurd/Mach
Other Commercial Unices
Summary
Recommended Reading
Endnotes
Chapter 2. User-Space Routing Software
The GNU Zebra Routing Software
The Quagga Project
The routed Daemon
GateD 3.6
MRT (Multithreaded Routing Toolkit)
The Bird Project
The XORP Project
Multicast Routing Daemons: mrouted and pimd
Summary
Recommended Reading
Chapter 3. Kernel Requirements for a Full-Featured Lab
The sysctl Facility
IP Forwarding Control and Special Interfaces
Ethernet Channel Bonding
Multicast Support
Firewall and Traffic-Shaping Support
The IPv6 Protocol Stack
Summary
Recommended Reading
Chapter 4. Gateway WAN/Metro Interfaces
Dial-on-Demand Routing: Analog and ISDN Dialup
Wireless Technologies
SDH/SONET
Powerline Communications
Ethernet to the Home/Premises
Cisco Long-Reach Ethernet (LRE)
Synchronous Serial Interface and PRIs
ATM Interfaces
Cable Access (Ethernet Interfaces)
DSL Access
Lab 4-1: Synchronous Serial Connection Setup
Exercise 4-1: Frame Relay Point-to-Multipoint Setup
Summary
Recommended Reading
Chapter 5. Ethernet and VLANs
Ethernet NICs
Hubs, Bridges, and Multilayer Switches
Access Ports, Uplinks, Trunks, and EtherChannel Port Groups
Alias Interfaces
VLAN Configurations
A Few Words on Cabling
Lab 5-1: FreeBSD Bridge Cluster Lab
Lab 5-2: Linux Bridging and the Spanning Tree
Lab 5-3: OpenBSD Bridging and Spanning Tree
A Few Words on Layer 2 Security
Exercise 5-1: Linux/FreeBSD Ethernet Channel Bonding
Exercise 5-2: STP Operation
Summary
Recommended Reading
Chapter 6. The Analyzer Toolbox, DHCP, and CDP
Terminal Emulation Software
Secure Shell Tools
Protocol Analyzer
Statistical Tools
Port Scanners
socklist and netstat
Ping and Traceroute Combinations
DNS Auditing Tools
Traffic and Packet Generators
Lab 6-1: Using Sniffers-DHCP Example
Lab 6-2: UNIX CDP Configuration
Summary
Recommended Reading
Chapter 7. The UNIX Routing and ARP Tables
Address Resolution: ARP and RARP
Power of the Linux ip, netstat, and route Utilities
ARP-Related Tools
Lab 7-1: ARP Security Issues
Summary
Recommended Reading
Endnote
Chapter 8. Static Routing Concepts
Administrative Distance and Metric
Classful Routing, VLSM, and CIDR
Default Gateways, Default Routes, and Route(s) of Last Resort
Route Caches, Routing Tables, Forwarding Tables, and the ISO Context
The Near and Far End of a Link
The route Command-Adding and Removing Routes
Route Cloning
Blackholes and Reject/Prohibit Routes
Floating Static Routes
Equal-Cost Multi-Path (ECMP) Routing
Lab 8-1: Interface Metrics, Floating Static Routes, and Multiple Equal-Cost Routes (ECMP)
Linux TEQL (True Link Equalizer)
Adding Static Routes via Routing Daemons
Summary
Recommended Reading
Endnotes
Chapter 9. Dynamic Routing Protocols-Interior Gateway Protocols
Interaction with the UNIX Routing Table
Classification of Dynamic Routing Protocols
From RIP to EIGRP
Lab 9-1: RIPv2 Scenario
Lab 9-2: RIP Neighbor Granularity
Lab 9-3: RIPv2 via GateD
Introduction to Link-State Routing Protocols
OSPFv2
Lab 9-4: Leaf-Area Design Featuring GateD and Cisco IOS
Lab 9-5: Leaf-Area Design Featuring Zebra and Cisco IOS Software
ECMP-Manipulating Metric and Distance
The Art of Redistribution
Lab 9-6: Route Filtering and Redistribution
Lab 9-7: OSPF Authentication
Route Tagging and Multiple OSPF Processes/Instances
IS-IS (Intermediate System-to-Intermediate System)
Lab 9-8: IS-IS Flat Backbone Area
Lab 9-9: IS-IS Backbone and Leaf Area
Lab 9-10: OSPF Point-to-Point Lab
Advanced OSPF Features
Summary
Recommended Reading
Endnotes
Chapter 10. ISP Connectivity with BGPv4-An Exterior Gateway Path-Vector Routing Protocol for Interdomain Routing
Exterior Gateway Protocols: EGP and BGPv4
Internet Exchange Points
EBGP and EBGP Multihop
IBGP Full Mesh, Route Reflectors, and Confederation
Lab 10-1: Route Reflection
Lab 10-2: Confederation
Lab 10-3: Multi-AS BGP Topology
Lab 10-4: BGP with GateD
Avoiding Single Points of Failure
Route Server and Routing Registries
Looking Glasses
Routing Policies
Special BGP Topics
Summary
Recommended Reading
Chapter 11. VPN Technologies, Tunnel Interfaces, and Architectures
The Rationale for Tunnels in Routing Environments
The VPNC Concept of VPNs
The OSI Stack Perspective
Internet, Intranet, and Extranet Terminology
IP-IP Tunnel
Generic Router Encapsulation (GRE) Tunnel
Special Multicast and IPv6 Tunneling (RFC 2473, RFC 3053)
Cisco L2F (Layer 2 Forwarding)
PPTP (Point-to-Point Tunnel Protocol)
L2TP (Layer 2 Tunnel Protocol)
Mobile IP
User-Space Tunneling
IPSec Foundation
General Tunnel and Specific IPSec Caveats
Advice About IPSec Lab Scenarios
Road-Warrior Scenarios (Road Warrior-to-OpenBSD/FreeBSD Gateway with IKE)
Dynamic Routing Protocols over Point-to-Point Tunnels-Transparent Infrastructure VPN
Summary
Recommended Reading
Endnotes
Chapter 12. Designing for High Availability
Increasing Availability
Withstanding a (D)DoS Attack
Network HA Approaches
Simple but Effective Approaches to Server HA
DNS Shuffle Records and Round-Robin (DNS RR)
Dynamic Routing Protocols
Firewall Failover
Clustering and Distributed Architectures
The Service Routing Redundancy Daemon (SRRD)
IPv4/IPv6 Anycast
A Few Words About Content Caches and Proxies
Load Balancing
Cisco HA and Load-Balancing Approaches
VRRP
OpenBSD CARP
IRDP
Summary
Recommended Reading
Endnotes
Chapter 13. Policy Routing, Bandwidth Management, and QoS
Policy Routing
Traffic Shaping, Queuing, Reservation, and Scheduling
Linux QoS
Layer 3 QoS: IP ToS, Precedence, CoS, IntServ, and DiffServ Codepoints
802.1P/Q Tagging/Priority-QoS at the Data-Link/MAC Sublayer
MPLS Exp Field and MPLS Traffic Engineering
DiffServ and RSVP/RSVP-TE Implementations for UNIX
Cisco IOS QoS and Queuing Architectures
UNIX Firewalling Engines and Queuing
Summary
Recommended Reading
Endnote
Chapter 14. Multicast Architectures
Multicast Deployments
Multicast Addresses and Scope
Internet Group Management Protocol (IGMP) and Cisco Group Management Protocol (CGMP)
mrouted and DVMRP
The ip and smcroute Multicast Utilities
PIM Operation and Daemons
Multicast Open Shortest Path First (MOSPF)
Multicast Source Discovery Protocol (MSDP)
BGPv4 Multicast Extensions (Multiprotocol BGP, RFC 2858)
Multicast Transport Layer Protocols
Multicast Invitations and Session Announcements
Multicast Security
Summary
Recommended Reading
Chapter 15. Network Address Translation
The NAT Foundation-Basic/Traditional NAT
NAT, PAT(NAPT), Masquerading, and Port Mapping/Multiplexing
Static NAT and ARP/Routing Issues
Redirection (Port Forwarding/Relaying or Transparent Proxying)
UNIX NAT Approaches
NAT-Hostile Protocols
Future Developments: NAT-T, MPLS+NAT, Load Balancer
NAT Redundancy-Stateful Failover
Summary
Recommended Reading
Appendix A. UNIX Kernel Configuration Files
Appendix B. The FreeBSD Netgraph Facility
Reasons for Netgraph
Recommended Reading




            
        
     
        
            
                Remember the name: eTutorials.org
                
                    
                
                Copyright eTutorials.org 2008-2024. All rights reserved.