| Network Working Group | F. AUDET |
| INTERNET DRAFT | Nortel Networks |
| <draft-audet-natfwp2p-00> | C. Jennings |
| Category: Standards Track | Cisco Systems |
| Expires: December 2004 | June 2004 |
NAT/Firewall Requirements for Peer-to-Peer Media
draft-audet-natfwp2p-00
By submitting this Internet-Draft, I certify that any applicable patent or other IPR claims of which I am aware have been disclosed, and any of which I become aware will be disclosed, in accordance with RFC 3668.
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet-Drafts.
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress".
The list of current Internet-Drafts can be accessed at <http://www.ietf.org/ietf/1id-abstracts.txt>.
The list of Internet-Draft Shadow Directories can be accessed at <http://www.ietf.org/shadow.html>.
This Internet-Draft will expire in December 2004.
Copyright (C) The Internet Society (2004). All Rights Reserved.
This document defines basic terminology for describing different types of behavior for NATs and firewalls. It also defines a set of requirements for supporting Peer-to-Peer (P2P) media in NATs and firewalls. NATs and firewalls that meet this set of requirements are described as P2P capable and greatly increase [CHECK] the likelihood that P2P application will function properly.
1
Introduction
2
Terminology
3
NAT Behavior
3.1
Address and port binding
3.2
Port assignment
3.3
NAT binding timeout
4
Firewall Behavior (Filtering)
4.1
Filtering of unsolicited packets
4.2
Firewall binding timeout
5
Hairpinning Behavior
6
Application Level Gateways
7
Deterministic Properties
8
Requirements
8.1
Requirement Discussion
9
Security Considerations
10
IANA Considerations
11
IAB Considerations
12
Open Issues
13
Acknowledgements
§
Normative References
§
Informational References
§
Author's Addresses
§
Intellectual Property and Copyright Statements
Network Address Translators (NAT) and firewalls are well known to cause very significant problems with applications that carry IP addresses in the payload [5]. Applications that suffer from this problem include Voice Over IP and Multimedia Over IP (e.g., SIP [6] and H.323 [11]), as well as on-line gaming.
Many techniques are used to attempt to make peer-to-peer media applications work across NATs and firewalls. Application Level Gateways [3] are one such mechanism. STUN [7] describes a UNilateral Self-Address Translation (UNSAF) mechanism[2]. Media Relays have also been used to enable peer-to-peer media across NATs and firewalls, but it is generally seen as a solution of last resort. ICE [9] describes a methodology for using many of these techniques and avoiding a Media Relay unless the type of NAT/firewall is such that it forces the use of such a Media Relay.
This specification defines requirements for NATs and firewalls aimed at ensuring that a NAT or firewall that satisfies these requirements will avoid forcing the use of a Media Relay for supporting peer-to-peer media applications. "Peer-to-Peer (P2P) communication across middle boxes" [10] made several recommendations regarding NAT and firewalls for Peer-to-Peer media: this specification derives a lot of its requirements from that draft.
As pointed out in UNSAF [2], "From observations of deployed networks, it is clear that different NAT boxes implementation vary widely in terms of how they handle different traffic and addressing cases." This wide degree of variability is one part of what contributes to the overall brittleness introduced by NATs and makes it extremely difficult to predict how any given protocol will behave on a network traversing NATs. Discussions with many of the major NAT vendors have made it clear that they would prefer to deploy NATs that were deterministic and caused the least harm to applications while still meeting the requirements that caused their customers to deploy NATs in the first place. The problem the NAT vendors face is they are not sure how best to do that or how to document how their NATs behave. The situation is not as problematic for firewalls but still exists: there is no good common terminology even to describe the behavior of firewalls.
The goal of this document is to define a set of common terminology for describing the behavior of NATs and firewalls, and produce a set of requirements on a specific set of behaviors for "P2P Enabled" NATs/firewalls. The requirements represent what many vendors are already doing, and it is not expected that it should be any more difficult to build a NAT that meets these requirements or that these requirements should affect performance.
The authors strongly believe that if there were a common set of requirements that were simple and useful for voice, video, and games, the bulk of the NAT vendors would choose to meet those requirements. This document will simplify the analysis of protocols for deciding whether or not they work in this environment and will allow providers of services that have NAT traversal issues to make statements about where their applications will work and where they will not, as well as to specify requirements for NATs.
This specification only covers Traditional NATs [4]. Bi-directional, Twice NAT, and Multihomed NAT [3] are outside the scope of this document. Approaches using directly signaled control off the middle boxes such as midcom, UPNP or in-path signaling are also out of scope. Media Relays are out of the scope of this document as well.
This document only covers the UDP aspects of NAT/firewall traversal and does not cover TCP, ICMP, IPSEC, or other protocols.
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119 [1].
It is assumed that the reader is familiar with the terminology described in RFC 2663 [3] and RFC 3022 [4]. This specification attempts to preserve the terminology used in those RFCs.
This document uses the term "session" as defined in RFC 2663 [3]: "TCP/UDP sessions are uniquely identified by the tuple of (source IP address, source TCP/UDP ports, target IP address, target TCP/UDP Port)."
This document uses the term "address binding" as defined in RFC 2663 [3] and RFC 3022 [4]: "Address binding is the phase in which a local IP address is associated with an external address, or vice versa, for purpose of translation."
The term NAT is used to refer to both traditional address translation and address port translation. The authors understand that there was a time when these were considered different, but terminology has changed over time, and the term NAT has subsumed port translation as part of it.
RFC 3489 [7] defines a terminology for different NAT variations. In particular, it uses the terms "Full Cone", "Restricted Cone", "Port Restricted Cone" and "Symmetric" to refer to different variations of NATs/firewalls. Unfortunately, this terminology has been the source of much confusion. This terminology does not distinguish between the NAT and the firewalling behavior of NAT/firewall devices. It was found that many devices' behaviors do not exactly fit into the described variations. For example, a device could be symmetric from a firewall point of view and Cone from a NAT point of view. Other aspects of NAT/firewall are not covered by this terminology: for example, many NATs will switch over from basic NAT (preserving ports) to NAPT (mapping ports) in order to preserve ports when possible.
This specification will therefore not use the Cone/Symmetric terminology. Furthermore, many other important behaviors are not fully described by the Cone/Symmetric terminology. This specification refers to specific individual NAT/Firewall behaviors instead of using the Cone/Symmetric terminology.
Note: RFC 3489 [7] defines a "Symmetric NAT" in effectively two parts:
Condition 1 is the NAT behavior and condition 2 is the firewall behavior. However, they are not necessarily dependent: we have observed NATs that will conform to condition (1) but not to (2). Using RFC 3489, this type of NAT would be detected as a "Cone NAT" since it uses condition (2). Using a different algorithm such as the one described in NATCECK [12] which uses condition (1), it would be detected as a "Symmetric NAT". If the endpoint receiving the media has a permissive policy on accepting media, condition (2) is more appropriate, but if it has a restrictive policy, condition (1) is more appropriate.
This section describes the various NAT behaviors applicable to dynamic NAT; static NAT is outside the scope of this document.
When an internal endpoint opens an outgoing UDP session through a NAT, the NAT assigns the session an external IP address and port number so that subsequent response packets from the external endpoint can be received by the NAT, translated and forwarded to the internal endpoint. This is a binding between an internal IP address and port (IP:port) and external IP:port tuple. It establishes the translation that will be performed by the NAT for the duration of the session. For peer-to-peer applications, it is important to distinguish the behavior of the NAT when there are multiple simultaneous sessions established to different external endpoints.
The key behavior to describe is the criteria for re-use of a binding for new sessions to external endpoints, after establishing a first binding between an internal X:x address and port and an external Y1:y1 tuple. Let's assume that internal IP address and port X:x is mapped to X1':x1' for this first session. The endpoint then sends from X:x to an external address Y2:y2 and gets a mapping X2':x2' on the NAT. The relationship between X1':x1' to X2':x2' for various combinations of the relationship between Y1:y1 to Y2:y2 is critical to describing the NAT behavior. This arrangement is illustrated in the following diagram:
E
+------+ +------+ x
| Y1 | | Y2 | t
+--+---+ +---+--+ e
| Y1:y1 Y2:y2 | r
+----------+ +----------+ n
| | a
X1':x1' | | X2':x2' l
+--+---+-+
...........| NAT |...............
+--+---+-+ I
| | n
X:x | | X:x t
++---++ e
| X | r
+-----+ n
a
l
The following address and port binding behavior are defined:
The three possibilities are abbreviated as NB=I, NB=AD, and NB=APD, respectively. NB stands for Nat Binding, I for independent, AD for Address Dependent, and APD for Address Port Dependent.
It is important to note that these three possible choices make no difference to the security properties of the NAT. The security properties are fully determined by which packets the NAT allows in and which it does not. This is determined by the firewall behavior in the firewall portions of the NAT/FW.
Some NATs attempt to preserve the port number used internally when assigning a binding to an external IP address and port (e.g., X:x to X':x). A basic NAT, for example, will preserve the same port and will assign a different IP address from a pool of external IP addresses in case of port collision (e.g. X1:x to X1':x and X2:x to X2':x). This is only possible as long as the NAT has enough external IP addresses. If the port x is already in use on all available external IP addresses, then the NAT needs to switch from Basic NAT to a Network Address and Port Translator (NAPT) mode. (i.e., X1:x to X':x and X2:x to X':x'). This is referred-to as "port preservation". It does not guarantee that the external port x' will always be the same as the internal port x but only that the NAT will preserve the port if it is possible.
A NAT that does not attempt to make the external port numbers match the internal port numbers in any case (i.e., X1:x to X':x1', X2:x to X':x2') is referred-to as "no port preservation".
One of the benefits of port preservation is that tools such as network sniffers that try to identify and parse traffic based on the port number keep working. Another benefit is that it may enable P2P applications that are not capable of using UNSAF [2] techniques.
Some particularly nasty NATs use Port overloading: i.e., they always use port preservation even in case of collision (i.e., X1:x to X':x, and X2:x to X':x). These NATs rely on the source of the response from the external endpoint (Y:y, Z:z) to forward a packet to the proper internal endpoint (X1 or X2). Port overloading fails if the two internal endpoints are establishing sessions to the same external destination. This is referred-to as "Port overloaded".
Most applications fail in some cases with "Port overloaded". It is clear that "Port overloaded" behavior will result in many problems.
Furthermore, RTP/RTCP [8] are typically used on UDP to transport real-time media. The convention is that RTP streams use an even port number, and the associated RTCP stream is supposed to use the following port number. Most applications should not have any problems using an odd port for an RTP stream or an even port for an RTCP stream: however, it is more prudent to respect the convention. The sequencing convention of RTCP=RTP+1, however, is a little more problematic. In theory, some protocols do allow indicating that the normal rule does not apply so that the RTCP port may be explicitly specified. Since some protocols rely on "extensions" to the baseline protocol to indicate the non-conformance to the convention, it is very likely that a NAT will result in a failure to establish the RTCP stream.
NAT UDP binding timeouts implementations vary but include the timer's value and the way the binding timer is refreshed to keep the binding alive.
The binding timer is defined as the time a binding will stay active without packets traversing the NAT. There is great variation in the values used by different NATs.
Some NATs keep the binding active (i.e., refresh the timer value) when a packet goes from the internal side of the NAT to the external side of the NAT, but do not take into account packets from the external side of the NAT to the internal side of the NAT. This is referred-to as having a NAT refresh direction behavior of "Outbound".
Furthermore, if the binding is refreshed for all sessions on that bind by any outbound traffic, the NAT is said to have a NAT refresh method behavior of "Per binding". If the binding is refreshed only on a specific session on that particular bind by any outbound traffic, the NAT is said to have a "Per session".
Other less secure NATs keep the binding active when packets go in any direction. This is referred-to as a NAT refresh direction behavior of "Bidirectional". The NAT refresh method behavior can also be "Per session" or "Per bind".
Yet other NATs keep the binding active when a packet goes from the external side of the NAT to the internal side of the NAT, but do not take into account packets from the internal side of the NAT to the external side of the NAT. This is referred-to as a NAT refresh direction behavior of "Inbound". The NAT refresh method behavior can be "Per session" or "Per bind".
In a single combined NAT/FW device, in theory the NAT binding timeout behavior could be independent from the firewall binding timeout behavior. In practice, it is more likely that the NAT binding timeout and the firewall binding timeout behavior will be one and the same. However, when the NAT and firewall are deployed in two different devices that are serially chained, the behaviors may be different.
This section describes various firewall behaviors.
When an internal endpoint opens an outgoing UDP session through a firewall, the firewall assigns a filtering rule for the binding between an internal IP:port (X:x) and external IP:port (Y:y) tuple.
The key behavior to describe is what criteria are used by the firewall to filter packets originating from specific external endpoints.
These are abbreviated at EF=O, EF=I, EF=AD, EF=APD respectively.
See section on NAT.
If two hosts (called X1 and X2) are behind the same NAT and exchanging traffic, the NAT may allocate an address on the outside of the NAT for X2, called X2':x2'. If X1 sends traffic to X2':x2', it goes to the NAT, which must relay the traffic from X1 to X2. This is referred to as hairpinning and is illustrated below.
NAT
+----+ from X1:x1 to X2':x2' +-----+ X1':x1'
| X1 |>>>>>>>>>>>>>>>>>>>>>>>>>>>>>--+---
+----+ | v |
| v |
| v |
| v |
+----+ from X2':x2' to X1:x1 | v | X2':x2'
| X2 |<<<<<<<<<<<<<<<<<<<<<<<<<<<<<--+---
+----+ +-----+
Hairpinning allows two endpoints on the internal side of the NAT to communicate even if they only use each other's external IP addresses and ports.
More formally, A NAT/Firewall that supports hairpinning forwards packets originating from an internal address, X1:x1, destined to an external address X2':x2' that has an active binding to an internal address X2:x2, back to that internal address X2:x2. (Note that typically, X1'=X2').
Furthermore, the NAT may present the hairpinned packet with either an internal or an external source IP address and port. The Hairpinning NAT behavior can therefore be either "External source IP address and port" or "Internal source IP address and port". "Internal source IP address and port" may cause problems by confusing an implementation that is expecting an external IP address and port.
Application Level Gateways have been used to support specific protocols for Peer-to-peer media through a NAT or Firewall. However, if an ALG is "on" on a NAT, it could conflict with other techniques such as STUN [7] or ICE [9].
The diagnosis is further complicated by the fact that under some conditions the same NAT exhibits different behaviors. This has been seen on NATs that preserve ports or have a specific algorithm for selecting a port other than selecting a free one. If the external port that the NAT wishes to use is already in use by another session, the NAT must select a different port. This results in different code paths for this conflict case, which results in different behavior.
For example if three hosts X1, X2, and X3 all sent from the same port x, through a port preserving NAT with only one external IP address, called X1', the first one to send (i.e., X1) will get an external port of x but the next two will get x2' and x3' (where these are not equal to x). There are NATs where the External Nat Binding characteristics and the External Filter characteristics change between the X1:x and the X2:x binding. To make matters worse, there are NATs where the behavior may be the same on the X1:x and X2:x binds but changes on the third X3:x binding.
Some NATs try to reuse external ports flow from two internal IP to two different external IP. For example X1:x is going to Y1:y1 and X2:x is going to Y2:y2 where Y1:y1 does not equal Y2:y2. Some NATs will map X1:x to X1':x and will also map X2:x to X1':x. This works in the case where the NAT Binding is address port dependant. However some NATs change their behavior when this type of port reuse is happening. The NAT may look like is has NAT Bindings that are Independent when this type of reuse is not happening but change to Address Port Dependent when this reuse happens.
Any NAT that changes the NAT Binding or the External Filtering at any point in time or under any particular conditions is referred to as a "non-deterministic" NAT. NATs that don't are called "deterministic".
Non-deterministic NATs generally change behavior when a conflict of some sort happens, when the port that would normally be used is already in use by another bind. The NAT binding and External Filtering when there is no conflict is referred to as the Primary behavior. The behavior after the first conflict is referred to as Secondary and after the second conflict is referred to as Tertiary. No NATs have been observed that change on further conflicts but additional testing may be required.
The requirements in this section are aimed at minimizing the damage caused by NATs and Firewalls on Peer-to-peer media applications (e.g., forcing the use of Media Relays, or failure to establish RTCP streams).
However, it should be understood that Peer-to-peer media applications normally do not know in advance if the NAT or Firewall conforms to the recommendations defined in this section. Therefore, Peer-to-peer media applications still need to use normal procedures such as ICE [9] or STUN [7] to determine the course of action.
This section describes why each requirement was chosen, and the consequences of violating them:
Firewalls and NATs are often deployed to achieve some security goals. Most of the recommendations and requirements in this document do not affect the security properties of these devices, but a few of them do have security implications and are discussed in this section.
This work recommends that the timers for binding be refreshed only on outgoing packets and that inbound packets should not update the timers. If inbound packets update the timers, an external attacker can keep the binding alive forever and attack future devices that may end up with the same internal address. Some devices today do update the timers on inbound packets.
This work recommends that the firewall filters be specific only to the external IP and not the external IP and port. It can be argued that this is less secure than using [CHECK] the IP and port. Devices that wish to filter on IP and port are still compliant with these requirements.
Non-deterministic NATs and firewalls are risky from a security point of view. They are very difficult to test because they are, well, non-deterministic. Testing by a person configuring one may result in the person thinking it is behaving as desired, yet under different conditions that an attacker can create, it may behave differently. These requirements recommend that devices not be non-deterministic [CHECK THIS PARA, BE DETERMINISTIC?] .
The work requires that NATs have an "external NAT binding is endpoint independent" behavior. This does not reduce the security of devices. Which packets are allowed to flow across the device is determined by the external filtering behavior, which is independent of the binding behavior.
There are no IANA considerations.
The IAB has studied the problem of "Unilateral Self Address Fixing", which is the general process by which a client attempts to determine its address in another realm on the other side of a NAT through a collaborative protocol reflection mechanism [2].
This specification does not constitute in itself an UNSAF application. It consist of a series of requirements for NATs and firewalls aimed at minimizing the negative impact that those devices have on peer-to-peer media applications, especially when those applications are using UNSAF methods.
Section 3 of UNSAF lists several practical issues with solutions to NAT problems. This document makes recommendations to reduce the uncertainty and problems introduced by these pratical issues with NATs. In addition, UNSAF [2] lists five architectural considerations. Though this is not an UNSAF proposal, it is interesting to consider the impact of this work on these architectural considerations.
Should we discuss requirements for dealing with fragmentation of UDP packets.
The editor would like to acknowledge Bryan Ford, Pyda Srisuresh and Dan Kegel for the NATP2P [10] draft, from which a lot of the material in this specification is derived. Thanks to Rohan Mahy for many discussions on this.
| [1] | Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, March 1997. |
| [2] | Daigle, L. and IAB, "IAB Considerations for UNilateral Self-Address Fixing (UNSAF) Across Network Address Translation", RFC 3424, November 2002. |
| [3] | Srisuresh, P. and M. Holdrege, "IP Network Address Translator (NAT) Terminology and Considerations", RFC 2663, August 1999. |
| [4] | Srisuresh, P. and K. Egevang, "Traditional IP Network Address Translator (Traditional NAT)", RFC 3022, January 2001. |
| [5] | Holdrege, M. and P. Srisuresh, "Protocol Complications with the IP Network Address Translator", RFC 3027, January 2001. |
| [6] | Rosenberg, J., Schulzrinne, H., Camarillo, G., Johnston, A., Peterson, J., Sparks, R., Handley, M. and E. Schooler, "SIP: Session Initiation Protocol", RFC 3261, June 2002. |
| [7] | Rosenberg, J., Weinberger, J., Huitema, C. and R. Mahy, "STUN - Simple Traversal of User Datagram Protocol (UDP) Through Network Address Translators (NATs)", RFC 3489, March 2003. |
| [8] | Schulzrinne, H., Casner, S., Frederick, R. and V. Jacobson, "RTP: A Transport Protocol for Real-Time Applications", RFC 3550, July 2003. |
| [9] | Rosenberg, J, "Interactive Connectivity Establishment (ICE): A Methodology for Network Address Translator (NAT) Traversal for the Session Initiation Protocol (SIP)", Internet-Draft draft-ietf-mmusic-ice-00, February 2004. |
| [10] | Ford, B, "Network Address Translation and Peer-to-Peer Applications (NATP2P)", Internet-Draft draft-ford-natp2p-00, April 2003. |
| [11] | "Packet-based Multimedia Communications Systems (includes Annex C - H.323 on ATM)", ITU-T Recommendation H.323v3, September 1999. |
| [12] | Ford, B and D Andersen, "Nat Check Web Site: http://midcom-p2p.sourceforge.net", June 2004. |
| [13] | Jennings, C, "NAT Classification Results using STUN", Internet-Draft draft-jennings-midcom-stun-results-00, February 2004. |
| Francois AUDET | |
| Nortel Networks | |
| 4655 Great America Parkway |
|
| Santa Clara, CA 95054 | |
| USA | |
| Phone: | +1 408 495 3756 |
| EMail: | audets@nortelnetworks.com |
| Cullen Jennings | |
| Cisco Systems | |
| 170 West Tasman Drive MS: SJC-21/2 |
|
| San Jose, CA 95134 | |
| USA | |
| Phone: | +1 408 902-3341 |
| EMail: | fluffy@cisco.com |
The IETF takes no position regarding the validity or scope of any Intellectual Property Rights or other rights that might be claimed to pertain to the implementation or use of the technology described in this document or the extent to which any license under such rights might or might not be available; nor does it represent that it has made any independent effort to identify any such rights. Information on the IETF's procedures with respect to rights in IETF Documents can be found in BCP 78 and BCP 79.
Copies of IPR disclosures made to the IETF Secretariat and any assurances of licenses to be made available, or the result of an attempt made to obtain a general license or permission for the use of such proprietary rights by implementers or users of this specification can be obtained from the IETF on-line IPR repository at <http://www.ietf.org/ipr>.
The IETF invites any interested party to bring to its attention any copyrights, patents or patent applications, or other proprietary rights that may cover technology that may be required to implement this standard. Please address the information to the IETF at ietf-ipr@ietf.org.
This document and the information contained herein are provided on an "AS IS" basis and THE CONTRIBUTOR, THE ORGANIZATION HE/SHE REPRESENTS OR IS SPONSORED BY (IF ANY), THE INTERNET SOCIETY AND THE INTERNET ENGINEERING TASK FORCE DISCLAIM ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
Copyright (C) The Internet Society (2004). This document is subject to the rights, licenses and restrictions contained in BCP 78, and except as set forth therein, the authors retain all their rights.
Funding for the RFC Editor function is currently provided by the Internet Society.