[IP] IAB on VeriSign SiteFinder -- "disastrous for users"

To: ip@xxxxxxxxxxxxxx
Subject: [IP] IAB on VeriSign SiteFinder -- "disastrous for users"
From: Dave Farber <dave@xxxxxxxxxx>
Date: Sat, 20 Sep 2003 18:59:50 -0400
List-help: <http://v2.listbox.com/doc/help_sub?list_name=ip@v2.listbox.com>
List-id: <ip@xxxxxxxxxxxxxx>
List-software: listbox.com v2.0
List-subscribe: <mailto:subscribe-ip@v2.listbox.com>, <http://v2.listbox.com/subscribe/?listname=ip@v2.listbox.com>
List-unsubscribe: <mailto:unsubscribe-ip@v2.listbox.com>, <http://v2.listbox.com/member/unsubscribe/?listname=ip@v2.listbox.com>
Reply-to: dave@xxxxxxxxxx
Sender: owner-ip@xxxxxxxxxxxxxx


http://www.iab.org/documents/docs/2003-09-20-dns-wildcards.html

<http://www.iab.org/documents/docs/../../index.html> Home<http://www.iab.org/documents/docs/../../about/index.html> About<http://www.iab.org/documents/docs/../index.html> Documents<http://www.iab.org/documents/docs/../../liaisons/index.html>Liaisons <http://www.iab.org/documents/docs/../../appeals/index.html>Appeals

[]

This document contains a number of observations on the implications of theuse of wildcards in DNS zones, and makes some recommendations concerningtheir use.

The contact person for the IAB on this statement is<mailto:harald@xxxxxxxxxxxxx>Harald Alvestrand

19 September 2003

IAB Commentary:
Architectural Concerns on the use of DNS Wildcards

There are many architectural assumptions regarding DNS behavior that arenot specified in the IETF standards documents describing DNS, but which aredeeply embedded in the behavior of Internet protocols and applications.These assumptions are inherent parts of the network architecture of whichthe DNS is one component.

It has long been known that it is possible to use DNS wildcards in waysthat violate these assumptions.

Recent deployments of DNS wildcards with A records at high levels in theDNS tree have shown by experience that the cost of violating theseassumptions is significant. In this document we provide an explanation ofhow DNS wildcards function, and many examples of how their injudicious usenegatively impacts both individual Internet applications and indeed theInternet architecture itself.

In particular, we recommend that DNS wildcards should not be used in a zoneunless the zone operator has a clear understanding of the risks, and thatthey should not be used without the informed consent of those entitieswhich have been delegated below the zone.



----------



A brief primer on DNS wildcards

The DNS "wildcard" mechanism has been part of the DNS protocol since theoriginal specifications were written twenty years ago, but the capabilitiesand limitations of wildcards are sufficiently tricky that discussions ofboth the protocol details of precisely how wildcards should be implementedand the operational details of how wildcards should or should not be usedcontinue to the present day. This section attempts to explain the essentialdetails of how wildcards work, but readers should refer to the DNSspecifications ([<http://www.ietf.org/rfc/rfc1034.txt>RFC 1034] etsequentia) for the full details.

In essence, DNS wildcards are rules which enable an authoritative nameserver to synthesize DNS resource records on the fly. The basic mechanismis quite simple, the complexity is in the details and implications.

The most basic and by far the most common operation in the DNS protocols isa simple query for all resource records matching a given query name, queryclass, and query type. Assuming (for simplicity) that all the software andnetworks involved are working correctly, such a query will produce one ofthree possible results: successIf the system finds a match for all three parameters, it returns thematching set of resource records;


no data

If the system finds a match for the query name and query class but not forthe query type, it returns an indication that the name exists but no datamatching the given query type is present.


no such name

If the system fails to find a match for the given query name and queryclass, it returns an indication that the name does not exist.

Ordinarily, matches for all three parameters must be exact. This is wherewildcards come into the picture.

A wildcard record is an otherwise ordinary DNS resource record whoseleftmost (least significant) label consists of a single asterisk ("*")character, such as "*.bar.example". Conceptually, the asterisk matches oneor more labels at the left (least significant) end of the DNS name.

When wildcard records are present, the rules become more complicated.Specifically, if the query class matches, there is no exact match for thequery name, and the closest match for the query name is a wildcard, thesystem in effect synthesizes a set of resource records matching the queryname on the fly by treating the resource records present at the wildcardname as if they had been present at the query name. Thus, if the wildcardname has records matching the desired query type, the system will returnthose records, precisely as in the "success" case above; otherwise, thesystem will return an indication that the name exists but no data matchingthe given query type is present, precisely as in the "no data" case above.The response is identical to that of a normal "success" response for thequery name, so the resolver which issued the query can not tell that theresults it got back were the result of wildcard expansion.

Note that, in the case of a wildcard match, the "no such name" case cannotoccur; the wildcard match eliminates this possibility. Note also that onlythe query name and query class matter for purposes of determining whether awildcard matches: any record type can produce a wildcard match, regardlessof whether or not the record type happens to match the query type.



----------



Problems with Wildcard Records

One of the main known weaknesses and dangers of wildcard records is thatthey interact poorly with any use of the DNS which depends on "no suchname" responses. The list of such uses turns out to be quite large, andwill be discussed in some detail in a later section.

Another known weakness and danger of wildcard records stems from the factthat the wildcard label will match anything at all, so long as nonon-wildcard name within the zone is a closer match to the query name thanthe wildcard is. This doesn't sound like a major problem until oneconsiders the number of conventions and, in some cases, protocols, whichuse labels at the left (least significant) ends of the names of resourcerecords to distinguish between records associated with different services,rather than using different types of records. That is, in these cases,otherwise unrelated services use the same type of record and clients (orusers) are expected to use the name corresponding to the particular servicedesired. This applies both to the ad-hoc naming conventions described in[<http://www.ietf.org/rfc/rfc2219.txt>RFC 2219] such as www.foo.example andalso to mechanisms such as the SRV record type[<http://www.ietf.org/rfc/rfc2782.txt>RFC 2782] in which the naming schemeis part of the formal protocol. When names of this type are covered bywildcards such as an address record named *.bar.example, such a wildcardwould hand back the same address record regardless of the service nameencoded in the query name, thus ftp.foo.bar.example, mail.foo.bar.example,ntp.foo.bar.example and so forth would all end up with the same synthesizedaddress record. This problem is even worse in the SRV case, both becausenames such as _finger._tcp.foo.bar.example are part of the protocol andbecause SRV records include TCP and UDP port numbers, so the client will beconfused not only about which host it should contact but also about theport on which it should contact that host. The only way to avoid theseproblems with names of this type is to add explicit records for such namesto the DNS.

Finally, the two factors listed above ("match anything" behavior, and poorinteraction with anything that depends on "no such name" responses)interact with normal and predictable human errors to allow wildcards tohave effects far beyond their intended scope. Properly speaking, a wildcardrecord's scope is limited to a single zone, since, by definition, awildcard record never matches any name that really does exist in the zone,and thus will not match any (non-wildcard) delegation of a portion of thenamespace from a parent zone to its child. (Wildcard NS records, whiletheoretically possible, have sufficiently bizarre semantics that it isprobably best to limit their use to torture-tests of DNS software.) So, atfirst blush, it would seem that the administrator of a zone is free to usewildcards without worrying about effects which this might have on thezone's delegated children. Unfortunately, this turns out not to be thecase, because DNS names are heavily exposed in user interfaces, and users,being humans, make mistakes. So, while delegating the bar.example zone willprevent a wildcard record *.example from affecting a user who typedfoo.bar.example as foi.bar.example, it will not prevent the same wildcardrecord from affecting the same user when the error is foo.bat.example.Thus, from the users' point of view, some of the effects of wildcards doleak from a parent zone to its children. This is not a big deal if theparent and child zones are associated with a single organization, but itcan become a real problem if the parent and child zones are associated withdifferent organizations whose interests are not perfectly aligned.

The above is probably not an exhaustive list. Even after twenty years ofexperience with the DNS, the effects of unexpected uses of wildcards canstill be quite surprising, because the small but fundamental way in whichthey change the record lookup rules has a nasty way of violating implicit(or, sometimes, explicit) assumptions in deployed DNS-using software.

For these reasons, almost all use of DNS wildcards has been limited to arelatively small number of reasonably well-understood roles, and mostwildcard use has been limited to a single role: the MX records used in maildelivery.

Since MX records are only used for electronic mail delivery, wildcard MXrecords are relatively safe, and since electronic mail for any particularDNS name is generally handed by the organization that is furthest down thedelegation tree, wildcard MX records are most likely to appear in zoneswhere their effects will not cross organizational boundaries. While thelatter is not universally true, the primary use of wildcard records hasbeen and remains wildcard MX records for handling an organization's own mail.

Given these issues, it seems clear that the use of wildcards with recordtypes that affect more than one protocol should be approached with caution,that the use of wildcards in situations where their effects crossorganizational boundaries should also be approached with caution, and thatthe use of wildcards with record types that affect more than one protocolin situations where the effects cross organizational boundaries should beapproached with extreme caution, if at all.



----------



Principles To Keep In Mind

In reading the rest of this document, it may be helpful to bear in mind twobasic principles of architectural design which have served the Internetwell for many years:* The Robustness Principle: "Be conservative in what you do, be liberalin what you accept from others." [Jon Postel,<http://www.ietf.org/rfc/rfc793.txt>RFC 793]* The Principle Of Least Astonishment: A program should always respondin the way that is least likely to astonish the user. [Traditional,original source unknown]

We will come back to these points after the next section.


----------



Problems encountered in a recent experiment with wildcards

We have recently had the opportunity to observe the results of anexperiment on use of wildcards in large top-level domains, with some ratherundesirable and unintended consequences. This section attempts to detailsome of the problems that network users and operators around the worldencountered as a result of this deployment.

We must emphasize that, technically, this was a legitimate use of wildcardrecords that did not in any way violate the DNS specifications themselves.One of our main points here is that simply complying with the letter of theprotocol specification is not sufficient to ensure the operationalstability of the applications which depend on the DNS: there are protocolfeatures which simply are not safe to use in some circumstances.

The specific change which this operator chose to make was to add a singlewildcard address record at the zone apex of each of the affected zones. Asa direct result of this change, two things happened:* the authoritative servers for these two zones no longer give out "nosuch name" responses for any possible name in these zones, and* every possible name rooted in one of these zones which, until thischange, did not exist at all, now has a synthesized address record pointingat a "redirection server" run by the operator of this zone.The implications of this simple change were many and varied. The list belowis almost certainly incomplete:





Web Browsing

Web browsers all over the world stopped displaying "page not found" in thelocal language and character set of the users when given incorrect URLsrooted under these TLDs. Instead, these browsers now display an Englishlanguage search page from a web server run by the zone operator.

It should be noted that the language tags in the HTTP protocol do notalways match the locale used in the local browser. So, even though theglobal search page is dynamic and uses the information in the HTTP requestto guess what language and script is to be used -- it will never be able toemulate what the user expected. There is, in short, not enough context inthe HTTP protocol for the engine which generates the search page.

In many situations, web browsers have been written to provide someassistance to the user, often based on local conventions, directories, andlanguage, when a DNS lookup fails. All such systems are now disabled forURLs rooted under these TLDs, since DNS lookups no longer fail, even whenthe specified destination does not exist.

Even if these were acceptable changes, the new mechanism has poor scalingproperties, and unless the operator chooses to invest significant resourcesin maintaining a large, robust web server setup, the user experience isgoing to get even worse: instead of either a local language error messageor an English search page, the user is going to get "attempting toconnect..." followed by a long wait.





Email

All mail to non-existent hostnames under these TLDs now flows to theregistry operator's server, where the registry operator bounces it. Someoperators find this intolerable and have changed their mail systemconfigurations to bypass this "bounce service", but the vast majority ofmail servers undoubtedly now route mail for nonexistent names under theseTLDs to the bounce server rather than just bouncing it directly. This has anumber of ramifications:* If operators choose to allow their mail to go to the bounce server,they now have an increased mail load handling additional routing ofmessages to the bounce server; if operators choose not to allow this tohappen, they have an additional development and maintenance burdenconfiguring their servers to prevent it.* Operators who allow mail to go to the bounce server are now dependenton the performance of the bounce server. If the bounce server ever slows orfails, mail that previously would bounce will now queue at the SMTP relayfor that relay's queue time before bouncing back to the user. This createsa very poor user experience, since typographical errors that in the pastwould have bounced immediately may now go unnoticed for several days.* Operators who allow mail to go to the bounce server are alsodependent on the correct operation of the bounce server. If the bounceserver is buggy (which happened to be the case with this rollout), mail maynot bounce at all: it may be reported to the user as having been deliveredcorrectly while actually vanishing without a trace. This also creates avery poor user experience.* In some cases where the set of MX records associated with aparticular DNS name included a misconfigured record pointing to anonexistent hostname, installing these wildcard records was the last strawthat broke a misconfigured-but-functional mail configuration: previously,the nonexistent hostname would have failed to resolve and been ignored, nowit bounces.* The normal flow of data from a client in SMTP when one address has atypo is as follows:* The client looks up the IP address of his outgoing SMTP proxy inDNS.

       * The client opens a TCP connection to his outgoing SMTP proxy.
       * The client sends information about himself to the SMTP proxy.
       * The proxy accepts or rejects the client.
       * The client sends information about the recipient to the SMTP proxy.

* The proxy look up the destination in DNS, and gets "no such name"back.

       * The proxy sends information to the client that the address is wrong.
   With a wildcard for mistyped domain, the following happens:

* The client looks up the IP address of his outgoing SMTP proxy inDNS.

       * The client opens a TCP connection to his outgoing SMTP proxy.
       * The client sends information about himself to the SMTP proxy.
       * The proxy accepts or rejects the client.
       * The client sends information about the recipient to the SMTP proxy.
       * The proxy looks up the destination in DNS, and gets "success" back.

* The proxy accepts the message and closes the connection to theclient.

       * The proxy opens a TCP connection to the bounce server.
       * The proxy present himself to the bounce server.

* The bounce server indicates that the recipient address is notacceptable.* The proxy generates an error message which is sent back to thesender's email address.* A different scenario happens if the SMTP client has beenmisconfigured with the incorrect name of the outgoing SMTP proxy. As thedomain name resolves using a wildcard, the client will connect to thebounce server, and start to send mail to it. The result is that the bounceserver (at the IP address of the wildcard) says that the recipient addressis wrong even though it is in fact correct. The error presented to the useris incorrect, as it is the name of the outgoing proxy which was wrong andnot the name of the recipient.



Informing Users of Errors

Many application GUIs check domain names for validity before allowing theuser to progress to the next step. Examples include email clients thatdirectly check the domain of the email addresses resolves before sending,and network printer configuration tools that check that the print spoolername is valid before accepting the configuration. Previously the user wouldbe prompted early that they had made an error in the domain name. In thecase of email, the error may now not be noticed at the time of sending, butonly when email later bounces. In the case of the printer configuration,the error may not be noticed during configuration, but only afterwards whenprinting fails to work, where the problem diagnosis is more difficult.





Spam Filters

Installing these wildcard records broke several simple spam filterscommonly used to front end inbound mail servers, as well as more complexfiltering that checks for the existence of a sending domain in order toscreen out obviously bogus senders. This technique for spam has diminishedas this filtering mechanism has increased, but one sample operator reportsthat it still equals about 10% of inbound mail attempts on their largeshared MX cluster. ISPs who are aware of this problem will probably extendtheir filtering rules to have special knowledge of the address returned bythese wildcard records, but will have to carry the cost of doing so, bothin terms of code maintenance and increased execution time for their filtering.





Interactions with Other Protocols

The wildcard address records trap lookups for any network service, but thenumber of protocols somewhere in use on the Internet (including privateprotocols used between two or more parties on ports which they may or maynot have registered with IANA) is large enough that it simply is notpossible for the zone operator (or anyone) to provide a redirection servicefor every protocol. In this particular example, the zone operator onlyprovided handlers for HTTP (which they directed to a search page) and SMTP(which they attempted to bounce). All other protocols received at best TCPresets, or, in some cases, simply had their packets dropped. Anyapplication that uses the DNS has (or should have) some way of handling "nosuch name" errors; in almost all cases the error message is sufficientlyclear to an experienced user that it is immediately obvious when theapplication has failed because it was given an incorrect DNS name. Withthese wildcard records in place, however, incorrect DNS names which arematched by the wildcard record will not show up as DNS name errors at all,but instead will show up as mysterious connection failures or asunreachable destinations for all services that the zone operator does notredirect. Depending on the details of the application protocol andimplementation involved, this change may also convert an obvious "hardfailure" (incorrect name) into a soft failure which the application thinksit should retry, as seen above in the email case. This may result in verylong delays, perhaps of days or weeks, before even trivial errors arebrought to the user's attention. Transport protocols using UDP may alsoretry until the transport protocol retry limit is reached (especially ifICMP messages are being filtered at a firewall), which may be veryconsiderably longer than the time it would have taken to return an error tothe user indicating they mistyped the destination.





Automated Tools

Automated or embedded tools which use HTTP but which do not have a userinterface may also be confused by this change, since such tools may expectconfiguration failures to show up as DNS errors and may not realize thatthe HTTP response they have received from the zone operator's search pageis not the page which the tool expected to reach. Such tools may fail inunpredictable ways, and may not be easy to upgrade.





Charging

The current response from the service in question is just over 17 KBytes ofdata because the client has to open a TCP connection and receive a notinsignificant amount of data. A "no such data" response would have fittedin one packet. In the case of volume-based charging for Internet Access (aswith most cellular data services) the recipient will have to pay additionalcharges.





Single Point of Failure

Even for cases in which the redirection service works as intended, such aservice creates a very large single point of failure. Single points offailure are obvious targets both for deliberate attacks and for the sort ofaccidental "attacks" caused by bugs and configuration errors which alreadygenerate much of the traffic at the DNS name servers for the root zone.Furthermore, the IP address associated with this single point of failure isa likely target both for routing attacks intended to redirect the IPaddress to some other server.





Privacy

An interception service with this kind of scope raises significant privacyconcerns, since traffic received by the interception service is, prettymuch by definition, not going where its sender originally intended. Thepotential for abuse in this situation is very high, and makes theinterception service an even more attractive target, this time forattackers who wish to gain control of it in order to practice such abuse.





Reserved Names

This sort of wildcard usage is incompatible with any use of DNS whichrelies on reserving names in a registry with the express intent of notadding them to the DNS zone itself. An example of such a use is theJET-derived IDN approach of "registry restrictions" and "reserved names",which depends on the existence of names that are reserved and can beregistered only by the holder of some related name, but which do not appearin the DNS. By some readings of the current ICANN IDN policy, support forthat "reserved name" approach is required. To accomplish the goal ofreduced consumer confusion, the reserved names must not be resolvable atall. This reserved name approach appears to be completely incompatible withthis sort of wildcard usage: since the wildcard will always cause a resultto be returned, even for a reserved name which does not appear in the zone,one can support either one or the other, but not both.





Undesirable Workarounds

ISPs have responded to the deployment of these wildcards in a number ofways, all of which are both understandable and worrisome. Some ISPs havecontemplated modifying their routing systems to drop all packets destinedto the zone operator's redirection server into a black hole. Others havedeployed patches to their DNS resolvers which attempt to reverse theeffects of these wildcard records. Still other ISPs have considered usingthis as an opportunity to play the same game that the zone operator isplaying, but for the ISP's own benefit. All of these responses are bothunderstandable and predictable, but none of them are good. Even moreworrisome is that different ISPs are taking different approaches to dealingwith this, which may lead to a balkanization problem and create an ongoingheadache for anyone having to deal with cross-network DNS or applicationdebugging.



----------



Principles, Conclusions, and Recommendations

The Robustness principle tells us that in some (not all) of the problemsdetailed above, both parties could be construed as being at fault. In somecases this is hardly surprising: spam filtering in particular, by itsnature, tends to be extremely ad hoc and somewhat fragile. No doubt thereare lessons here for all parties involved.

The Principle of Least Astonishment suggests that the deployment ofwildcards was disastrous for the users. It had widesweeping effects onother users of the Internet far beyond those enumerated by the zoneoperator, created several brand new problems, and caused other internetentities to make hasty, possibly mutually incompatible and possiblydeleterious (to the internet as a whole) changes to their own operations inan attempt to react to the change.

Note that these considerations apply to any wildcard deployment of thistype. The list of problems encountered in this case clearly demonstratesthat, although wildcard records are part of the base DNS protocol, thereare situations in which it simply is not safe to use them. As noted in anearlier section, two warning flags suggesting that this type of wildcarddeployment is dangerous were that

   * it affected more than one protocol, and

* it was done high enough up in the DNS hierarchy that its effects werenot limited to the organization that chose to deploy these wildcard records.Note also that a significant component of some of the listed problems wasnot precisely the wildcard-induced behavior per se so much as it was theabrupt change in the behavior of a long established infrastructure mechanism.

In conclusion, we would like to propose a guideline for when wildcardrecords should be considered too risky to deploy, and make a fewrecommendations on how to proceed from here.

Proposed guideline: If you want to use wildcards in your zone andunderstand the risks, go ahead, but only do so with the informed consent ofthe entities that are delegated within your zone.

Generally, we do not recommend the use of wildcards for record types thataffect more than one application protocol. At the present time, the onlyrecord types that do not affect more than one application protocol are MXrecords.

For zones which do delegations, we do not recommend even wildcard MXrecords. If they are used, the owners of zones delegated from that zonemust be made aware of that policy and must be given assistance to ensureappropriate behavior for MX names within the delegated zone. In otherwords, the parent zone operator must not reroute mail destined for thechild zone without the child zone's permission.

We hesitate to recommend a flat prohibition against wildcards in"registry"-class zones, but strongly suggest that the burden of proof insuch cases should be on the registry to demonstrate that their intended useof wildcards will not pose a threat to stable operation of the DNS orpredictable behavior for applications and users.

We recommend that any and all TLDs which use wildcards in a mannerinconsistent with this guideline remove such wildcards at the earliestopportunity.



----------



Acknowledgements

The IAB gratefully acknowledges the kind assistance of David Schairer, JohnCurran, John Klensin, and Steve Bellovin for helpful suggestions and, insome cases, significant chunks of text. None of these contributors bear anyresponsibility for what the IAB has done with their contributions. We notethat Leslie Daigle recused herself from the process of producing thisdocument.



----------



IAB Contact for this Document

The contact person for the IAB on this statement is<mailto:harald@xxxxxxxxxxxxx>Harald Alvestrand.

This page is maintained by the <mailto:execd@xxxxxxx>IAB Executive Directorfor the IAB.



-------------------------------------
You are subscribed as roessler@xxxxxxxxxxxxxxxxxx
To manage your subscription, go to
 http://v2.listbox.com/member/?listname=ip

Archives at: http://www.interesting-people.org/archives/interesting-people/

Prev by Date: [IP] Quantifying SiteFinder Traffic
Next by Date: [IP] Fwd: Low-tech phones triumph.....
Previous by thread: [IP] Quantifying SiteFinder Traffic
Next by thread: [IP] Fwd: Low-tech phones triumph.....
Index(es):
- Date
- Thread